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1. Introduction 

In technical materials, the microstructure develops during processing. Apart from rare 
cases where microstructural features persist unchanged in nature and geometry from the 
raw material to the final product (e.g., hard nonmetallic refractory particles), each 
individual processing step influences the amount, the composition and the geometric 
appearance of the constituents and of defects in a material. This is due to the fact that 
microstructures usually are far from the ideal thermodynamic and geometric equilibrium 
predicted by thermodynamic (or rather thermostatic) considerations (see ch. 5 on 
metallurgical thermodynamics and ch. 6 on phase diagrams). The usual route — casting, 
plastic forming, heat-treating — leads to microstructures vastly different from those 
obtained after powder-metallurgical production, for example. Vice versa, the mechanics 
of the individual processing techniques can be best studied by monitoring the microstruc- 
tural changes as a function of processing conditions (compare, for example, ch. 8 on 
solidification; ch. 9, §2; chs. 15 and 16 on diffusive and nondiffusive phase transfor- 
mation, respectively, or ch. 28 on recrystallization). Knowledge of the details of the 
formation of microstructures is essential in order to understand the relationships between 
processing parameters and the behaviour of materials in practical application. Since the 
most important technological properties are strongly influenced by the microstructure 
(see, for example, ch. 25 on the mechanical properties of multiphase alloys) this under- 
standing is important for the development of metallic (as well as non-metallic) materials. 

Several definitions of the term microstructure have been proposed in the literature 
(see, for example Hornbogen and Petzow [1970.1991], Schatt [1991], Hougardy 
[1981], Hornbogen [1981, 1984, 1986a, b], LOcke [1984], Metals Handbook [1985], 
Herold-Schmidt [1988] or Jeglitsch [1989]. For the purpose of this book, the 
following seems appropriate: The microstructure of crystalline materials is defined by the 
type, the structure and the number of phases, by the number, the geometric appearance 
(size, shape etc.) and the topological arrangement of the individual phase regions and 
their interfaces, and by the type, structure and geometry of lattice defects (which are in 
most cases not part of the thermodynamic equilibrium structure). The experimental study 
of metallic microstructures, and their qualitative and quantitative description is termed 
metallography. (Sometimes, this term has also been used for the preparation of ceramic 
materials and polymers for microscopic inspection. Eventually, this improper use will be 
substituted by materialography or a similar term to be newly introduced to include 
metallography, ceramography and plastography.) 

Metallography dates back to the 17th century when English, French and German 
scientists first studied metallic objects by means of simple optical devices (see, for 
example, Smith [1960], Tensi [1968] or Pusch [1979]). The birth of modem metallo- 
graphy took place 200 years later and is dated to 1863 when H. C. Sorby developed an 
incident-light microscope, or to 1865 when he first observed and described some 
microstructural elements of technical iron. Today, a large arsenal of devices and techniques 
for microstructural investigations has become available (see, for example. Metals Hand- 
book, Vol. 9 [1985] and Vol. 10 [1986], Lifshin [1992, 1994a], Cahn and Lifshin [1993], 
and the books and journals listed under Further Reading at the end of this chapter). 
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This chapter deals with the techniques of microscopy and metallography as means for 
microstructural investigation. It focuses on qualitative and quantitative methods of 
optical, scanning-electron and scanning tunneling surface microscopies with short reviews 
of the other imaging and compositional analyzing techniques. A special chapter (chapter 
11) is devoted to transmission electron microscopy, including analytical TEM. 

2 . Optical microscopy 

Metallic materials are usually opaque; therefore investigations of plane cross-sections 
by incident light prevail in metallography. However, the transparency of some metals and 
silicon to infrared light in thin sections has been effectively exploited. Optically, the 
individual components of a metallic alloy differ in their amplitude and phase characteris- 
tics. While amplitude objects become visible owing to differences in light absorption and 
thus appear in different grey shades or even colours, phase objects only differ in the 
refractive indices which cannot be recognized without additional provision. The preparation 
of cross-sections, the enhancement of contrast by etching and other methods, as well as 
the microscopic set-up must be carefully optimized for the material under investigation 
and adjusted to the purpose of the investigation in order to get maximum information 
from a microscopic study. 

2.1. Metallographic specimen preparation 

The essential steps and techniques of metallographic sample preparation are shown 
in table 1. This large variety of methods has been described in handbooks, monographs 
and review articles, e.g., by Petzow and Exner [1968], Samuels [1971], in Metals 
Handbook [1985, 1986], by Elssner and Kopp [1984], Vander Voort [1984a], 
Schumann [1990], Louthan [1992], Telle and Petzow [1992], Ash Handbook 
[1992], or Petzow [1994]. Details are discussed in a multitude of original papers in a 
variety of journals and conference proceedings (see also Further Reading at the end of 
this chapter). Though some systematic studies of the construction of metallographic 
devices (e.g., Waschull [1985], Kopp and MOller [1987] or Fundal and Gross 
[1993]) and of the consecutive steps of sample preparation (e.g.. Nelson [1989], 
Muller and Kopp [1989], Telle and Petzow [1992] or Waschull [1993]) have been 
published, successful preparation of metallographic laboratories samples is still a matter of 
skill. Since accreditation of metallographic became a major issue, systematic evaluations of 
procedures and standard documentation are necessary additions to the empirically developed 
recipes (see, for example, ROcker and Bjerregard [1993] or Wieland [1993]). In the 
following, a few of the more basic aspects of the present state of the art are reviewed. 

2.1.1. Sampling 

The location from which a specimen is taken depends on whether the investigation 
is aimed (1) to give data for a specific area (systematic sampling), e.g., if the origin of 
a failure is clearly visible, (2) to characterize a larger piece (e.g., a laboratory sample) or 
(3) to characterize the quality of a large amount of material (as in quality control). In the 


References: p. 1016. 



Table 1 

Steps of metallographic preparation (after Petzow and Exner [1968]). 


systematic 

statistic 



clamping 


electro-erosive ■ 


electrochemical - 


inorganic material - 


sawing 
disc cutting 
ultrasonic cutting 
turning 
breaking 

powder-jet cutting 

sheet electrode 

rod electrode 

acid sawing 
acid milling 
acid-jet cutting 

low-melting alloys 
sulphur, glass 
cement, plaster 
glue 


embedding - 


organic material 


natural resin 

elastomers 

duromers 


galvanic 



stamping 

vibro-engraving 


writing 

co-embedding of label 

electro-erosive 

electro-erosive 


electrolytic 

dry 


microtome cutting 
micromilling 

wet 

rotating disc 

vibrational 

chemical 

electrolytic 

chemical/mechanical 

electrolytic/mechanical 

| combined 
alternating 

manual 

electrolytic 
ultrasonic 
water jet 



air jet 



| vacuum 
heating 
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last two cases the statistical fluctuations due to unavoidable inhomogeneities must be 
considered, and usually more than one specimen is necessary to get a reliable result 
(statistical sampling). Since usually nothing is known as to the degree of homogeneity, 
statistical parameters (usually taking the arithmetic mean and the relative standard error, 
see, e.g., Petzow and Exner [1968]) should be determined from samples which are 
taken either at arbitrary or at specially defined locations. Furthermore, damaging the 
specimen during cutting it from a larger piece gives rise to erroneous results: Electro- 
erosive cutting (“spark-machining”), for example, changes the composition near the cut 
faces to an appreciable depth: e.g., 0.9 and 0.3 wt% carbon (stemming from the 
electrolyte) and 0.8 and 0.2 wt% copper (from the electrode) were found in pure iron at 
50 and 150 fim depth, respectively, below the electro-eroded surface. Careful work (slow 
and interrupted cutting) reduces the depth of influence to 10 /tm. Mechanical cutting 
(usually by water-cooled wheels) does not change the composition but introduces stresses 
to a depth of 100 /tm and more (Waver [1973], Wellner [1980], Kiessler et al, 
[1982], Vander Voort [1984a] or Telle and Petzow [1992]). In spite of the dis- 
advantages of these commonly used techniques, others, like chemical cutting by a fast- 
moving endless wire wetted by an aggressive liquid, available commercially as “acid 
saws”, are only used for special purposes (for single crystals, semiconductors, brittle 
intermetallics, etc.) because of the long cutting times needed (hours, instead of the 
minutes needed for mechanical cutting). The same considerations apply to ultrasonic 
erosion, electrochemical sectioning or laser cutting. In order to avoid artifacts, a careful 
choice of the sampling technique adjusted to the specific material and its conditions and 
a careful control of the result must be made. 

2.1.2. Mounting 

Embedding or clamping are relatively uncritical operations. Some resins reach a 
temperature up to 150°C during curing, which may lead to annealing effects in the 
specimen; others are cold-setting. Galvanic deposition of a thin copper or nickel layer 
reduces edge-rounding during preparation to an acceptable level even for oblique 
sectioning (see below). Smearing and edge-rounding of open porosity during polishing 
can be avoided by infiltration of a low-viscosity resin under vacuum or by a well-wetting 
melt (solder for metals, glass for ceramics). 

2.1.3. Grinding 

The surface of a cut cross-section usually shows a high degree of irregularity which is 
removed in successive steps of grinding with emery paper (paper covered with SiC particles 
closely graded from coarse to fine between 80 and 20 /tin, see fig. 1). Heating can be limited 
to a tolerable degree using water-cooling, but deformation of the surface is unavoidable 
(Samuels [1971], Petzow and Exner [1968], Waver [1973], Kiessler et al. [1978], 
Vander Voort [1984a], Telle and Petzow [1992], and Petzow [1994]). It was found 
empirically (Lihl and Meyer [I960]) that the deformation depth X D is a square function of 
scratch depth X s (X D = aX s - bX$, where a and b are material constants). Figure 1 shows the 
depth of scratches, the deformation depth and the total depth influenced in grinding of steel. 
In an oblique taper section, the deformed layer becomes visible after etching (fig. 2). 


References: p. 1016. 
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Fig. 1. Depth of roughness, depth of deformation and total depth of damage as a function of particle size of the 
grinding and polishing medium (after Petzow and Exner [1968]). 


2.1.4. Polishing 

In order to prepare a surface free of artefacts, the damaged layer is removed by 
polishing. While it is relatively easy to obtain scratch-free surfaces, preparation of an 
ideally undisturbed surface is difficult if not impossible by mechanical polishing. As 
shown repeatedly (e.g., by Turley and Samuels [1981] for copper) mechanically 
polished surfaces show all the signs of plastic deformation (shear-bands, recrystallization, 
subgrains, etc.) even after prolonged polishing with fine grades (< 1 /im). The depth of 
the remaining disturbance is small (< 1 /im) and is tolerable for most purposes. Chemical 
and electrolytic polishing techniques do not cause deformation. Another technique leaving 
little surface damage is ion-milling or ion-polishing (Lehtinen and Melander [1980]), 
i.e., sputtering of the surface resulting from the impact of energetic argon ions (used 
more frequently for preparation of TEM foils). Removal of the layer damaged during 
grinding by polishing is time-consuming. If sample preparation is not carefully control- 
led, deformation from the coarse grinding steps may resist and may influence not only 
the microscopic appearance (Buhler and Hougardy [1979], Samuels [1971] and 
Pokorny [1980]) but also X-ray measurements (line broadening, blurring of reflections) 
or mechanical tests like indentation hardness and toughness (Bernst [1965] and Exner 
[1969a]). Figure 3 shows the influence of various polishing methods on reflectivity, 
which is sensitive to surface damage. Investigations under polarized light suffer from 
surface strain as well. Thermal polishing , i.e., annealing in vacuo after mechanical 
polishing, was shown to produce smooth strain-free surfaces by uniform evaporation 
(Irani and Cahn [1971]). 

Disc-polishing using emulsions of carefully graded loose diamond or alumina 
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powders on cloth is the usual way of mechanical polishing. For hard materials, a drill- 
like set-up using a wooden stick with diamond paste has advantages. Material removal 
is adequate and most materials can be polished this way. Microcutting by diamond blades 
has been used as a one-step, time- and labour-saving preparation mode for soft materials. 
Such ult ramie rotoming (for references see Petzow and Exner [1968] and Klocken- 
kamper et al. [1979]) has been shown to produce very large localized deformation 
(BOckle [1964]); nevertheless, if the cutting parameters are optimized, a surprisingly 
perfect surface quality of microcut metals can be achieved (Petzow and Knosp [1973] 
and Petzow and Exner [1975]). By micromilling cross-sections and serial sections of 
medium-hard metals and alloys can be prepared (Kiessler and Elssner [1980] and 
Petzow and Exner [1975]). 

There is a nearly infinite number of recipes for chemical and electrochemical 
polishing. Reviews are to be found in early metallographic standard texts, e.g., Tegart 
[1957], Petzow and Exner [1968], Biggs [1970] and Shigolev [1974] while more 
modem references are scarce (Metals Handbook [1985], Petzow [1994]). Theoretical 
approaches are available (e.g., Wagner [1954] and Tousek [1981], for reviews and early 
references see Petzow and Exner [1968] and Biggs [1970]); however, they do not 
allow deduction of optimized polishing procedures for a given material. For electrolytic 
polishing, the form of the current density-voltage curve suggests which potentials should 



Fig. 2. Deformed layer in brass (CuZn 30) after 
cutting by a new diamond wheel. Oblique section 
(5° to surface which is on the left side). Etched. 
200 x. (From Kiessler et al. [1982].) 



Fig. 3. Reflectivity (in percent of cleavage surface) 
of metal surfaces polished by different techniques 
(after Petzow and Knosp [1973]). 


References: p. 1016. 
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be used to avoid etching effects. (Usually a plateau is observed where polishing occurs.) 
It has been shown that these curves should be measured under practical conditions and 
not in special cells (ROschenbleck and Wolter [1979]). In spite of the limited exact 
theoretical knowledge on the mechanisms of mechanical, chemical and electrolytic 
processes, preparation of metallographic cross-sections is not a limiting factor for 
microstructural investigations since the state of the art allows the investigator to deal with 
even the most difficult materials. 

2.1.5. Replica techniques 

Surface studies by transmission electron microscopy are possible by pressing a foil of 
a suitable substance tightly to the surface or to form a replicating foil by casting an 
organic replica. This technique was used extensively prior to the development of scanning 
electron microscopy for high-resolution surface studies. Sample preparation is well 
developed (see, for example, Goodhew [1973]). By the so-called double-replica 
technique, fine precipitates can be extracted from a matrix by chemical or electrochemical 
etching and embedded in a resin foil. This technique is a quick and reliable standard pro- 
cedure for investigating the morphology and the spatial distribution of dispersed particles 
and, in particular, is useful for EDX analysis of small features avoiding signals from the 
matrix (see, for example, SchrOder et al. [1990] or Czyrska-Filemonowicz et al. 
[1992]). Replicas can also be studied by optical microscopy, scanning electron microsco- 
py (§ 3) and soft X-ray microscopy (§ 5.5). For remote sampling (e.g. for large engineer- 
ing components which must not be destroyed, for taking samples at temperatures up to 
120°C or for radioactive materials), replica techniques using movable preparation 
equipment are frequently the best if not the only way for microscopic inspection (see, for 
example, Pelloux [1970], Wendler and Neubauer [1979], Birner and LOhberg 
[1980], or Lobert [1982]). Replication techniques have also been applied in optical and 
SEM fractography, e.g., for the study of stress-corrosion cracking (Conor [1972]) and fatigue 
crack initiation (Brown and Smith [1982], Dieser [1984], or Brugel etal. [1988]). Serial 
sectioning of plastic replicas is much easier than that of the sample itself, and was proposed 
for three-dimensional reconstruction of fracture surfaces (Bauer and Haller [1981]). 

2.2. Etching and other contrasting techniques 

The human observer is capable of distinguishing between different phases and lattice 
defects if these show a grey or colour contrast in microscopic viewing. Polished surfaces 
rarely provide sufficient contrast owing to the fairly similar reflectivity of metallic phases, and 
contrast enhancement is usually necessary. A number of metallographic techniques are 
available to reveal the microstructure. 

2.2.1. Chemical and electrolytic etching 

When a polished surface is attacked by an etching medium, different phases and 
different lattice orientations usually show differing rates of dissolution. Crystal imperfec- 
tions and grain boundaries are locations of increased dissolution potential. These 
differences in chemical and electrolytic attack are the basis for the most frequently used 
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metallographic techniques for optical contrast enhancement. In the monographs by 
Tegart [1957], Beckert and Klemm [1984] and Petzow [1978, 1994] or in handbooks 
as, for example, Metals Handbook [1985], known recipes for technical metals and 
alloys are reviewed. Theoretical understanding of material removal (see, for example, 
Engel [1958], Schaarwachter [1968], Biggs [1970], Herbsleb and Schwaab 
[1978]), though well-established in corrosion science, is rarely used for finding the 
optimum etching conditions for a new material as, for example, in potentiostatic etching 
(Worch et al. [1994]). Usually these are established empirically, aided by educated 
guesses. Local changes in reflectivity and shadows produced by the rough topography of 
the specimen surface give rise to grey contrast when viewed in the microscope. Thus, 
grain boundary grooves and facets, height differences between grains of different 
orientation and between phases, or etch pits at points where dislocations penetrate the 
cross-section, are typical contrast features created by attack-etching. Sometimes, deep- 
etching by chemical or electrochemical attack may favourably be used to show the 
spatial geometry of microstructural features and to make them accessible to stereometric 
measurement as shown, for example, by Paul and Murrle [1981], Feijoo and Exner 
[1990], or Feijoo etal. [1990] (see also §2.4.1). 

2.2.2. Thermal etching 

The thermodynamic instability of a polished surface will lead to effects similar to 
those mentioned in §2.2.1 when material transport is activated by heating. The basic 
mechanism is surface diffusion (rather than selective evaporation), and the kinetics of 
thermal grain-boundary grooving and facetting are well understood (see, for example, 
Mullins [1961]). Thermal etching is advantageously used for chemically stable materials 
such as ceramics (compare fig. 7c below). 

2.2.3. Ion-etching 

The basis of the well known but infrequently used technique of ion-etching, reviewed 
by Wechsung [1977], Pohl and Burchard [1980a, b], Graf et al. [1993], Pohl 
[1994], Petzow [1994], Pechenyakov and Kovacheva [1995], is cathodic atomization 
(sputtering) by bombardment of the surface with chemically neutral (e.g., argon) or 
reactive (e.g., oxygen) ions. The physics of sputtering has been surveyed by ochsner 
[1975] and by Pivin [1983], The rate of material removal depends on the atomic weights 
of the material and the ions (the highest rates being observed when these are approx- 
imately equal), on the energy and density of the ions hitting the surface, and on the 
atomic bonding in the material. By adjusting the sputtering parameters (voltage, gas 
pressure), selective material removal can be made to produce a clear topography and 
clean surfaces. Ion-etching is advantageously used for composite materials as shown in 
fig. 4 and coated metals (Stapf et al. [1986], Graf et al. [1993], Pohl [1994]). 
However, artifacts are easily produced (fig. 5). Ion etching is also useful, if the micro- 
structure is prepared for the application of surface-sensitive analytical techniques as well 
as interference-layer contrasting (see § 2.2.5). 


References: p. 1016. 
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2.2.4. Staining (tinting) and anodic oxidation 

A large number of so-called etching techniques do not produce a surface relief by 
dissolving the surface but produce a surface layer by a chemical reaction (Jeglitsch 
[1968]). These layers vary in thickness as a function of composition and orientation of 
the microstructural components. They are transparent and rather than having a specific 
colour themselves, produce interference colours varying with thickness, d. Light waves 
reflected at the surface and at the layer-substrate interface interact, causing extinction of 
a specific wavelength A e according to the equation (for normal incidence) 


where m determines the order of interference (m= 1, 3, 5 ... corresponds to 0, 1,2 ... 
order) and n is the refractive index of the deposited layer; for a more detailed discussion 
see § 2.2.5. 

Reaction layers of which the thickness varies with composition of the substrate can 
be deposited by chemical attack, by electrolytic processes, e.g. by potentiostatic oxidation 
(anodizing), or by oxidation when heating a metallic specimen in air (thermal tinting). 
Tinting techniques have been extensively discussed in the literature (see, for example, 
Jeglitsch [1968], GrOtzner and SchOller [1969], Beraha [1970], Yankovith 
[1970], Beraha and Spighler [1977], Herbsleb and Schwaab [1978], Gahm and 
Jeglitsch [1981], Gahm et al. [1982], Weck and Leistner [1982-1986], Vander 
Voort [1984a, b, 1985a], Zhou etal. [1993], Petzow [1994], and many others). Anodic 
oxidation is another possibility to produce layers with a thickness varying with orien- 
tation and composition of the microstructural features resulting in an orientation- or 
phase-specific colour contrast. Anodic oxidation is carried out by immersing the sample 
in an acid solution of carefully adjusted pH and applying a voltage in the order of 100 V. 
Intermetallic phases have been identified in this way (Seeger etal. [1990]) and the grain 
structure of aluminium can be revealed (Yang [1990]), among numerous other applicat- 



Fig. 4. Ion-etched cross section of a graphite fibre- Fig. 5. Artefacts produced by ion-etching of a Ni- 
reinforced polymer. Scanning electron micrograph, base superalloy. (Cones formed owing to the pres- 
300 x (courtesy I. Graf). ence of non-conducting inclusions). Scanning elec- 

tron micrograph, 10,000 x (courtesy I. Graf). 
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ions (Danielson [1985]). In its early days, the colour contrast obtained after depositing 
interference layers in the presence of oxygen (reactive sputtering) was also attributed to 
varying thickness (Ondracek and Spieler [1973]); in fact it depends on a different 
optical principle, as discussed in detail in the next section. 


2.2.5. Interference-layer contrast 

A plane-parallel layer of a non-absorbing or weakly absorbing substance acts like an 
optical reflection-interference filter which, by multiple reflection at the metal-layer and 
layer-air interfaces, causes contrast enhancement between neighbouring phase regions, 
provided these differ in their optical constants. The optical principles of these effects have 
been reviewed by Pepperhoff and Ettwig [1970], Zogg etal. [1977], BOhler and Hou- 
gardy [1979, 1980], BOhler [1981] or Petzow [1994], and a large number of applicat- 
ions in metallographic practice have been published (for references see BOhler and 
Hougardy [1979, 1980], Exner and Roth [1980], Gahm and Jeglitsch [1981], Graf 
[1981] and Wu etal. [1982]). The important factor in interference-layer colour contrasting 
is the phase shift of the light wave reflected at the layer-metal interface. In normal bright- 
field microscopy, differences of this phase angle are much too small for metallic phases 
to be detectable. The transparent or semitransparent layers enhance these differences 
dramatically, revealing a pronounced colour contrast if two conditions are fulfilled: 

(i) The phase condition relates thickness of the layer d, the optical properties of the 
metallic phase (phase shift of the reflected wave through the angle 8 ) and of the layer 
(refractive index n), and the order of interference (defined by m as above) to the 
wavelength A [ni0 for which interference causes maximum reduction in intensity: 

A min = Airnd/[n{m - 1) + <5]. (2) 

Usually, the first-order interference (m= 1) yields the best results. 

(ii) The amplitude condition describes the relative intensity of the reflected light 
with wavelength A min . For non-absorbing layers, 

1 - 9 ' 


intensity of reflected wave 
intensity of incident wave 



with 

q =(n- 1 )/(n + 1), 



n m and k m are the refractive index and the absorption coefficient of the metallic phase, 
respectively. Much effort has been undertaken (BOhler and Hougardy [1979], BOhler 
and Kossel [1981], Aydin and BOhler [1981, 1984], Aydin etal. [1983] and BOhler 
[1986]), to determine n m and k m values for phases occurring in metallic materials in order 
to be able to calculate the required n, A^ and d values for maximum contrast between 
phases present in a material. The contrast is defined by K=(R l -R 1 )/R i , where R { and R 2 
(R, >R 2 ) are the relative reflectivities of two phases, and reaches a maximum if R 2 = 0, 
i.e, when interference causes complete extinction of the colour under consideration in one 
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of the two phases (A"= 1). If, then, a filter for the corresponding wavelength A is used, 
this phase appears black. If white light is used, maximum colour contrast will be 
obtained. 

Plane-parallel interference layers can be deposited by evaporation or by sputtering. In 
order to fulfill the amplitude condition R^ n = 0, a non-absorbing layer must have a high 
refractive index if the substrate has a high reflectivity (as do all metallic phases). ZnS, 
ZnSe, Ti0 2 and ZnTe (n = 2.4-3 .5) are used for evaporation. Absorbing layers can be 
deposited by reactive sputtering. This technique, first described by Bartz [1973], uses 
a low gas pressure (-10 -6 bar, usually oxygen) in the sputtering chamber. A commercial- 
ly available device has proved very useful in practical application (BOhler and Hougar- 
dy [1979], Exner and Roth [1980]) and, at least in principle, allows layers to be 
produced with widely varying optical properties. A disadvantage is the fact that calcu- 
lating the optimum contrast conditions becomes somewhat more difficult than for non- 
absorbing layers (Zogg etal. [1977]). A general difficulty is that the quality of surface 
preparation before applying the interference layer is crucial for the result. For example, 
scratches or contamination by polishing liquids will produce artefacts. Ion polishing, 
therefore, is preferable compared to electropolishing or mechanical polishing techniques 
as shown, for example, for steels by Gaudig and Scheck [1984]. Also, the accuracy of 
measurement of the optical constants depends on the reproducibility of surface prepar- 
ation (Kroner [1986]). Since calculation of optimum contrast conditions can be applied 
only if the optical constants are known to an accuracy of 2% (Schroder and Hougar- 
dy [1985]), the trial-and-error approach can often be quicker and more practicable. The 
reason that interference contrasting is described here more fully than other methods is 
that this technique is still not widely familiar. It has excellent reproducibility and versatil- 
ity and could substitute many of the classical contrasting techniques. The extreme sensi- 
tivity of the human eye for colour hues and the possibility of using filters makes colour- 
contrasting highly attractive. In addition, quantitative evaluation with respect to phase 
composition and exact phase identification (see, for example, Zogg et at. [1977]) are 
possible. Care must be taken in the latter cases in photographic reproduction which may 
change the original colours appreciably (Crouse etal. [1977] and Exner etal. [1980]). 

23. Principles of light microscopy and optical contrast enhancement 

The highly developed state of the mechanical and optical design of microscopes used 
in metallographic work makes it impossible to come near to an adequate description in 
the context of a book on physical metallurgy. A large number of monographs (e.g., 
Payne [1957], Oettel [1959], Malies [1959], Freund [1960, 1969], Kinglake [1965], 
Biggs [1970], Loveland [1970], Philips [1971], Galopin and Henry [1972], Modin 
and Modin [1973], Beyer [1974], Rost [1981], Bradbury [1991], Schade [1993]) as 
well as articles in handbooks (e.g., Vander Voort [1985b], Louthan [1987], Schu- 
mann [1990], Telle and Petzow [1992] or Holik [1993]) are available which treat the 
basic as well as the practical aspects of optical microscopy and photography comprehen- 
sively. Accordingly, the optical fundamentals (e.g., wave optics, properties of lenses and 
correction for aberrations) or the various components of the optical microscope (illumi- 



Ch. 10, § 2 


Surface microscopy 


955 


nation systems and light sources, objectives and eyepieces, polarizers, interferometric 
attachments, phase contrast equipment, stages etc.) need not be discussed in detail here. 
Important features are the resolution limit, depth of focus, and the different ways to 
enhance contrast by optical manipulations. 

2.3.1. Resolution and depth of focus 

Figure 6 shows the limiting resolution (minimum distance between two points and 
maximum number of lines per unit length, seen as separate features) and the depth of 
focus as a function of the objective’s numerical aperture n sin a (where n is the refractive 
index of the medium between the objective’s front lens, e.g., 1 for air, 1.25 for cedar oil, 
and a is half the opening angle of the objective; thus, n sin a is a quantitative measure 
for the amount of light reaching the objective). The resolution limit is, in theory, 
d=0.6 A/nsina, where A is the wavelength used (Abb6 limit). For green light (A = 
500 nm) using an immersion oil between specimen and high-power objective (resulting 
in a numerical aperture n sin a = 1.25), we get d= 0.24 pm. In practice, however, the 
resolution limit is close to 1 /Am even if the illumination system as well as all the lenses 
and apertures are optimally adjusted. Magnifications between 500 and 1000 times the 
numerical aperture, i.e., up to 1000 x, are useful while higher magnifications yield no 
additional information and therefore are called empty magnification. The depth of focus 
is inversely proportional to the square of the numerical aperture and is extremely limited 
at high magnifications (approx. 0.1 /Am). 

2.3.2. Bright-field illumination 

By far the majority of microstructural investigations by optical microscopy are carried 
out with vertical illumination, usually called bright field, and most optical micrographs 
of metallic materials are taken this way. Regularly reflected light is used and no 
additional manipulations of the light beam are necessary. Thus, high light intensities are 
obtained. The instrumentation is rather simple, and the use does not require great skill. 
The contrast is a result of locally varying light intensity due to one of the pretreatments 
of the specimen section discussed above. The human eye can differentiate between two 
grey levels if the contrast K is approximately 0.2, i.e., five grey levels between black and 
white are easily distinguishable. Photomultipliers and television cameras are more 
sensitive, and more than a hundred grey levels can be registered. If higher contrast is 
needed, one of the special techniques described in the next four sections (sometimes 
improperly called “optical etching”) can be useful. 

2.3.3. Oblique illumination, dark field and stop contrast 

If the direction of the incident light is changed from vertical to oblique, the contrast 
can be reversed with a gain in contrast which, for suitable specimen surfaces, is often 
striking. This can be achieved by simply moving the condenser aperture slightly off the 
optical axis, which produces a shadow-like contrast. While such oblique illumination is 
only applicable for low magnifications (long-working-distance objectives), a very useful 
alternative is dark-field illumination'. The light from the light source does not pass 
through the objective but is reflected to the surface by a ring-shaped mirror or lens 
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Fig. 6. Theoretical resolution (resolved lines per mm and distance between two distinguishable points) and 
depth of focus as a function of the objective’s numerical aperture (theoretical for green light). 


around the objective so that only stray (diffusely reflected) light reaches the objective. 
Rough surfaces, fissures, pores, grain boundaries and other surface irregularities are 
revealed, appearing bright on a dark background. This technique also lends itself for 
checking the quality of polish since scratches clearly show as bright lines. Opaque-stop 
microscopy produces images similar to dark field. Instead of changing the illumination, 
a ring stop is placed between the light source and the condenser lens. By moving the 
stop, different areas may be illuminated. Tilt angles with respect to flat portions of the 
specimen surface have been measured, for example during studying tilt and twist 
boundaries (Biggs [1970]). This is possible to a high degree of accuracy (depending on 
magnification, of the order of T to 60' of arc). For qualitative inspection, opaque-stop 
microscopy provides a sensitive type of dark-field contrast at no loss in resolution. 

2.3.4. Polarized-light microscopy 

Plane-polarized light (produced by placing a polarizer in front of the condenser lens) 
vibrates in one plane only. When reflected from an optically isotropic surface, the direct- 
ion of polarization does not change and will be transmitted by an analyzer placed behind 
the eyepiece and set parallel to the polarizer. If the analyzer is rotated, the transmitted 
light intensity is reduced and ideally drops to zero at crossed position of polarizer and 
analyzer. If the plane-polarized beam is reflected by an optically anisotropic surface it is 
sub-divided into two components vibrating at right angles to each other. The intensities 
of the components vary as a function of crystallographic orientation and its relation to 
the plane of polarization. Therefore, the amount of light transmitted by the analyzer is a 
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function of the orientation of a crystal which causes changes in brightness ( degree of 
extinction ) when the specimen stage is rotated or, for a polycrystalline material, for the 
various crystals when viewed with crossed polarizers. Furthermore, some optically 
anisotropic substances (e.g., nonmetallic inclusions such as cuprous oxide) show 
distinctly different tints in white polarized light due to an optical effect called reflection 
pleochroism, owing to a variation of reflectivity with wavelength or degree of extinction. 
The tint and its change when rotating the analyzer or the stage are characteristic for such 
materials. 

Polarized light is particularly useful in metallography for differentiating between 
optically isotropic and anisotropic components of the structure and for revealing the grain 
structure and twins in anisotropic metals and alloys, such as Zn, Mg, Ti and V. Though 
it is difficult to achieve a polished surface which is wholly strain-free, some metals 
which are hard or impossible to etch can be effectively examined (Irani and Cahn 
[1971]). Even the cubic crystals can become optically active if etch pits or grooves are 
produced by etching (for references see Phillips [1971]) or by coating with an aniso- 
tropic film e.g., by anodizing or by other epitaxially grown films. Comprehensive 
reviews on the use of polarized reflected light as an aid in metallography (and mineral- 
ogy) are available (Conn and Bradshaw [1952], Hartshorne and Stuart [1952], 
Mott [1952], Phillips [1971], Galopin and Henry [1972], Modin and Modin [1973], 
McCrone etal. [1978], Schumann [1990], Telle and Petzow [1992], among others). 
The quantitative use of polarized light has been restricted mainly to transmitted light in 
the fields of petrography and biology, though various measurements are possible in 
reflection on metals as well. A special application is the imaging of magnetic domains 
in metal crystals making use of the Kerr effect, reviewed by Bowman and Booth 
[1971]. The contrast which results from a rotation of polarization direction by only 1' to 
20' of arc can be improved by interference layers (see § 2.2.5). 

2.3.5. Phase contrast and interference contrast 

Before some of the modem sample preparation techniques (e.g., interference-layer 
deposition or reproducible electrolytic etching) were fully developed, optical techniques 
transforming phase-angle or height differences into grey or colour contrast gained some 
interest in metallography (Jeglitsch and Mitsche [1967], Beyer [1974], Modin and 
Modin [1973], Vander Voort [1985b], Schumann [1990], Telle and Petzow [1992] 
or Holik [1993]). Though they are developed more than 30 years ago, they are now 
available with some higher-priced metal microscopes, and have been described in detail 
in numerous publications (e.g., Oettel [1959], Jeglitsch and Mitsche [1967] and 
Beyer [1974]) their applications in studying metallic microstructures have remained few. 

Phase contrast (used extensively for transmitted-light studies in biology, see, for 
example, Yamamoto and Taira [1983]) transforms the invisibly small phase-angle shift 
caused by a small difference in height (or in optical properties) of an object and its 
surrounding into an amplitude (light-intensity) difference visible to the human eye. This 
is effected by retarding (positive phase contrast) or accelerating (negative phase contrast) 
a portion of the directly reflected wave by half a wavelength by inserting a phase 
platelet. Interference of this modified reflected wave from the phase object with the 
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unmodified wave yields a grey contrast, i.e. the image of the phase object, in the field of 
view. Extremely small height differences (1-5 nm) become visible whereas in bright field 
a step height of approximately 100 nm is necessary for detection. 

Interference contrast uses polarized light and a prism arrangement which splits the 
polarized light into two beams of equal intensity, the directions of which diverge by a 
small angle. These two beams hit the surface at slightly different positions and, after 
reflection, interfere with each other after passing a crossed polarizer. Several technical 
arrangements are possible; those used in practice are the differential interference contrast 
due to Nomarski (generally known as Nomarski contrast) and Michelson interference 
(Conn and Bradshaw [1952], Jeglitsch and Mitsche [1967], Phillips [1971], 
Rosenberger [1977] or Holik [1993]). If height differences exist at a specimen surface, 
the different levels appear in different colour hues which can be changed by shifting the 
prism. Jeglitsch and Mitsche [1967] have demonstrated applications of interference 
contrast with steel and cast iron as examples; other applications covering a multitude of 
materials are interesting as well (Benesovsky and Ihrenberger [1965], Spiess [1965], 
Beyer [1974] and Rosenberger [1977] or Schumann [1990]). 

2.3.6. Filters 

By the appropriate choice of illumination wavelengths the contrast between differently 
coloured constituents may be enhanced. Alternatively to coloured glass, a movable-wedge 
interference filter allows monochromatic light of any wavelength to be selected. The con- 
trast obtained is especially suitable for black and white photography (where green light 
is usually used) and for automatic image-analyzing equipment with a black and white 
camera as the detector. 

2.4. Special optical devices and accessories 

2.4.1. Stereomicroscopy 

Optical stereomicroscopes (see, for example. Watts [1982]) are useful for rough 
surfaces up to 100 x magnification. A recent version allows vertical as well as 45° side 
viewing without tilting the object. Today, the larger depth of focus and wide range of 
magnification available with scanning electron microscopy has nearly completely 
replaced optical stereomicroscopy for studies of rough surfaces. Optical stereomicro- 
scopes have kept their place in the metallographic laboratory for quick inspection and 
quality control purposes. If a higher resolution or a larger depth of focus than available 
with optical stereomicroscopes is needed, scanning light macrography or confocal 
microscopy (see §2.4.2) and particularly scanning electron microscopy (which is 
presently clearly the best choice, see § 3.4.1) can be used. 

2.4.2. Laser scanning and confocal microscopy 

The light scanning microscope dates back to an invention of Young and Roberts 
[1951] called the flying spot microscope. A similar principle is used in scanning light 
macrography developed in 1968 and applied for imaging of fracture surfaces with a high 
depth of field (Clarke [1987]). With the development of small gas lasers producing 
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monochromatic and virtually ideal parallel light rays, a light spot of high intensity can be 
focussed on tiny areas of a polished or rough surface. The first laser scanning micro- 
scopes (Brakenhoff et al. [1979], Sheppard [1987]) were soon followed by a 
commercial instrument built by Carl Zeiss, first in 1982. A major advantage is the 
application of the confocal principle first described and patented in 1961 (see Minsky 
[1988]). By reducing the depth of focus to a very small dimension with an effective 
depth resolution of 50 to 20 gm, rough surfaces can be optically sectioned plane by 
plane. The consecutive images can then be put together to produce a micrograph with a 
high depth of resolution orders of magnitude larger than in the conventional microscope. 
The information can also be stored in a computer and, using available software, can be 
used for a three-dimensional reconstruction of rough surfaces. The optical principles and 
instrumental design have repeatedly been discussed (see, for example, Wilson and 
Sheppard [1984], Boyde [1985, 1990], Vander Voort etal. [1989], or Turner and 
Szarowsky [1993]). Applications of the laser scanning microscope in materials science 
and other fields have recently been reviewed by Elssner etal. [1991] and Turner and 
Szarowsky [1993]. 

2.4.3. Scanning near-field optical microscopy 

As discussed in § 3 and 4, the scanning principle is used in many ways to increase 
the resolution up to the atomic level. With the upcoming of nanoscale science and 
technology, nano-optics, i.e., microscopes which collect light from extremely localized 
sources by fibre optics and produce images with a resolution in the range of 20-100 nm 
with a theoretical limit of approx. 5 nm, were developed. Though the principle idea dates 
back to 1928, the systematic development started only in the 80’s resulting in the so- 
called scanning near-field optical microscope (SNOM) which can be regarded as a 
member of the super-resolution microscopes (Wickmarasinghe [1992a,b]) described in 
§4. The resolution of this new type of microscope is not limited by the Abb6 limit, i.e., 
it is not determined by the wave length of light but by the size of the probe aperture. A 
review on near-field microscopy, nano-optics and SNOM was recently published by 
Pohl [1992], 

2.4.4. High-temperature microscopy 

Heating stages for high-temperature optical microscopy need a long-working-distance 
objective (e.g. a reflecting objective) that will offer the necessary oxidation protection. 
They are also on the way to obsolescence since stages for scanning electron microscopy 
are now available and used much more often than optical hot stages. Reviews of earlier 
work which often resulted in important new information on the transformation behaviour 
of metallic alloys, on melting and on other kinetic processes have been published by 
Reinacher [1965], Lozinsku [1961], Mitsche etal. [1964,1969], Modin and Modin 
[1973], Kulmburg et al. [1974], Metals Handbook [1985] or Schumann [1990]. 
Besides the technical problems which now seem under better control (Muggli and Me 
Crone [1977], Hoekstra and Bruis [1979]), the question remains unsettled whether 
processes observed on the polished surface take place in the bulk qualitatively or 
quantitatively in the same way. 
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2.4.5. Television cameras 

First attached to the optical microscope for demonstration purposes, television 
cameras, as was soon realized by an English company, could be adapted for quantitative 
analyses of microstructural images by adding a voltage threshold and a few electronic 
components. This started a rapid development of quantitative television microscopes. The 
modem versions are now the most powerful automatic devices for image analysis, details 
of which will be discussed below (§7.1). 

2.4.6. Microphotometry and ellipsometry 

Only twenty years ago, microscope photometers were introduced in metallography 
after having been used for some time in ore microscopy. Several highly sensitive 
commercial instruments are now available which use photomultipliers to register the 
intensity of reflected light (usually compared to a standard). Identification of small 
microstructural components (e.g. nonmetallic inclusions), detection of surface damage, 
orientation and concentration analysis, and sensitive detection of phase boundaries in 
quantitative image analysis were prominent early applications (Mitsche and Scheidl 
[1964] and Petzow and Knosp [1973]). Measurement of optical constants, e.g., for 
contrast calculations in interference-layer contrasting, have become an important domain 
of photometry (see § 2.2.5). Differential reflectometry is an interesting but seldomly used 
surface analytical technique filling the gap between X-ray diffraction and the ion- and 
electron probes discussed in § 6. Numerous applications (study of ordering in solid 
solutions, of corrosion phenomena and of lattice defect) have been reported by Hummel 
[1983]. Another sensitive reflection technique is ellipsometry (see, for example, Visscher 
[1973] and Azzam and Bashara [1977]) which uses the phase shift of linearly polarized 
light for unambiguous determination of optical constants and for following the growth 
kinetics of thin surface films down to atomic dimensions. 

2.4.7. Interferometry 

One of the most useful techniques for measuring the height of surface steps and other 
topographic features with high resolution (routinely 10-100 nm in reflection) is inter- 
ferometry (Keller [1977]). For double-beam interferometry, the same set-up as for 
interference contrast (§2.3.5) is used. The prism arrangement is set in such a way that 
interference of the two beams after reflection at an oblique plane surface causes a linear 
sinusoidal fluctuation of brightness which to the human eye appears as a series of 
parallel black stripes. If the surface is not plane, the stripes are distorted or set off (at 
steps). The width of the stripes can be reduced by a more difficult multiple-beam 
interference arrangement and their distance from each other, corresponding to half the 
wavelength in height, can be suitably chosen by adjusting the prisms. (For interference 
contrast as described in § 2.3.5 infinite stripe distance, i.e., only one brightness minimum, 
is used for illumination.) A large variety of microinterferometers are available (see, for 
example Phillips [1971], Modin and Modin [1973], Beyer [1974] and Keller [1977]). 
By measuring stripe distances and set-offs, height differences are determined. Also, 
angles of surface tilt can be measured with high accuracy. Among the applications in 
metallography have been studies of crystal growth kinetics (e.g., Nanev [1981]), of 
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grain-boundary grooving (Mykura [1955]) and of surface films and surface roughness 
(for references see Beyer [1974]). A recent application of interferometry is the character- 
ization of targets for inertial-confinement fusion (Foreman et al. [1993]), and special 
devices have been developed for this purpose. 

2.4.8. Microhardness 

As a local mechanical test, indentation hardness measurements using low loads 
(5-500 mN) are carried out in microscopically small regions. Micrometer eyepieces or, 
more often, specially designed arrangements which keep the image and the scale sharp 
simultaneously are used. Detailed specifications for microhardness testing are found in 
literature (Mott [1956], BCckle [1965], Semlitsch and Bergmann [1969], Gahm 
[1969], Dengel [1973], Mornheim [1977], Schumann [1990], Tate [1993], among 
others). Microhardness indentations are often used as markers for measuring distances 
and for accessing surface deformations induced subsequent to the indentations. More 
recently, ultra-hardness testing with loads in the range of 0.5 to 5 mN was developed. 
However, the resolution of optical microscopy is at its limits as the indentation size drops 
below 1 /j,m and the scanning electron microscope (see, for example, Zahn [1992]) or 
depth measurements (as with instruments called mechanical microprobe or nanoindenter; 
see, for example, Oliver [1993]) must be used. 

3. Scanning electron microscopy 

One of the most versatile instruments for microstructural investigations is the 
scanning electron microscope (SEM). Compared to the optical microscope it not only 
expands the resolution range by at least one order of magnitude (useful magnifications 
beyond 10 4 x) and the depth of field by two orders of magnitude (ranging from 1 /j,m at 
10 4 x to 2 mm at 10 x) but also offers a series of possibilities for image formation which 
are more or less easy to interpret and reveal a clear picture of plane cross-sections as 
well as three-dimensional surfaces such as, for example, fracture surfaces, deep-etched 
or corroded surfaces and surfaces of porous materials. 

3.1. Basic features of scanning electron microscopy 

Excellent monographs and handbook articles on the physical fundamentals, the 
various techniques, the instrumental details and the application of scanning electron 
microscopy are available (e.g., Seiler [1968], Heywood [1971], Hearle et al. [1972], 
Oatley [1972], Reimer and Pfefferkorn [1977], Loretto [1984], Exner [1985], 
Verhoeven [1986], Newbury et al. [1986], Gabriel [1987, 1992a], Chescoe and 
Goodhew [1990], Lyman [1990], Lyman et al. [1990], Joy [1992], Wells [1993], 
Lifshin [1993b], Goldstein et al. [1994]), and a large number of various types of 
commercial instruments are in daily use in almost every materials research and develop- 
ment laboratory around the world. 

The principle of the scanning electron microscope is rather simple: An electron beam 
scans the surface of the sample in the same way as in a cathode-ray tube which is used 
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to display the image. The electrons are emitted usually from a heated tungsten cathode. 
For higher electron currents (about 10 to 100 times that of a W electron gun) yielding a 
higher brightness, a lanthanum hexaboride (LaB 6 ) cathode may be used. The beam is 
focussed at the surface to a small diameter (approx. 10 to 100 nm). More recently, field- 
emission cathodes became commercially available which provide high-resolution images 
by reducing the probe diameter further and increasing the brightness by another 100 
times needing a higher vacuum (10' 7 Pa as compared to 10" 5 Pa for the thermionic guns). 
The acceleration voltage ranges between 1 and 50 kV while the current through the 
surface is in the order of 10 pA. 

The magnification is increased by reducing the current in the deflection coils; the 
normal range is 1-50 000 x. The SEM image is written on a high-quality screen (cathode- 
ray tube) with an image resolution typically of 0.1 mm which, at 10000 times magnif- 
ication, corresponds to 10 nm resolution on the specimen. The brightness of each picture 
point is determined by the detector signal which is of the order of a few pA and must be 
amplified by direct electron multiplication or by conventional amplifiers. Noisy back- 
ground due to high amplification is the major cause of unclear images. The dramatic 
improvement of image quality and resolution during the three decades of commercial 
development is due to improvement of all components of the microscope (more sensitive 
detectors, more powerful cathodes and more precise scanning devices) and to noise 
reduction in all electronic devices. 

Figure 7 shows the interaction between the primary electron beam and the sample: 
Various kinds of radiation are emitted which, when collected in a suitable detector, can 
be used for image formation. Table 2 gives some data important for imaging metallic 
microstructures. By far the most popular techniques are the secondary electron mode (SE) 
revealing surface topography and, in advantageous situations, also atomic-number and 
crystal-orientation contrast with high resolution and information from a small depth 
below the surface, and the backscattered electron mode (BE) which gives topographic as 
well as pronounced materials contrast from a much larger depth and width and therefore 
reduced resolution. Figure 8a shows a typical SE image, fig. 8b a typical BE image of 
polished metal surfaces. Detector electronics can be adjusted so as to respond to either 
SE or BE electrons with their quite different energy distributions. 

Back-scattered electrons (BE) are produced by single large-angle as well as by 
multiple small-angle elastic scattering events. In the 10-20 kV range of incident beam 
voltage, approximately 50% of each type leave the surface with a wide spectrum of 
energies in a range higher than 50 eV. As the atomic number of the material hit by the 
incident beam decreases, a smaller number of electrons are back-scattered and more 
energy is lost. In materials with high atomic number, a large number of electrons is 
back-scattered by atoms close to the surface, with little change in energy. Thus, yield, 
energy spectrum and depth of escape of back-scattered electrons are directly related to 
the atomic number of the material (fig. 9, see also § 3.3.2). 

Low-energy secondary electrons (SE) are formed by interaction of the primary 
electrons with loosely bound atomic electrons. The energy distribution of secondary 
electrons depends on the primary energy of incident electrons, the number of outer-shell 
electrons, the atomic radius and, most pronouncedly, on the surface barrier of the 
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Fig. 7. Types of electron-beam-excited electrons and radiation (schematic) and depth of information in the 
scanning electron microscope. (The fluorescence emission can come from as deep as 100 p.m.) 


material. The energy of secondary electrons is in the range of 0 to 10 eV with a 
pronounced maximum at around 3 eV. At 50 eV the frequency of SEs approaches zero, 
i.e., there is no overlap with the energy spectrum of back scattered electrons. However, 
high-energy BEs excite SEs as well which contributes to noise and decreases lateral as 
well as depth resolution (see also §3.3.1). The probability that low-energy secondary 
electrons will escape from the surface decreases exponentially with the depth of their 
generation. More than half of the total yield is emitted within a depth of about 0.5 nm. 
The yield is not strongly dependent on atomic number (for Au it is higher only by a 
factor of two than for C, see fig. 9). The main factor for secondary electron yield is the 
angle between incident beam and sample surface. Since the envelope of the excited 
volume (see fig. 7) moves closer to the surface when the beam hits the surface at a small 
angle, SE yield increases. Thus, variations of surface inclination cause pronounced 
changes of SE yield (fig. 9). This fact is used for obtaining information on surface 
topography (§3.3.1). 

A series of techniques are available to modify the signals in order to obtain better or 
additional information, as, for example, black-level suppression (i.e. differential amplification 
which distributes the contrast over the full range of the cathode-ray tube or the photographic 
film) or nonlinear amplification (contrast enhancement by preferential contrast expansion at 
either end of the gray scale, improving the visibility of features in otherwise dark holes). In 
Y-modulation , the CRT (cathode-ray tube) beam is deflected proportionately to the detector 
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Physical effects producing radiation, detector types and detected signals used for imaging and analyzing metal surfaces in the scanning electron microscope. 


Detected signal 

Type of detector 

Information 

Basic effects 

Maximum 

resolution 

Minimum depth of 

information 

Remarks 

Secondary elec- 
trons (SE) 

Scintillator photo- 
multiplier with 
Faraday cage 

Surface topography, 
material contrast, crys- 
tal orientation constrast 

SE yield depends 
strongly on surface 
tilt and weakly on atomic 
number and crystal 

orientation 

5-20 nm 

1-2 nm 

Background due to SE excited by BE 
reduces resolution and enlarges depth of 
information. Material contrast can be sup- 
pressed by superimposing the inverted BE 
signal 

Back-scattered 
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Fig. 8. Typical images produced in the scanning electron microscope (reduced in size for printing): (a) Topographic 
contrast by secondary electrons, aluminium-12 wt% silicon (aluminium matrix deep-etched). 300x. (From 
Paul and MOrrle [1981].) (b) Atomic-number (material) contrast by backscattered electrons, silver- 
matrix/nickel-fibre composite. 1080x. (From MOrrle el al. [1980].) (c) Topographic contrast by secondary 
electrons, alumina- 10 wt% zirconium oxide ceramic, thermally etched. The black line is a contamination trace 
of the line scan which produces the intensity profile shown by the white line (y-modulation). 3600 x. (From 
Bauer and Haller [1981].) (d) Secondary electron image revealing material roughness contrast. (The rough 
aluminium matrix appears brighter than the smoother silicon particles, due to polishing scratches.) Alumi- 
nium-12 wt% silicon alloy. 600x. (e) High-resolution- backscattered electron image of partially stabilized 
zirconium oxide. (The martensitically transformed platelets are normal to the surface). 9000 x. (0 Electron 
channelling pattern of tungsten grains in a tungsten-10 wt% nickel heavy metal. (From Takajo [1982].) 





966 


H. E. Exner 


Ch. 10, §3 



Fig. 9. Backscattering coefficient and secondary electron yield as a function of atomic number of the material 
hit by the electron beam (normal incidence), and as a function of inclination of surface with respect to the 
incident beam (schematic, after Goldstein and Yakowitz [1977]). 


signal (fig. 8c) allowing detection of low contrast not apparent to the eye in the intensity- 
modulated image. If one of these signal transformations is used, its limitations and defi- 
ciencies must be considered in interpreting the image, otherwise artefacts may provide 
erroneous information. Some practical aspects of the more frequently used techniques for 
investigation in the materials laboratory have been reviewed by Hillmer [1979a], Exner 
[1985], Lyman et al. [1990] or Gabriel [1990b], Goldstein et al. [1994], among others. 

3.2. Specimen preparation 

The major advantage of scanning electron microscopy is that in many cases little 
effort is necessary for specimen preparation. Practical experience has been reviewed for 
example by Newbury and Yakowitz [1977a], by Hillmer [1979b], Exner [1985] or 
Newbury et al. [1986], and, more recently, by Gabriel [1992b] and Goldstein et al. 
[1994], Clean, deformation-free cross-sections and clean fracture surfaces of metallic 
materials can be investigated directly. Non-conducting materials must be coated by a thin 
conducting layer (10-100 nm, usually gold applied by sputtering or carbon and metals 
applied by high-vacuum evaporation) to avoid charging effects. Oxides caused by long- 
time exposure to the atmosphere or by high-temperature reaction can be removed 
electrolytically (Yuzawich and Hughes [1978]) or by hydrogen reduction (Madeski 
[1980]). Cross-sections are prepared in exactly the same way as for light microscopy but 
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care must be taken in cleaning since residual polishing liquids or etchants trapped in 
pores or cracks cause contamination of the surface when putting the sample into the 
vacuum chamber. Organic mounting resins and any other hydrocarbons (grease etc.) must 
be removed in order to keep contamination by carbon and cracking products as small as 
possible. The shape of microstructural features in multiphase alloys can advantageously 
be studied after deep etching to remove some of the matrix metal (see, for example, 
McCall [1973], Fremunt et al. [1980], Madeski [1980], Hun-De and Jing-Yun 
[1980], or Paul and MBrrle [1981]). Figure 8a shows an aluminium-silicon alloy with 
part of the aluminium matrix removed by etching. The octahedral shape of the primary 
silicon crystals and the comphcated arrangement of the eutectic sihcon lamellae would 
not show up as clearly in a plane cross-section. 

Etching is not necessary when material ( atomic-number ) contrast is utilized for image 
formation: Differences of atomic number show up as differences in brightness, the phase 
containing lighter elements appearing darker. Unavoidable height differences of hard and 
soft phases after mechanical polishing result in additional and usually unwanted 
topographic effects. However, differences in surface roughness after polishing have been 
shown recently to produce pronounced contrast between hard and soft phases if contam- 
ination is avoided (see §3.3.1). Weak contrast mechanisms, as for example channelling 
electron contrast or magnetic contrast (see §3.3.3 and §3.3.4 respectively), are imposs- 
ible to detect in the presence of a deformed layer or of topographic features. Therefore, 
deformation-free and plane cross-sections must be prepared by careful polishing when 
studying microstructures by means of these contrast mechanisms. Sample preparation by 
electron polishing or by ion polishing is advantageous for achieving maximum contrast 
and avoiding artefacts in these cases (Nakagawa [1987]). 

3.3. Typical forms of contrast 

3.3.1. Topographic contrast 

The most pronounced contrast effects result from the dependence of secondary 
electron (SE) yield, back-scattering coefficient and detector collection efficiency on the 
angle between the surface element and the primary electron beam (see §3.1) or the 
detector position, respectively. The resulting contrast (see fig. 8a) is analogous to an 
optical image where the light comes from the detector and the observation direction is 
that of the incident electron beam (Newbury [1977], Reimer and Pfefferkorn [1977] 
or Joy [1992]). The stereoscopic effect is enhanced by shadows in regions which are 
hidden from the detector and by enhanced emission at edges. These latter effects are 
disadvantageous when deep holes and cracks or transparent edges are present. On the 
other hand, the edge effect can be advantageously be used to differentiate between phases 
of similar atomic weight for which material contrast (§ 3.3.2) is not pronounced. This has 
been demonstrated for an aluminium silicon alloy (fig. 8d), e.g., by Paul and Bauer 
[1983], see also Exner [1985]. 

The high spatial resolution (both transversal and in depth) obtained with secondary 
electrons excited by the incident beam is disturbed by secondary electrons excited by the 
back-scattered electrons coming from deeper in the specimen; thus the clear topographic 


References: p. 1016. 



968 


H. E. Exner 


Ch. 10, §3 


image can be obscured by atomic-number contrast and the resolution is reduced 
pronouncedly if special measures are not taken. There are several ways to reduce these 
effects (Blaschke [1979]). Superposition of the SE signal and the inverted BE signal 
seems to reduce interference from deep specimen layers almost completely (Volbert 
[1981] and Pfefferkorn and Blaschke [1982]). 

The vast majority of applications of the SEM in materials science (as well as other 
fields) rely on this mode while all other types of contrast together are used much less 
(estimated to less than 10 percent). The major reason for this fact is the high resolution 
obtained with SE micrography ranging from 5 to 10 nm in conventional instruments to 
1 nm and below in the currently most advanced (ion-emission gun equipped) scanning 
electron microscopes. 

3.3.2. Material (atomic number) contrast 

As discussed in §3.1, the yield of back-scattered electrons as well as that of 
secondary electrons is a function of the atomic number Z (fig. 9). Material contrast is 
useful for qualitative identification of phases and is especially suitable for quantitative 
evaluation of microstructural geometry by stereological techniques (see § 7). However, 
the atomic number contrast is usually obscured by topographic contrast (fig. 9) and, for 
secondary electrons, is sufficiently strong only in favourable cases, i.e., for ideally flat 
surfaces. For backscattered electrons, the topographic contrast can be greatly reduced 
even if the surface is rough by using ring-shaped detectors. Edge effects may still be a 
problem, and ideally flat polishing is necessary for difficult specimens in which the 
phases have similar compositions or consist of elements which are near neighbours in the 
periodic table. A difference of average atomic number, AZ<1 (i.e., even mixtures of 
nearest neighbours in the periodic table) is sufficient for slight material contrast, at least 
in the lower range of atomic numbers where the contrast is more pronounced (see fig. 9). 
Figure 8b shows an example of strong BE material contrast. 

The limitation for many problems in physical metallurgy is the lack in resolution due 
to the large depth of information (see fig. 7) which, for highly dispersed phases, results 
in blurred edges and detection of features not intersected by the cross-section but close 
below it. However, if phase interfaces are normal to the cross-section, excellent 
resolution is obtained (fig. 8e). 

By reducing the energy of the primary beam, disturbing effects can be reduced. Table 
3 shows the depth of information as a function of primary voltage. Even at low voltages, 
the depth of information from metal surfaces is at least one order of magnitude larger for 
backscattered electrons than for secondary electrons. Material contrast is reduced at lower 
voltages and additional contrast-enhancement techniques by surface treatment have been 
worked out for individual cases. 

A large number of problems in physical metallurgy and materials science have been 
solved using the BE mode, particularly since the early semiconductor-based detectors 
were substituted by optimized scintillator-light guide-photomultiplier (SLP) detectors 
(Robinson or YAG type). Some useful applications of these types of detectors were 
reported by Bauer and Egg [1984], for example. 
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Table 3 

Information depth obtained with secondary and backscattered electrons for a few elements as a function of 
acceleration voltage of the primary electron beam, and with secondary electrons (according to Seiler [1968] 

and Reimer and Pfefferkorn [1977]). 


Element Atomic 
number 

Density 

(Mg/m 3 ) 



Information depth (nm) 



back-scattered electrons 


secondary 

electrons 

5 keV 

10 keV 

20 keV 

30 keV 

50 keV 

C 

6 

2.3 

330 

970 

2800 

5300 

11 600 

10 

A1 

13 

2.7 

120 

360 

1050 

1950 

4290 

1.2 

Cu 

29 

8.9 

40 

110 

320 

590 

1300 

0.5 

Ag 

47 

10.5 

30 

90 

270 

500 

1100 

1.0 

Au 

79 

19.3 

20 

50 

150 

270 

600 

1.8 


3.3.3. Electron channelling contrast, electron channelling and Kossel patterns 

Electron channelling patterns (ECP) arise because of the fact that the primary 
electrons penetrate into the crystal to a depth which depends on the atomic packing 
density along different crystallographic directions. If the electrons follow the channels 
between rows of atoms, their re-escaping probability becomes low. A large number of 
Bragg conditions are satisfied with quite small angular variations. By rocking the primary 
electron beam about a point in the sample, diffraction lines from a small area (< 10 pm) 
are obtained. These selected-area electron channelling patterns (SACP) are similar to 
Kikuchi patterns obtained in transmission electron microscopy (ch. 11, §3.2) and are 
therefore often called pseudo-Kikuchi patterns. Information on the crystal orientation and 
crystal perfection, grain boundaries, twins and other crystallographic features is obtained 
from minute regions of a surface layer less than 50 nm thick. An example is shown in 
fig. 8f. Extensive discussions of electron channelling contrast show its usefulness for 
materials investigations (McCall [1973], Newbury and Yakowitz [1977b], Payne 
[1982], Verhoeven [1986], Newbury etal. [1986], Joy [1992], Riedle etal. [1994]). 
In the investigation of fracture surfaces, bend contours were observed by ECP (David- 
son [1974]). The study of orientation relationships of twins and of local textures are 
prominent examples of successful application for SACP. Though more often used in 
transmission or with higher primary energy in the electron microprobe analyzer, Kossel 
patterns (named after the explorer of this effect) are another efficient means to analyze 
the structure of small volumes in the order of /zm 3 in the scanning electron microscope. 
Kossel lines arise by interference of X-rays excited by the focussed electron beam at the 
surrounding lattice planes. On the film, lines with bright-dark fine structure are obtained 
which are easily interpreted and give useful information on orientation, lattice parameter 
and lattice deformation after distortion (Brummer and Nieber [1975], Dingley etal. 
[1982] or Newbury etal. [1986]). 

An interesting study of orientation relationships of new grains formed by recrystal- 
lization of aluminium, using the transmission Kossel technique in a modified electron 
microprobe analyzer by Doherty and Cahn [1972] is discussed in ch. 28, § 3.3. 
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A new type of microscopy called “orientation image microscopy ” (Adams et al. 
[1993] and Wright [1993]) uses backscattered Kichuchi patterns to determine lattice 
orientation in localized regions (approximately 0.2 /xm). Rapid indexing of the diffraction 
patterns by computer analysis is combined with computer-controlled movement of the 
electron beam or the sample stage. Thus, a computer image of the surface is obtained 
displaying orientation differences of the grains in polycrystalline material with a 
resolution of 1 degree misorientation. This technique was shown to yield information 
exceeding that by X-ray texture analysis and stereological grain size analysis as 
demonstrated for rolled aluminium and iron-silicon alloys (Adams etal. [1993]). 

The above-mentioned methods have recently been discussed and critically compared 
by Randle [1993], 

33.4. Magnetic contrast 

Magnetic fields of ferromagnetic crystals can affect the interaction of the primary 
beam or the resulting emission. Type-I magnetic contrast uses the deflection of the highly 
directional electrons by the leakage field which may amount up to 20% for materials 
with strong fields such as cobalt. The resolution with which the boundaries of magnetic 
domains can be picked up is only of the order of several /xm owing to the diffuse nature 
of leakage fields. Type-II magnetic contrast arises from the deflection of primary 
electrons by the Lorentz force inside the crystal. Magnetic domains appear in light-dark 
contrast due to differences in backscattering coefficient, with a strong tilt-dependence, 
and can be enhanced by filtering using high-energy BE only, with a resolution in the 
order of 100 nm. (For details and applications of magnetic contrast see Newbury and 
Yakowitz [1977b], Reimer and Pfefferkorn [1977], Elsbrock and Balk [1982], 
Vander Voort [1985c], Newbury etal. [1986], Verhoeven [1986] or Joy [1992]). 

33.5. Charge collection microscopy and electron-beam-induced current 
measurements 

Secondary electron emission is sensitive to surface potentials and electric field 
gradients. Therefore, in samples where local variations of potentials appear, regions of 
differing brightness will show up in the SE image. This effect can be enhanced by a 
positive or negative bias of a few volts which impede or enhance SE emission, respect- 
ively. Usually (though incorrectly) called voltage contrast, this technique provides another 
way to produce images in the SEM. EBIC (electron -beam induced current) and EMF 
(electromotive force) measurements depend on similar phenomena (creation of excess 
electron/hole pairs by the electron beam) and give useful information on diffusion length 
and lifetime of minority carriers in semiconductors. The term “charge collection 
microscope” (Leamy [1982]) has been coined for these techniques which make the 
scanning electron microscope one of the most versatile tools for the characterization of 
semiconductor materials and failure analysis in the electronic industry (see, for example, 
Newbury etal. [1986], Schaper and BOsel [1985], or Verhoeven [1986]). 

3.3.6. X-ray mapping 

Emission of fluorescent X-rays can be picked up by a solid-state detector and 
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analysed with respect to its energy. For producing an image (i.e., a so-called X-ray map), 
an energy window is set which selects X-ray quanta typical for a specific element. The 
efficiency of the X-ray process is small, typically 10" 4 photons/incident electron, and only 
a few thousand photons reach the detector per second. The beam of the display cathode- 
ray tube receives an intensity pulse whenever a certain preset number of X-ray quanta of 
preset energy reaches the detector. In this way, regions with a high concentration of the 
corresponding element appear bright while others appear dark. If this is done consecu- 
tively for several elements, X-ray maps of the distribution of these elements in the region 
scanned by the electron beam are obtained. 

Owing to the large depth of X-ray emission (see fig. 7) and the large background 
noise due to Bremsstrahlung, the resolution of X-ray maps is limited to a few /zm. 
Nevertheless, X-ray maps are used extensively for demonstrating element distributions in 
metallic materials allowing a quick and unambiguous identification of phases and con- 
centration gradients caused, for example, by segregation during solidification or by 
diffusion at the interface between phases which are not at thermodynamic equilibrium. 

3.3.7. Cathodoluminescence 

The recombination of electron/hole pairs causes a release of energy which is emitted 
from the sample. Part of the emission is in the visible or infrared part of the spectrum and 
can be collected and amplified to form an image. The ability of some materials to emit 
this long-wave radiation is known as cathodoluminescence (CL). Only few examples have 
been reported with metallic materials where radiation is caused by surface plasmon effects 
(e.g., Sdeber [1982] or Richards and Triggs [1982]). Ceramic materials and semicon- 
ductors have been occasionally studied by CL, for example Al 2 0 3 -Zr0 2 , MgO and Zr0 2 
ceramics. SiC fibres or diamond (Czernuska and Page [1985], Brown etal. [1988] and 
Page [1993] and GaAs (Davidson [1974], see also Pfefferkorn etal. [1980], Holt and 
Saba [1985], Newbury et al. [1986] or Joy [1992]). Mineralogical and particularly 
biological applications are frequent (see, for example, Reimer and Pfefferkorn [1977], 
Goldstein and Yakowitz [1977], Goldstein et al. [1994]). 

3.4. Accessory equipment 

The large specimen chambers of most commercial instruments allow special stages 
to be mounted by which various experiments can be carried out to yield additional 
information on the nature of metallic microstructures and their development. 

3.4.1. Stereomicroscopy 

Simple tilting stages or goniometer stages allow stereopair micrographs to be taken 
which give an excellent three-dimensional impression of rough surfaces when viewed in 
a stereoscope, by the “anaglyphe” method using a red and a green filter for the two 
images or by viewing with polarized light. The tilt angle and the viewing distance 
determine the subjective impression of depth. Quantitative evaluation of the parallax 
yields accurate data on the z coordinates at regularly spaced or arbitrarily chosen x-y 
positions. These data can then can be combined to construct height profiles or height 
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maps (e.g., Blaschke and Waltinger [1971], Boyde [1973], Arzt and Fischmeister 
[1979], Bauer and Exner [1981], Beaton and Filshie [1983], Gabriel [1987, 1992a], 
Lyman etal. [1990], Lienkamp and Kunaver [1994]). This technique can be used to 
obtain height differences destruction free and with high resolution which will often 
compare favourably with stylus-type instruments) and characteristic parameters (rough- 
ness indices, distribution of tilt angles for surface elements, etc.) for fracture surfaces and 
other rough surfaces. Figure 10 shows the profile map of a ceramic fracture surface together 
with the tilt-angle distribution. Further applications for solving problems in materials 
science have been reported for example by Exner and Fripan [1985], Sigl and Exner 
[1989], Feijoo and Exner [1991], Lienkamp and Exner [1994], and by many others. 


3.4.2. Dynamic and non-ambient-temperature SEM 

Large depth of focus and the possibility of rapidly changing the magnification in 
combination with mechanical or low- and high-temperature stages are prerequisites for 
continuous observation of specimens subject to applied stress, magnetic or electric fields, 
chemical reaction, and various effects due to cooling or heating. A multitude of 
phenomena in physical metallurgy have been studied, including fatigue crack growth in 
ferrous alloys (Schaper and BOsel [1985]), crack propagation and delamination in hard- 
layer coated steel during cyclic bending (Wetzig etal. [1984]), cracking of surface 
layers during heat treatment (Grabatin et al. [1983]), deformation of superplastic 
lead-tin eutectic, the effect of stress and magnetic field on the configuration of magnetic 
domains in an iron-silicon alloy (Newbury and Yakowitz [1977b]), solid-state 
sintering (Fulrath [1972]), or liquid-phase sintering (Riegger etal. [1980]). Video- 
recording is an ideal way to register events of interest, and special devices have been 
developed to record fast processes such as cracking or martensitic transformation with 
much higher resolution than is possible with optical microscopy. 

4. Scanning tunneling, atomic force and related microscopies 

Only a little more than a decade ago, the application of the concept of electron tunneling 
known since the early 1920’s led to the development of a new type of microscope, the 
scanning tunneling microscope (STM) by Binnig etal. [1982], Binnig and Rohrer [1982], 
STM is not just another type of high-resolution electron microscopy. A whole class of novel 
techniques based on an astonishingly simple concept allows the detection of almost any kind 
of interaction on a nanometer or even subnanometer scale. Some of these techniques 
(scanning tunneling microscopy, scanning tunneling spectroscopy, atomic force microscopy 
and magnetic force microscopy) have rapidly evolved into routine tools for surface character- 
ization, and new techniques based on similar principles are invented each year. In the decade 
of its development, dramatic impacts in fields as diverse as materials science, biology or 
electro-chemistry (among many others) have been made, and the papers published exceeded 
the thousand some time ago with a rapidly increasing rate of several hundreds each year. 
Review papers and monographs became available (e.g., Hansma and Tersoff [1987], 
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Angle normal to crack , deg. 

Fig. 10. Fracture-surface analysis by instrumented stereoscopy: (a) scanning electron micrograph evaluated by 
stereometry; (b) line profiles; (c) distribution of profile line length as a function of tilt angle of surface profiles 
perpendicular to crack direction. (After Bauer etal. [1982].) 


Binnig and Rohrer [1987], Garcia [1987], Wickramasinghe [1989, 1992a, b, 1993], 
Bonnell [1990, 1993a, b], Sarid [1991], GOntherodt and Wiesendanger [1992], 
Wiesendanger and GOntherodt [1992], Stroscio and Kaiser [1993], Dinardo 
[1994a, b], see also Further reading). The rapid development makes it difficult to follow up 
all issues of interest, even in a single field like physical metallurgy, and only some basic 
principles and a few examples of typical applications will be presented in the following. 
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4.1. Basic principles and capabilities 

An infinite potential energy barrier exists between two surfaces separated by a large 
distance. This barrier becomes finite when the surfaces are in close proximity, typically 
in the order of a few nanometers, and, when a voltage is applied, a statistically signif- 
icant number of electrons can penetrate the energy barrier. This effect was called electron 
tunneling and has been theoretically very well understood and quantitatively described on 
the basis of the classical wave particle dualism concept of quantum mechanics though a 
microscopic quantum-mechanical theory taking geometric effects into account is needed 
in the case of atomic resolution (see, for example, Tersoff and Hamann [1985], 
Tersoff [1993], Tersoff and Lang [1993]). 

The overall scheme of a scanning tunneling microscope is shown in fig. 11. To 
produce images of a surface at very high resolutions, a sharp metal tip, often made of 
tungsten or platinum, is scanned over the surface of the sample under investigation. The 
motion of the tip is controlled by piezoelectronic elements (usually lead zirconium 
titanate) in x and y directions by applying simple linear voltage ramps. A third element 
controlling the z position of the tip can be operated in two modes: in “constant current” 
imaging, the tip follows a constant charge density contour representing the surface profile 
by keeping tunneling current constant. This is achieved by a computer-controlled feed- 
back mechanism. The signal required to alter the vertical tip position produces the image. 
Altering the level of constant current (or the voltage applied between the tip and the 
surface) produces contours of different charge densities. In the second mode, “constant 
height” operation, the tip is held at a fixed z position and, while scanning the surface, 
changes its relative position to the sample surface. Thus, the sample/tip separation and, 
as a consequence, the tunneling current vary which is used as signal. 

As the tunneling currents are in the range of 1CT 9 A, the signal must be amplified by 
7 to 10 orders of magnitude. Effective shielding, avoiding mechanical vibrations by 
combining a series of damping strategies, careful sample preparation and a clean and 
sharp tip are critical for obtaining optimum resolution and unambiguous interpretation of 
the images. In contrast to conventional electron microscopy, scanning tunneling micro- 
scopy can not only be performed in vacuum but also in air or even in liquids. 

4.2. Atomic force microscopy 

Only a little later than STM, in 1986, Binnig et al. [1986] developed the atomic 
force microscope (AFM) capable of measuring or imaging the forces between the sample 
surface and the tip. This is achieved by attaching the tip to a cantilever. Forces acting on 
the tip produce a deflection which can be measured down to 1CT 12 m corresponding to 
forces in the order of 10” 13 N. (At the time this article was written, commercial instru- 
ments have force and depth resolutions of approx. 10“ 7 N and 4x 1(T" m, respectively). 
Since these forces can be measured locally with lateral resolutions in the order of 
0.1 nm, the forces between groups of atoms or even single atoms can be measured, and 
force microscopy has rapidly developed to a quantitative probe of all kinds of surface 
forces as electrostatic, capacitive, magnetic, adhesive, capillary, frictional or van der 
Waals (Sarid [1991], Meyer and Heinzelmann [1992], Hues etal. [1993], Burnham 



Fig. 11. Overall scheme of the scanning tunneling microscope (after Bonnell [1993c]). 


and Colton [1993]). The theoretical treatment of AFM is a complex problem and is still 
in progress involving empirical potentials, molecular dynamics (Abraham etal. [1989]), 
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continuum elasticity theory (Tomanek et al. [1989]) and ab-initio calculations (for a 
recent review see Meyer and Heinzelmann [1992]). Though there are a number of open 
questions in interpreting the major features of force distance curves and the nature of the 
imaging mechanisms as discussed by Burnham and Colton [1993], atomic force 
microscopes are now commercially available at reasonable prices, capable to measure 
surface forces and nanomechanical properties of materials as well as to modify surfaces. 

4.3. Tunneling spectroscopy 

Tunneling spectroscopy (TS) is another source of information available with the 
scanning tunneling microscope (see, for example, Hamers [1993], Stroscio and 
Feenstra [1993], Bonnell [1993b]) providing data relating to the local electronic 
structure of the surface. Compared with other surface spectroscopy techniques such as 
photoemission microscopy (UPS and IPS), electron energy loss spectroscopy (EELS) or 
infrared spectroscopy (IS and IRRAS) which provide information averaged over a large 
surface region (see § 6), with TS it is possible to measure the electronic structure on an 
atom-by-atom basis. Surface irregularities as impurities, steps, defects or electronically 
inequivalent atoms in ordered structures can not only be seen by STM but also be 
correlated with their effect on local changes of the electronic structure. Point spectro- 
scopy involves moving the tip to a feature of interest, modulating the tip voltage and 
recording the resulting change of tunneling current. In a second method for obtaining 
spectroscopic information, the tip voltage is modulated at high frequency with respect to 
the time constant of the feedback controller and the current is recorded at discrete values 
of the tip voltage. Though well funded by bulk tunneling (see, for example, Hansma 
[1982] or Wolf [1986]), proper analysis of the data is crucial to the use of this powerful 
technique and is presently still in the state of development. 

4.4. Related scanning techniques 

The same scanning and feedback principles as in scanning tunneling microscopy can 
also be applied to use other types of interactions between the sample and the tip for 
characterizing the properties of surfaces and to produce images. In addition to atomic 
force microscopy (AFM), magnetic force microscopy (MFM), after the first successful 
attempts in 1987, has been developed as a technique of its own with commercial 
instruments becoming available and interesting results being reported (for a recent review 
see Gruttner et al. [1992]). The principle is the same as that of the atomic force 
microscope, but the tip is made by electrochemical etching of ferromagnetic foils or 
wires, most often nickel, but also iron, cobalt or amorphous FeBSiC, or by coating the 
tungsten tip with a ferromagnetic thin film. The probe senses the magnetic stray field 
(which is not easily related to sample magnetisation and, therefore, may be a dis- 
advantage of this technique compared to electron probe based techniques like scanning 
electron microscopic polarisation analysis (SEMPA)). Also, lateral resolution is routinely 
limited to 40 to 100 nm with optimum values of 10 to 20 nm. On the other hand, MFM 
can be operated in air and requires little or no sample preparation. 

Another modification of the force microscope is the electrostatic force microscope. 
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A family of techniques yielding information on the structure of buried interfaces, is 
ballistic electron spectroscopic emission microscopy (BEEM), useful for studying the 
transport and scattering of electrons and holes in multilayer structures (see reviews by 
Kaiser et al. [1993] or Bell et al. [1993]). Furthermore, numerous STM related 
techniques have been invented and demonstrated to yield specific information on materials 
surfaces. Among those described in recent reviews (Wickramasinghe [1992a,b, 1993], 
Pohl [1992]), are the scanning noise microscope, the scanning tunneling potentiometer, the 
scanning photon microscope, the inverse photoemission microscope, near field thermal 
microscopy, scanning optical and acoustical probe microscopes, with more to come up in 
the future (for a list of more than 20 different techniques of scanning microscopy see 
Wickramasinghe [1993], for details see the most recent publications on STM conferences). 

4.5. Applications 

The major reason for rapid acceptance of scanning tunneling microscopy is the fact 
that three-dimensional images of the real surface are obtained at unprecedented levels of 
lateral and vertical resolution. Under optimized sample and tip conditions, quantitative 
information about topography and electronic structure can be obtained for individual 
surface atoms as well as for groups of atoms, defects, etc. Thus, STM is not just another 
microscopic technique but, as discussed above, comprises a new class of surface analysis 
techniques not available when the last edition of this book was published. Applications 
are manifold, and, though the majority of research papers are aimed to demonstrate the 
opportunities of STM and related techniques under extreme conditions, applications to 
actual problems of physical metallurgy and materials science have been reported. 

Obviously, scanning tunneling microscopy is the ideal technique to study the 
arrangement of atoms on the surface of crystalline solids. This arrangement influences 
processes like epitaxial growth of thin films or catalytic reactions as well as surface 
properties like surface energy or electronic behaviour (see the recent review by Unertl 
[1993]). Since surface atoms have dangling bonds, they rearrange in order to minimize 
energy and form a structure which is said to be reconstructed (Zangwill [1988]). The 
reconstructions of the (111) and (100) surfaces of Si are probably the ones most often 
studied (see, for example, Giessibl [1995]. The (100) and the (110) surfaces of Ir, Au 
and Pt have similar reconstructions. Reduced atomic densities are observed for the (110) 
surfaces called missing row structure because every second row of atoms has been 
removed. Surprisingly, the atoms in the (111) surfaces show a higher density than in the 
bulk. These and many other results have been reviewed by Unertl [1993], Opielka et 
al [1993] or by Stroscio and Kaiser [1993], An example of an STM image is shown 
in fig. 12. The local symmetry reflects the local electronic density of states. The defects 
in the periodic structure (upper left) are due to impurities. 

For the study of amorphous materials or where large single crystals do not exist, 
STM and related techniques are presently the best structural methods available. Their 
major limitations are that the subsurface structure cannot be detected and the atomic 
position cannot be determined with the same degree of precision as with diffraction 
techniques. These and other limitations of STM and AFM in surface crystallography are 
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Fig. 12. Scanning tunneling micrograph of the (0001) surface of graphite (courtesy of M. GObel and Von 
Blanckenhagen, Institut flir Materialforschung I, Kemforschungszentrum Karlsruhe). Scan size 12x12 nm, 
bias -60.8 mV; z range 1.0 nA (black; 0, white 1.0 nA). Note the scales are in nm at the axes and the grey 
levels indicate nA. 


discussed in recent reviews (e.g., Unertl [1993]). 

Surface diffusion at low temperatures (down to room temperature for Au and other 
metals) was measured by STM and MFM with interesting correlaries on the possibility 
to extrapolate high temperature data. Hillock formation by surface diffusion, surface 
phases and their transformation with temperature, surface premelting (explaining why 
solids can not be superheated above their melting points) and many other phenomena 
(see, for example, Kuk [1993], Stroscio and Kaiser [1993] and the reviews cited above 
and under Further reading at the end of this chapter) were studied by the techniques 
discussed in this paragraph. 

The ability to apply scanned probe techniques to solid/liquid interfaces opens a wide 
field of applications in adhesion, friction, materials processing and, most widely studied, 
electrochemistry. Special instruments for electrochemical STM have been developed and 
a wealth of information on the evolution of surfaces and films during electrochemical 
treatments of surfaces has been reported (for reviews see Sonnenfeld et al. [1990], 
Siegethaler [1992] or Cataldi et al. [1990], Barda and Fan [1993]). For example, 
the development of smooth areas during electropolishing, preferential attack at dis- 
locations, roughening during etching, the formation of islands in the early stages of film 
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deposition and long-range ordered structures on the surface of etched Si, like those 
obtained in ultra-high vacuum, were observed. 

Images of magnetic recording media were produced by MFM and their recording 
behaviour was investigated. Laser-written domains in magnetically hard and soft 
materials as well as natural domains and Bloch wall segments in a wide range of 
magnetic materials were imaged, and magnetic surface forces were measured. Again, 
there are a number of limitations, some of which will certainly be overcome in future 
developments. 

The various spectra, obtained with tunneling spectroscopy, particularly tunneling 
current/applied bias curves, have been analysed for their information on the correlation 
between the position and the electronic structure of surface atoms. 

With the development of nanotechnology more and more questions will seek answers 
which can only be given on the basis of techniques reading atomic resolution. With the 
exception of atomic force microscopy, the use of these techniques is rather specialised 
at present, and it remains to be awaited which of them will contribute most to solve 
materials science problems. 

5. Other special techniques of surface microscopy 

In addition to light microscopy and transmission and scanning electron microscopy 
used routinely in all fields of materials research, development and control, microstruc- 
tures can be investigated by several more exotic image techniques. While some of these, 
such as scanning tunneling, atomic force, photoemission or field-ion microscopy, are of 
high interest for various advanced studies of material surfaces, others are still in the stage 
of experimentation, have been substituted by other techniques or are more useful in other 
fields of application like biology or mineralogy. 

Instruments capable of analyzing the chemical nature and the electronic state of 
surface atoms have been developed at a rapid rate in the last few years, utilizing all kinds 
of interaction with incident photons, electrons and ions. In §6, a summary of these 
techniques in a classification with respect to exciting and emitted species is presented. 
The information obtained stems nearly exclusively from regions close to the surface and 
thus is decisively distinct from that furnished by bulk chemistry. If the lateral resolution 
is sufficiently good, the signals can also be used for surface imaging. 

A full account of all available techniques cannot be given in this context but short 
surveys of those being used to some extent in the investigation of metal surfaces in 
physical metallurgy will be given in this section. 

5.1. Scanning acoustic and thermal wave microscopy 

Though based on early discoveries and inventions (for reviews on historical develop- 
ments see Wickramasinghe [1983], Somekh [1990] or Briggs [1992]), acoustic and 
thermal-wave microscopy are rather new developments (Korpel etal. [1971], Lemons 
and Quate [1974], Wickramasinghe etal. [1978], Koch [1979], Ash [1980], Rosen- 
cwaig [1982] and Murphy [1993]). 
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Various types of signals and detectors have been used to form magnified images and, 
depending on the physical principle, the instruments have been called acoustic, thermal- 
wave, photoacoustic, or electronacoustic microscopes or thermal-wave and thermo- 
graphic microscopes, or thermoacoustic probes. Either a piezoelectric crystal is attached 
to the surface which emits acoustical signals, or alternatively, a laser or an electron beam 
are focussed on the surface resulting in periodic surface heating as the beam intensity is 
modulated and thermal and acoustic waves are sent through the sample. After reflection 
or scattering, the signals are picked up by a suitable acoustic or thermal detector 
(ultrasonic transducer, gas microphone, infrared detector) and transformed back into 
electrical signals. By applying the scanning principle, the efficiency of these types of 
microscopes can be fully utilized. The laser or electron beam is scanned across the field 
of view, and the signal received is used to modulate the brightness of a cathode-ray tube. 
The information furnished is different from that obtained with optical microscopes or 
with scanning electron microscopes in that it reveals sub-surface features such as, for 
example, pores and other defects, grain boundaries and microstructural components 
which differ from the surrounding matrix by their acoustic or thermal properties (density, 
specific heat, thermal conductivity). 

5.1.1. Scanning laser acoustic microscopy 

The scanning laser acoustic microscope (SLAM), also termed scanning photoacoustic 
microscope (SPAM) has now been developed to a commercially available instrument 
owing to its high versatility. In the range of high frequencies (1 to 10 GHz) the laser 
scans typically a field of 200 /nm square, and the typical magnification is 1000 x. The 
penetration depth is in the order of 100 p m. Since at these high frequencies the 
wavelength is similar to that of visible light (3 GHz corresponding to 520 nm), the 
maximum resolution of the acoustic and the optical microscopes are comparable. At the 
other hand of the spectrum, at 1 Hz to 100 MHz, the resolution is low but the depth of 
information increases to the size of the sample, e.g., to 50 mm at 10 MHz where, at a 
typical magnification of 2 to 3 x, the scanning area is approx. 100 mm square. 

With the possibility of changing the frequency, the depth from which the information is 
obtained can be varied and, as with the confocal principle, three-dimensional images can be 
produced. Also, using stereographic or holographic type of image processing is useful not 
only to get 3D images but also to reduce the blemishing effects of superimposed features. 
Quantitative information on the depth of features below the surface, the thickness and 
composition of surface layers or the depths of cracks can be extracted (for a recent 
review see Briggs [1994]). 

Various useful applications of scanning acoustic microscopes have been reported in 
the field of materials as, for example, by Wickramasinghe [1983], Tam [1986], Dieser 
and Herrmann [1988], Dieser and Matthaei [1988], Matthaei et al. [1988a, b], 
Opielka et al. [1990], Arnold [1991], Gilmore [1993], Murphy [1993] or Briggs 
[1994]. With the commercially available instrumentation and higher resolutions to be 
achieved in the future by decreasing the noise and wavelength with extensive work under 
way further, at research institutes as well as at industrial firms (Gelmore [1993]), acoustic 
microscopy is a promising alternative for materials characterization and product testing. 
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5.1.2. Thermal-wave microscopy 

With the electroacoustic principle excitation by pulsing the beam in a scanning 
electron microscope, a thermal-wave microscope is available by just attaching an 
additional detector to the SEM. The spatial resolution of such a device (also called 
thermoacoustic probe) is determined by the spot size of the incident beam, the modula- 
tion frequency and the thermal conductivity ranging from one /zm or less for insulating 
materials at high frequencies (several GHz) to a few mm at 100 Hz. Owing to the fact 
that thermal waves are more sensitive to local variations and have a better resolution than 
acoustic imaging, there are numerous potential applications for the thermal-wave 
microscope as well. Nevertheless, this technique has not been routinely applied in 
materials investigations. Some applications of thermal-wave microscopy were reported by 
Winschuh etal. [1989, 1993] and Murphy [1993] among others. 

5.2. Field-ion and field-electron microscopy 

Very high gradients of electric fields at the surface of a metal cause emission of 
electrons and ions. This is the basis of field-ion and field-electron microscopy. 

5.2.1. Field-ion microscopy 

Extremely high resolutions, down to the atomic dimensions (<0.15 nm) are achieved 
in field-ion microscopy (FIM) and the imaging atom probe (IAP). A large number of 
monographs (e.g., MOller and Tsong [1969], Bowkett and Smith [1970], Wagner 
[1982, 1985], Sakurai et al. [1990], Tsong [1990] or Miller et al. [1992] and 
handbook articles (e.g., Edelmann [1980], Smith [1986], Seidman [1993] or Cerezo 
and Smith [1994]) give a full account of these and the related spectrometric techniques. 
In field ion microscopy, noble gas atoms (usually helium, but also neon or argon) or 
hydrogen are ionized by tunneling of electrons at the cooled surface of a pointed metal 
tip. The ions are extracted from the surface by a high voltage and accelerated to hit a 
channel-plate converter. The secondary electrons are multiplied and emitted radially to 
a fluorescent screen. A high resolution image of the tip is obtained showing individual 
atoms and their arrangement. As a typical example, the image of a tungsten tip is shown 
in fig. 13. Ionization occurs more readily at terrace steps which appear bright and outline 
facets of low-surface-energy planes. Lattice defects cut by the tip surface such as 
dislocations, stacking faults, grain boundaries or antiphase boundaries in ordered 
structures are revealed. Vacancies and interstitials can be observed and their movement 
studied by taking photographs at certain time intervals. 

At the field strength produced at a pointed tip in the field ion microscope (approx. 5 x 
10 10 V/m at an applied voltage of 10 kV) it is also possible that atoms of the specimen 
material are ionized and are emitted from the surface and accelerated toward a detector 
the light output of which is recorded photographically. This mode of image generation 
is called field ion evaporation microscopy (removal of specimen material) in contrast to 
the field desorption microscope (removal of absorbed gas ions) described above. Field 
evaporation can be controlled with great precision by applying the potential as short 
high-voltage pulses. By using pulse times in the order of a few milliseconds, two to three 
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Fig. 13. Field-ion micrograph (reduced in size for printing) of a single-crystal platinum tip, (lOO)-oriented with 
facets of low-index planes. Magnification approx. 35 x 10 6 . (From Doerr and Ownby [1975].) 

atoms can be removed at a time. The amount of the material analysed by the pulse 
technique is in the order of one million atoms per day, i.e. a volume of 10” 23 m 3 . The 
resolution is further increased by cooling the tip. With sharp tips, and at low temperat- 
ures, it is possible to image individual atoms but typically much larger areas with a 
higher ion output are observed in order to get a larger volume analysed, though at a 
decreased resolution. 

5.2.2. Atom-probe field-ion microscopy 

The chemical nature of the ions pulled off from the tip surface by the high-voltage 
pulse can be identified by attaching a time-of-flighi (TOF) mass spectrometer. By 
positioning the tip with respect to an aperture hole in the fluorescent screen it is possible 
to focus each individual surface atom. In the detector system specially designed for this 
purpose, each ion arriving produces a single voltage pulse. A minicomputer controls a 
digital timer and analyses the data. Combining the time of flight information with the 
information on the location where the ion had been removed, it is possible to obtain an 
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atomic resolution image of microstructural features and to measure the mass-to-charge 
ratio of single atoms from preselected regions of the specimen. The lateral resolution of 
this instrument called atom probe is in the order of a few tenths of a nanometer while the 
depth resolution depends on the interplanar spacing which, for high index planes, is in 
the order of hundreds of a nanometer. 

In another mode, called imaging atom probe (IAP) maps can be produced for different 
kinds of elements by switching the detector on during the time when the species of 
interest arrive at the screen after the pulse was applied. These maps are similar to the X- 
ray maps produced in the SEM with two decisive differences: The resolution of the atom 
probe maps in orders of magnitude better than that of X-rays. However, since in the 
imaging atom probe the surface atoms are removed by the pulsed voltage and time gating 
can only be performed for one species at a time, the maps observed for different 
elements can not be obtained from exactly the same parts of the specimen. For one 
element, three-dimensional information is obtained by analysing layer by layer, and the 
element distribution in the sample is obtained which offers unique opportunities in 
advanced studies of microstructure and its evolution. 

Finally, the pulsed-laser atom-probe field-ion microscope or, in short, pulsed laser 
atom probe (PLAP) must be mentioned which uses a laser pulse instead of the voltage 
pulse on top of the steady-state dc voltage (Kellogg and Tsong [1980]). Pulsed-laser 
atom-probe field-ion microscopy has been used to characterize semiconductors and 
conducting ceramics where a voltage pulse may lead to fracture due to thermal stresses. 

5.2.3. Field-electron microscopy 

Field-electron microscopes (FEM) are non-commercially made laboratory equipment 
in which an etched single-crystal tip is heated in high vacuum. The emitted electrons are 
accelerated by an anode and produce an image on a fluorescent screen. The intensity of 
electrons emitted (field emission current) depends on the voltage and the work of 
emission; the lattice structure and local geometric structure of surfaces can be studied 
with high resolution down to a few nanometers (MOller [1936], Edelmann [1980]). 

The crystallographic structure of clean surfaces and (if by chance a grain boundary 
was located in the tip) the structure and the movement of grain boundaries as well as 
changes of the tip geometry and allotropic transformations of the tip material during 
heating have been studied; by measuring the energy distribution of the field electrons the 
electronic structure of the single-crystal tip was investigated (for references see Edel- 
mann [1980]). Absorption of gas from the vacuum chamber or of evaporated substances 
(metals or oxides) changes the image drastically, which has been used for studying the 
sites of adsorption, the migration of adsorbed species along grain boundaries and the 
formation of compounds. 

It seems that this technique has been overrun and replaced by other developments 
since no mention is made in more recent reviews. 

5.2.4. Applications of field-ion microscopy 

In contrast to the statement made in the last edition of this book, the disadvantages 
connected with the high effort of preparing suitable tips and the extremely small volume 
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Fig. 14. Distribution of cobalt atoms in a very small volume (approx. 10" 23 m 3 ) of a copper-lat.% cobalt alloy 
at different stages of annealing at 450°C. The annealing times are (a) 10 min, (b) 30 min, (c) 120 min and (d) 
440 min (from CtRtzo and Smith [1994]). 

analysed (requiring long analysis times and a large number of tips for obtaining 
statistically significant results) do not prevent the application of field ion microscopy, and 
many interesting studies of materials have been reported. The identification of small 
carbides in HSLA steels (Kneissl and Brenner [ 1 990]) and particularly the study of the 
composition of carbides in secondary hardening in high speed steels (Fischmeister et al. 
[1988]) are examples where field ion microscopy has contributed to the quantitative 
understanding of technically important materials. Extensive studies of radiation effects in 
metals have been carried out (for references see Seidman [1993]). 

In a recent review by Cerezo and Smith [1994], a large number of case studies are 
discussed including secondary precipitates and segregation at y/y '-interfaces in super- 
alloys, phases and Bloch walls in Alnico permanent magnets, ordering and site occupat- 
ion in various alloys, spinodal decomposition of iron-chromium alloys, segregation at 
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grain boundaries and surfaces, oxidation processes and investigations of nanostructured 
and multilayer materials. Numerous field-emission microscopic studies have been made 
in the field of nucleation and growth where experimental results were needed to 
understand the morphology, crystallography and chemistry of particles in statu nascendi 
and growth for example of Guinier-Preston zones in various aluminium alloys (Hono et 
al. [1986, 1989]) and particularly of cobalt particles in copper cobalt alloys (Wagner 
and Brenner [1978], Haasen and Wagner [1980, 1985], Wagner [1985], Haasen 
[1985], and Cerezo et al. [1992]). Figure 13 shows, as an example for the kind of three- 
dimensional information obtained in such studies, the evolution of particles from a 
homogeneous Cu-1 at.%Co solid solution (Cerezo et al. [1992], Cerezo and Smith 
[1994]), as analysed by the position-sensitive atom probe field-ion microscope (PoSAP). 
For further examples of applications of the various types of field-ion microscopy as well 
as for details and references see the monographs by Wagner [1982, 1985], Miller and 
Smith [1989], Tsong [1990], Sakurai etal. [1990], Miller etal. [1992], the overviews 
by Brenner [1978], Edelmann [1980], Smith [1986], Seidman [1993], or Cerezo and 
Smith [1994] and the proceedings of various conferences (e.g., those of the International 
Field Emission Symposia) and current and special issues of journals devoted to surface 
analysis (e.g., those of Surface Science and Journal de Physique). 

5 .3. Photo-electron emission microscopy 

Microscopes using electrons excited by ultraviolet light were developed in the late 
fifties and made commercially available at the end of the sixties (for reviews see 
Wegmann [1972], Schwarzer [1981]). In these photo-electron emission microscopes 
(PEEM), a high-intensity beam of ultraviolet light is focussed by means of quartz lenses 
and mirrors on a small area of a surface (area in the order of 0.2 mm 2 , energy density 
several W/cm 2 ) which activates emission of relatively slow electrons. A high voltage is 
applied between the sample and a pierced anode facing the sample surface. These 
electrons, after passing the axial aperture of the anode and the magnetic lenses of a 
three-stage electron microscope, produce a highly magnified image on a fluorescent 
screen or a photographic film. Heating the sample up to temperatures of beginning 
thermionic emission allows in-situ high-temperature studies. Contrary to scanning 
electron microscopy, the light-beam cannot be focussed to a sufficiently small size, and 
integral image generation is used which has the advantage of small exposure times, 
varying with material and magnification from a tenth to a hundred seconds. The sample 
surface is cleaned by ion bombardment, but contamination is sometimes a problem in 
spite of the high vacuum in the chamber (typically 10~ 10 bar). 

The intensity distribution of photo-electron emission is essentially determined by the 
variation of electron emission with material and orientation (material and orientation 
contrast), with the orientation of a surface element with respect to the optical axis (relief 
or topographical contrast) and with distortions of the magnetic field in the vicinity of tips 
and edges. The topographical contrast is stronger than in SEM images but the stereo- 
scopic impression is usually less pronounced since the surface of the sample is always 
normal to the axis of projection and cannot be tilted. Material contrast is usually very 
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clear in unetched plane surfaces if the work-functions of different phases in a material 
are sufficiently different, which is usually the case. Orientation contrast is only pronoun- 
ced in cubic metals but not in non-cubic metals and alloys, in oxides or in carbides. 
Thus, material- and orientation contrast can be easily separated in most practical cases. 

Studies in materials research have provided much interesting information in all kinds 
of high-quality metallographic work (for early references see Wegmann [1972] and 
Schwarzer [1981]). Owing to the very small depth of information (10 nm), the high 
lateral resolution (< 10 nm), the excellent phase separation and the possibility for in-situ 
heating, photo electron microscopy is excellently capable for quantitative kinetic studies 
of changes in microstructural geometry (DannOhl etal. [1971], Wegmann [1972] and 
Middleton and Edmonds [1977]). For example, a study by Taylor and Pollard 
[1982] has revealed the bonding sequence (grain-boundary movement and annihilation) 
during diffusion-bonding of steel under load at temperatures up to 1000°C. 

The major draw-back of PEEM has been the very high price of instrumentation. The 
production of the commercially built Metioscope was discontinued and recent application 
reports are scarce. The photo-electric effect using ultraviolet light excitation is now 
utilized in various kinds of spectrometers, like ESCA ( electron spectroscopy for chemical 
analysis ), and UPS ( ultraviolet light-excited photoelectron spectroscopy ), see § 6. 

5.4. Scanning Auger-electron microscopy 

So-called Auger electrons are emitted when an electron beam hitting the surface 
creates electron holes in a lower shell (K, L or M) and when this hole is filled by an 
electron from a higher shell. The excess energy of this jump is transferred to another 
electron which is emitted. An electron detector and an electron spectrometer are used to 
register the number of electrons as a function of kinetic energy which depends exclus- 
ively on the energy levels of the electron shells involved and thus is specific for the 
element. By scanning the electron beam, successive points and line profiles can be 
analysed and used to form an image by modulating the brightness of a cathode-ray tube 
by the signal intensity of the Auger electrons. Auger maps very similar in appearance as 
X-ray maps (see §3.3.6) are obtained in this way, but, since emission of Auger electrons 
is limited to a depth of a few (< 5) nanometers (owing to the short mean free path of the 
low-energy Auger electrons) compared to a few micrometers of the characteristic X-rays, 
see fig. 7) with a much better depth resolution. Three-dimensional information can be 
obtained by consecutively sputtering off layers by argon (with sputtering rates in the 
order of micrometers per hour). Lateral resolution is also much below 1 /im, down to 
10 nm in favourable cases depending mainly on the spot size of the beam. In experimental 
microscopes, 5 nm resolution has been obtained more recently (Venables and Hembree 
[1991]), and a resolution in the order of 1 nm seems possible (Baker and Castle [1994]). 

In principle. Auger electron microscopy could be performed in the scanning electron 
microscope with an Auger electron detector attached. However, since ultrahigh vacuum 
is usually not available in an SEM, the scanning Auger microscopes (SAM) have become 
instruments on their own. Recent reviews are available which give a full account of the 
basic principle and the electronic systems of the SAM (Signorelli etal. [1988], Riviere 
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[1990], Baker and Castle [1994]). Sample preparation is not critical as long as clean 
surfaces are produced. This is possible by sputtering off any adsorbed layers and other 
contaminants inside the UHV chamber or by fracturing the sample in the specimen 
chamber. An extension is angular distribution Auger microscopy (ADAM) in which the 
angles at which the Auger electrons leave the surface are measured and analysed to 
produce images providing information on the structure of thin films, single crystals and 
monolayers (Frank and Hubbard [1993]). 

Applications of scanning Auger electron microscopy in physical metallurgy, micro- 
electronics and corrosion are manifold, particularly in combination with Auger electron 
spectroscopy (AES) which is discussed in detail in Chapter 13 (see also §6). The 
following case studies have been reviewed by Baker and Castle [1994]: pit formation 
at inclusions in steel and pit propagation laterally under the passivating oxide film 
(Baker [1993]), embrittlement by segregation (Seah [1975]), in-situ fracture of spray- 
formed aluminium alloys (Baker and Tsakiropoulos [1993]), wetting during vacuum 
brazing of a gold-nickel alloy (Brooker etal. [1984]), among others. With some of the 
present problems relating to difficulties in imaging insulating materials or rough surfaces 
solved, SAM will increase its potential for materials characterization owing to its 
sensitivity for chemical composition, good lateral and extremely high depth resolution. 

5.5. X-ray microscopy, topography and fluorescence 

The concept of X-ray microscopy was developed early, at the beginning of this 
century (see reviews by Cosslett and Nixon [1960] or Duke and Michette [1990]). 
The lack of sufficiently intense sources was overcome by the development of microfocus 
X-ray sources and plasma sources, and, particularly, by the availability of synchroton 
sources (Feder etal. [1977], Ash [1980], Duke [1981], Koch [1983], Michette [1990], 
Buckley and Rarback [1990]). Amplitude and phase contrast can be used for imaging 
(see, for example, Rudolph et al. [1990]). Scanning X-ray microscopes (Buckley and 
Rarback [1990]), devices for integration of the low-intensity projected or reflected 
X-rays by means of special TV cameras, microchannel plates or image intensifiers (Cheng 
et al. [1990]) and holographic techniques for three-dimensional imaging (Howells et al. 
[1990]) have been developed. Owing to the high absorption of X-rays in crystalline materials, 
applications of soft X-ray microscopy has been more or less limited to biological objects (for 
details and references see, for example, Duke and Michette [1990]). 

X-ray topography is a useful technique to study defects in single crystals such as 
dislocations, twins and stacking faults, subgrain boundaries, interfacial defects and 
strains, domain structures and other substructural entities (Tanner [1976], Hoche and 
BrOmmer [1980], Pangborn [1992]). The penetration depth of 5 pm and a lateral 
resolution of > 1 /im restricts application to relatively perfect crystals (defect density 
< 10 5 /cm 2 ) but owing to its high selectivity for different types of defects and their 
location (subgrain boundaries, stacking faults, structure of ferromagnetic domains, 
dislocations) X-ray topography has become a standard technique for monitoring crystal 
quality, especially in the semiconductor industry. Imaging uses a Bragg reflection, the 
intensity of which varies with local variations of orientation, lattice parameter and 
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structural factor. It is not possible to magnify the image directly, owing to the lack of 
X-ray lenses. High-resolution film and photographic magnification has been widely used, 
typical exposure times ranging from 10 min to 2 h with a 1 kW X-ray source. More 
recently, digital image storage and accumulation have become available, providing better 
resolution and higher speed. X-ray topography has been applied mainly in the following 
principal areas of research (Weissman et al. [1984], Pangborn [1992]): development 
and quality control of electronic devices, kinetics of transformation, crystallisation and 
crystal growth, and deformation and fracture. Classical studies are the investigation of the 
internal magnetic structure of nontransparent ferromagnetic crystals (Bowman and 
Booth [1971]), and the investigation of damage due to ion implantation (Tanner 
[1976], De Rhode and Smits [1981]). Of particular interest are in-situ dynamic studies 
like the movement of dislocations in a silicon single crystal under external stress (Chang 
et al. [1982]), the evolution of the plastic zone during crack propagation in a silicon 
crystal (Michot and George [1982]), and recrystallisation of deformed aluminium 
(Gastaldi and Jourdan [1984]). With portable X-ray generators, advanced detection 
systems and computer analysis of results, X-ray topography is also useful in nondestruct- 
ive evaluation of deformation states, microscopic flaws and cracks in parts outside the 
laboratory (Pangborn [1992]). 

Finally, fluorescence microscopy must be mentioned. If a fluorescing substance is 
irradiated by photons (X-rays or light of short wavelength, usually ultraviolet), some of 
the energy is re-emitted as light of longer wavelength which is typical for the substance. 
This effect is called fluorescence and is used in mineralogy for identification purposes 
and, after suitable staining with fluorescent substances, in biology (Rost [1981] and 
Axelrod et al. [ 1 983]). Extremely small amounts of fluorescent nonmetallic phases can 
be detected. However, since very few phases in metallic alloys are fluorescent, this 
technique is rarely used for materials characterization. 

5.6. Imaging by other types of spectroscopic information 

There is a very large number of spectroscopic techniques in which photons, electrons 
or ions interact with surface near regions of the sample and some kind of radiation is 
analysed (see § 6). In addition to the microscopic techniques discussed up to now, there 
are presently a few more using the signals reflected or emitted from the surface to form 
images. Examples are nuclear magnetic resonance (NMR) microscopy (Pettegrew 
[1990], Mallard [1993], Wachtman [1993]) presently only used in biomedical 
applications, or imaging in secondary ion mass spectroscopy (SIMS). No doubt several 
other microscopic techniques will be developed in the future. It will be interesting to see 
to what extend these techniques will be useful in physical metallurgy. 

6. Topochemical techniques and surface spectroscopy 

Surface analytical methods have become available to materials researchers for nearly 
three decades. Two of the techniques which have become most popular, ESCA (electron 
spectroscopy for chemical analysis) and AES (Auger electron spectroscopy) were 
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described in the late sixties (Siegbahn et al. [1967], Harris [1968]). The number of 
additional spectroscopic techniques for chemical analysis increases at an even higher rate 
than that of microscopies and often combinations of topochemical and imaging methods 
are realized in a single instrument. Excellent reviews and books are available which 
present the progress and state of the art of the fundamentals and the applications of 
photon, electron and ion probes as well as electric field probes (for example Carlson 
[1975], Czanderna [1975], Ibach [1977], McIntyre [1978], BrOmmer etal. [1980], 
Metals Handbook [1986], Fischmeister [1988], Sibilia [1988], Walls [1992], 
Riviere [1990], Briggs and Seah [1990, 1992], Lifshin [1990, 1994], ASM Handbook 
[1992], Cahn and Lifshin [1993], among many others). Some of the techniques based 
on these physical principles are of special interest to materials scientists and engineers 
while others are used only occasionally. No attempt will be made here to give a full 
coverage of these methods. Rather, a recent overview by Ortner and Wilhartitz 
[1991] based on an earlier compilation by Grasserbauer [1989] is used here to present 
the relevant information on the large variety of methods in a systematic way (Tables 4 
to 6). For references, the reader is referred to the books and reviews mentioned above. 
In addition, reviews on the application of surface analysis in materials science and 
technology, for example that by Fischmeister [1988], and the relevant chapters in the 
encyclopedia edited by Cahn and Lifshin [1993] and in Volumes 2 A and 2B of 
Materials Science and Technology edited by Lifshin [1992] and [1994], respectively, 
should be consulted. Combining the most effective techniques for characterizing the 
chemical and structural properties of materials (see fig. 15) will result in deeper insights 
into the nature of materials as well as improvements in the development of processing 
and properties of products needed for modem technologies. 



Fig. 15. Characterization of materials by chemical, spectroscopic and microscopic techniques (from Ortner and 
Wilhartitz [1991]). AAS = atomic absorption spectrometry, OES = optical emission spectrometry, XRF = X-ray 
fluorescence spectrometry, MS - mass spectrometry, XPD = X-ray diffraction, further acronyms see Tables 4 to 6). 


References: p. 1016. 









Resolution 


Table 4 

Ion probe techniques (from Ortner and Wilhartitz [1991]). 


Technique 

Lateral 

Depth 

Process 

SIMS 

(a) Static 

first monolayer 

Techniques based on sputtering 

Sputtering with Ar', O^, O", Ga', Cs' at 

(Secondary ion mass 
spectroscopy) 

[mm] 

(b) Dynamic 
500-1000 nm 

50 nm (with Cs') 
120 nm (with O) 

1-5 AL (atomic 
layers) 

0.001 to 0.1 rnn/s; TOF-SIMS (time- 
of-flight SIMS) 

As above, but with typically 1 nm/s 
Double focussing (mass spectrometry) 
with M/AM between 1000 and 100000 


LR1-S1MS 
(Laser resonance 
ionization SIMS) 

1000 nm 

1-5 AL 

Very complex instrumentation, single 
element at a time method; selective 
ionization of one atomic species in 
sputtered atom cloud by double or 
triple photon absorption. TOF-MS 
(time-of-fiight mass spectrometer) 

LNRI-SIMS 
(Laser non-resonant 
ionization SIMS) 

1000 nm 

1-5 AL 

Multiphoton ionization with intense 
pulsed UV-laser beam (2 10 9 W 
cm -2 ) in sputtered atom cloud 

SNMS 

(Sputtered neutrals 
mass spectrometry) 

1000 nm 

1-5 AL 

Secondary ionization of sputtered 
particles in HF-plasma above sample 
surface 

Quadruple MS-detection 

GDMS 

(Glow discharge mass 
spectrometry) 

[mmj 

10-100 nm 

Ar-glow discharge for sputtering. 

Ionization in negative glow region 
Detection with double-focussing mass 
spectrometer, M/AM = 1000-10000 


Information content 


Surface studies in catalysis, gas-surfacc reactions, 
polymer analysis, only method with trace and 
ultratrace detection power at high lateral and depth 
resolution (detection limit from ppt to high ppb-lcvels) 
Isotopic specificity 

High resolution and high density depth profile possible 

Imaging resolution 1 /um 

Three-dimensional distribution analysis 

Very high detection sensitivity at ppb- and sub-ppb- 

levels, e.g. 2 ppb i6 Fc in Si with consumation of 0,8 AL 

= 40 Fc atoms sputtered! 


As SIMS, but with detection limit presently still in 
/ug/g-range 

As SIMS, but with detection limit presently still in 
/ug/g-range 


Advantages more on bulk-ultratrace analysis side than on 
topochemical analysis side 

Detection limit in the ppt to ppb range for pin samples 
only! 




Technique 

Resolution 

Lateral 

Depth 

Process 

Information content 

GDOES 

(Glow discharge optical 
emission spectrometry) 

[mm] 

10-100 mu 

As for GDMS, but with detection of 
atomic emission in UV/VIS-rangc 

Techniaues based on scattering 

Detection limit in 1 ppm range 

Routine method for /cm layer characterization in 
metallurgy 

RBS (HEIS) 

(Rutherford back- 
scattering spectroscopy, 
or high energy ion 
scattering spectroscopy) 

|mmj 

2-3 nm 

Measurements 
up to 1000 nm 

a or 4 He*-scattering E„: 1-2 MeV 
Measurement of energy distribution of 
back-scattered or or 4 He‘-ions 

Qualitative and quantitative elemental analysis; depth 
profiling by deconvolution of measured energy 
distribution with energy loss functions possible, 
however, generally poorer depth resolution than SIMS, 
but non-destructive depth profiling! 

Additional structural information by channeling and 
blocking phenomena: study of formation of epitaxial 
films 

MEIS 

(Medium energy ion 
scattering spectroscopy) 

(mm] 

0.3-0.4 nm 

200-600 keV 

Same as above 

ISS (LE1S) 

(Ion scattering spectro- 
scopy, or low energy 
ion scattering 
spectroscopy) 

[mm] 

First monolayer 
analysis 

500-600 eV He', Nc* 

Same as above. Adsorption results 

ERD 

(Elastic recoil 
detection) 

[mm| 

First monolayer 
up to some 

ftm 

Elastic recoil detection of forward 
scattered light target atoms by 
bombardment with typically 3 MeV 

4 He” up to 30 MeV Si* 

Depth profiling for ’H and 2 H with a, or H, C, N, O 
with Si* 





Electron probe techniques 


Technique 

EPMA 

(EPXMA) 

(Electron probe 
microanalysis, or 
electron probe X-ray 
micro-analysis) 


Resolution 


Primary electron-beam: 

5-50 keV, diameter J-J00 nm 
Excited volume: 1-5/tm 3 
(pear shaped) 

1-5 /im 


EELS Resolution same as as for X-ray 

(Electron energy loss nano-analysis 

spectrometry) Detection limit: - 3-5% 

accuracy: ~ 10-20% relative 
Especially suitable for light elements 
due to lacking sensitivity of EDS. 

High electron yields with electron 
spectrometers beneath thin specimen. 

However low signal/noise ratios and 
high background due to multiple 
elastic scattering. 

HREELS poor monolayers 

(High resolution electron (mm to cm) 
energy loss 
spectrometry) 



Information content 


Process 

Scanning primary electron-beam excites 
characteristic X-rays in fiat, polished 
samples. With conventional detectors the 
elements Na-U, with windowless 
proportional flow counters the elements B 
to U can be detected 


In connection with transmission electron 
microscope 


Imaging possible by beam scanning 
(STEM) or stigmatic electron spectrometer 


Monoenergetic beam focussed onto the 
surface of a solid, measurement of reflected 
electrons 

Two electron spectrometers necessary (one 
for monochromatizalion of primary electron 
beam, typical £ o =500 meV, and one for 
energy analysis of reflected electrons) 


Quantitative topochemical analysis: WDS 
(wavelength dispersive spectroscopy): 10-100 
/ig/g 

EDS (energy dispersive spectroscopy): 100-5000 

/»g/g 

Optimized for quantitative X-ray analysis after 
correction for absorption and enhancement 
(computer programmes) by use of standards 
possible with an accuracy of ~ 1% rel., 
standardless with an accuracy of 3-10% rel. as a 
function of composition 
Qualitative and semi -qualitative information of 
elemental composition 

Position and structure of absorption edges of 
valence band or near valence band core levels 
contain bonding information 
Fine structure evaluation in combination with 
high spectral resolution (0.1 eV at 100 keV) 
allows to gain stoichiometric information for 
extremely small domains 

Sensitive detection of structural changes in nano 
domains possible by study of position and width 
of plasmon peaks 

Useful for studies of adsorbates on solid surfaces 
(surface coverage with adsorbates, information 
on bonding geometry, studies of catalytic 
phenomena, thin film characterization) 



Technique 

AES 

(Auger electron 
spectrometry) 


Resolution 


Primary electron beam: 

1-10 keV 
poor 

(low signal to noise-ratio, very high 
continuum electron background due to 
multiple scattering requires 
registration of 1st derivative of 
spectrum, hut very high surface 
sensitivity) 


THEED 

(Transmission high 
energy electron 
diffraction) 


LEED 

(Low energy electron 
diffraction) 


A =0,1 nm at 150 eV 
A =0.01 nm at 15 kcV 
(High energy of primary electron 
beam yields high resolution of 
imaging) 

Minimum phase size for structure 
analysis of precipitates: ~ 20 nm 
Primary electron beam 
1-5 keV 
poor 


monolayers 


RHEHD 

(Reflected high energy 
electron diffraction) 


Primary electron beam 
> 10 keV 


XNA 

(X*ray nano-analysis) 


5-30 nm at beam diameter 3nm and 
sample thickness 100-20 nm 
Absolute detection limit: ID' 20 g 
Elemental distribution imaging: 
lateral resolution ca. 1 nm (signal 
enhancement by digital image 
processing) 


Process 


Information content 


Measurement of kinetic energy of emitted 

Auger-electrons 

UHV of 10 1U nibar mandatory 

In situ sample preparation: fracturing, 

annealing; thin film deposition 


Electron diffraction (coherent interference 
effects) 


Diffraction at solid surfaces 
Measurement of reflected electrons of a 
mono-energetic PE-beam on a fluorescent 
screen 


In connection with TEM 

Large area detectors (EDS) in small 

distance from specimen 

Principal disadvantages are low collection 

yields (10 -2 to 10 -4 ) and low X-ray yield for 

light elements (B, C, N, O, ...) 


Micro surface analysis (elemental identification, 
quantative elemental analysis, direct compound 
identification by evaluation of chemical shifts 
and line shapes possible, depth profiling in 
combination with A f* -sputtering) 

Relative detection limit depends on beam 
diameter and current density, c.g. 10-20 mass% 
at 100 nm and InA, 1-2 mass% at 1 /im and 1 
nA or 100 nm and 100 nA, or 0.1 -0.2 mass% at 
l/im and 100 nA 
Imaging resolution 200 nm 

TEM diffraction patterns (point patterns for 
monocry stal line specimens, ring patterns for 
poly crystal line specimens, stnpe formed patterns 
for large grained or textured specimens, and 
diffuse circular halos for amorphous materials) 
useful for determination of lattice constants and 
identification of phases 

LEED patterns reveal the periodicity of atoms of 
the surface and the overall symmetry, but not the 
individual atomic position (periodicity is needed 
for reflection) 

Useful for determination of crystalline order of 
the surface of a single crystal or study of 
adsorption and segregation processes on surfaces 
or overiayers 

Important for the identification of precipitates at 
grain boundaries or interfaces and for sub-/xm 
phases and inclusions 



Table 6 

Photon probe techniques (from Ortner and Wilhartitz [1991]). 


Technique 

Resolution 

Lateral 

Depth 

Process 

Information content 

LRMA 
(Laser Raman 
microanalysis) 

l /zm 

0.1 /zm 

Inelastic photon scattering (Stokes and anti- 
Stokcs) 

Molecular and structural microprobc; complementary 
info to IR about functional group vibrations, 
evaluation of polarization effects 

SERS 

(Surface enhanced Raman 
spectroscopy) 

1 pl m 

0.1 /zm 

As above 

As above, but enhancement up to KT; limited to Ag, 
Cu, Au as substrates 

NIR-FT-RS 

1 /zm 

0. 1 /zm 

As above, but near IR 

As above, but less fluorescence interference 


(Near infrared Fourier 
transform Raman 
spectrometry) 


FT-IR-M 

(Fourier transform infrared 
microscopy) 

10 /zm 

1-10 /zm 

IR-absorption with interferometric recording 
of spectra 

IR (0.75-1000 /zm) absorption by excitation of 
vibrational and rotational states 

Molecular and functional group information 

ATR-IR-S 

(Attenuated total reflection 
infrared spectrometry) 

Several mm 

1-10 /zm 

Internal IR-reflcction 

As above: SA of polymers, catalysis, electro -chemical 
reactions. By use of polarized light: information on 
orientation of adsorbates on surfaces 

IRRAS 

(Infrared reflectance 
absorption spectrometry) 

mm 

1-10 /zm 

Angle resolved IR reflectance 

measurements 

Determination of complex optical properties and 
identification of chemisorbed species 

EXAFS 

(Extended X-ray 
absorption fine structure 
spectrometry) 

ca. 20 /zm 

/zm 

Absorption of monochromatic X-rays 
(synchrotron radiation), several keV 

"Local atomic structure probe" 

Measurement of atomic distances between absorbing 
atom and neighbours (range 0.6 nm) 

SEXAFS 
(Surface EXAFS) 

ca. 20 /zm 

I nm 

As above, but up to 1 keV 

Accurate determinations of bond lengths of adsorbates, 
surface molecular studies. The polarization of 
synchrotron radiation allows to probe atomic 
distances in specific directions 

NEXAFS 
(Near edge X-ray 
absorption fine structure 
spectrometry) 

ca. 20 fi m 

/zm 

As above, but keV 

Determination of valence states of very fine 
precipitates, not measurable by TEM: studies of 
crystal growth, nucleation, formation of interfaces 





Resolution 



Technique 

Lateral 

Depth 

Process 

ESCA 

Electron spectrometry for 
chemical analysis 

XPS 

(X-ray photoelectron 
spectroscopy) 

100 /im (crystal focussing) 

< lO^m 

(ESCASCOPE of VG, FT-)ens) 

Typically 5-10 
atomic layers 

Two techniques, XPS and UPS, see below 

Absorption of monochromatic X-rays and 
measurement of the kinetic energy of 
ejected pholo-clcctrons 

UPS 

(Ultraviolet photoelcciron 
spectroscopy) 

In the order of mm 

Several atomic 
layers 

Absorption of monochromatic UV-radiation 
and measurement of the kinetic energy of 
ejected photoclectrons. (Excitation with 
noble gas discharge lamps or synchrotron 
radiation) 

ARUPS 

(Angle resolved ultraviolet 

photoelectron 

spectroscopy) 

In the order of mm 

Single hrst mono- 
layers 

Same as for UPS, structural sensitivity 
enhancement by use of polarized radiation 

LAMMS 

(Laser micromass 
spectroscopy) 

NTLD: 1-2 ^m 

TVLE: 1-2 

Several monolayers 

In the order of /*m 

Non-thermal laser desorption 
(NTLD < 10* W/cin 7 ) 

Thermal volume laser evaporation 


(TVLE > 10* W/cm 2 ) 

Detection by TOF-MS (time of night mass 
spectroscopy) 


Information content 


Qualitative and quantitative elemental analysis of 
surfaces (Z> I; sensitivity limit ea. 0.1 mass% but 
0.01 monolayers! plus information on chemical 
bonding (by chemical shifts of binding energies) and 
binding energies. Depth profiles with sputtering 
installation possible 
Imaging resolution < 10 ^m 

Study of electronic structures (of the valence band of 
solids or of bonding electrons in gas-molecules 
(even vibrational excited states can be resolved) 
Extremely good vaccuum (p£ 10 - " mbar) necessary. 
Study of chemisorption phenomena, adsorption, 
desorption (esp. with respect to catalysis) 
Measurement of valence band emission spectra for 
"density of state"-calculations: very weak adsorption 
phenomena, e.g., of noble gases. Variation of 
detector angles allows to exploit directional effects 
in photoclectron -emission 

Molecular information from adsorbed or chemisorbed 
species or from surface contaminants 

Microanalysis for elements, some bonding and 
molecular information from fragmentation patterns. 
Detection limit in the order of l-10^ig/g 
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7. Quantitative interpretation of microstructural geometry 

Quantitative methods for characterizing the geometry of microstructures are an 
important means for understanding the relationship between production parameters and 
behaviour of technical materials in application as well as for providing the experimental 
basis for theoretical studies of microstructural genesis and its effect on mechanical and 
physical properties. The important elements of quantitative microstructural analysis (also 
termed quantitative metallography) are image analysis, i.e., the quantification of features 
visible in metallographic cross-sections or transmission images; stereology, i.e., estima- 
tion of three-dimensional geometry from two-dimensional image data; and the interpreta- 
tion of the three-dimensional parameters. 

Generally, more than one single geometric aspect is of relevance in characterizing the 
microstructural geometry of multiphase materials, and a combination of parameters will 
be needed to give a full description. Such combinations have been proposed for studying 
the evolution of microstructure or for empirical correlations with mechanical properties. 
A large number of techniques have been developed and employed for characterizing 
microstructural geometry. Table 7 shows an overview over important analysis methods. 
This section aims to give a short introduction to techniques of image analysis and 
stereology, focussing on applications to problems relevant in materials science. 

7.1. Image analysis 

Simple comparison of microstructure with standard test charts is extensively used in 
quality control of materials (for example, ASTM standards for determination of grain 
size or of slag inclusions). The accuracy of these comparison methods is rather limited 
compared with counting techniques (Exner [1969b, 1994] and Exner and Hougardy 
[1988]). These semi-quantitative methods should not be used for research work and will 
not be discussed any further here. 

When scanning an image for collecting data, counting with point and line grids, 
measurements along straight lines, or area measurements can be used. Each of these 
strategies has its merits, according to the purpose of the investigation (parameters to be 
determined) and available instrumentation (manual, semi-automatic or fully automatic). 

The manual acquisition of image data is carried out directly at the microscope, using 
inserts in the ocular or ground plates with arrays of points or lines, or with micrographs 
and suitable grids or rulers (Gahm [1971], Exner and Hougardy [1988] or Exner 
[1994]). Figure 16a shows a typical point grid superimposed over the microstructure of 
a two-phase material. The number of points falling within the black phase and the 
number of intersections between the outer circle and the perimeter lines of the black 
phase are counted (yielding, as discussed below, estimates for volume fraction of the 
black phase and its area of interface with the white phase). Figure 16b shows linear 
analysis of the grain size in a single-phase material. A number of fields must be 
evaluated in this way in order to get statistically significant numbers. Simple electronic 
counters and step stages are useful means to speed up these counting procedures. More 
complex data (size distribution and elongation of planar features) can be determined by 


References: p. 1016. 



(a) 




Fig. 16. Topical grid-, line- and graticule-counting procedures in manual image analysis: (a) point-counting for 
volume fraction analysis (circles can be used for interface density measurement, see text); (b) line-counting for 
determination of mean linear grain size of a single-phase material (numbers on the right margin show number 
of grain boundaries intersected); (c) graticule-counting for size distribution measurements (area of circles and 
ellipses of varying aspect ratio increase by a factor of two in each higher class). 
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Table 7 

Techniques for (he geometric characterization of three-dimensional microstructures and 

two-dimensional images (“fingerprinting'’). 

3D Description 2D Description 


Stereology 1. 

Global parameters 

(volume-, surface- and line densities, 
orientation) 

Distributions (size, shape) 

Stereometry 2. 

Nonplanar surfaces (fracture surfaces, 
deep-etched interfaces), 

Transparent materials (3D microscopy, 

TEM foils) 

Serial sectioning 3. 

Topological parameters (number den- 
sity. genus) 

Shape and arrangement 

Model structures (simple shapes like 4. 

spheres, cubes) 

Regular lattices, statistical 
arrangements 

Computer generated structures 5. 

(dendritic patterns, fractal structures 

etc.) 

Point processes and mosaiques 


Image comparison 
Standard charts, graticules 


Simple geometric parameters 
Perimeter, area, shape factor, size and 
shape distribution 


Mathematical morphology 
Manipulation of digital images (open- 
ing, closing, skeletonizing) 
Morphological parameters 
Complex special techniques 
Fourier analysis (periodicity) 

Fractal analysis (selfsimilarity) 

Pair correlation functions, graph theory 
(arrangement) 

Geometric abstraction 
Circles, hexagon, planar lattices (ana- 
lytical modelling) 

Poisson point process, Voronoi 
mosaique (computer modelling) 


simple graticules, an example of which is shown in fig. 16c. Manual methods for 
microstructural analysis have been reviewed in detail by Fischmeister [1965, 1981], 
Gahn [1971], Exner [1972a, 1994], Underwood [1985] or Exner and Hougardy 
[1988], Exact procedures for practical assessment of the most important stereological 
parameters (volume fraction, interface density, size and distance, orientation, elongation, 
proximity and contiguity, see below) have been worked out recently by a committee of 
the German Society for Materials including estimates of experimental accuracy (Exner 
[1993]). 

The first step towards automation was realized in lineal scanners which were popular 
during the sixties and seventies (Exner [1972a] and Gahm [1975]). More recently, semi- 
automatic instruments have been provided with digitizers as input device to a calculator 
or computer (see, for example, Hillje and Redmann [1976], Exner [1978b] and Exner 
and Hougardy [1988]). Digitizers are tablets by which the exact coordinates of points 
touched with a special pencil are registered at a high rate (up to 10 4 per second) with 
high precision (0.1 mm). The operator indicates the intersections of the scanning line 
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with the perimeter or outlines the perimeters of the microstructural features using his 
ability to differentiate between features of different kind (composition, structure) on the 
basis of grey or colour shading and of shape. Thus, the operator keeps steady track of 
what is measured and is able to exclude artefacts by simple judgement. Modem 
instruments compute all relevant microstructural parameters from these data (see, for 
example, Russ [1995]). Disadvantages are the high time-expenditure and concentration 
required, in addition to some danger of introducing human bias. Nevertheless, this type 
of instruments is often the best choice for practical work due to their reliability and 
versatility. 

Fully automatic instruments are available with a wide efficiency range (Fischmeister 
[1981] and Exner and Hougardy [1988]). Most of them use videocamera (CCD) type 
scanning modules with electronic phase detection. Simple low-price instruments allow 
determination of a limited number of parameters for high-contrast situations. High priced 
instruments, usually modular devices with fast hardware modules or, more recently, 
flexible software modules and full grey-level image-storage with up to 1024 x 1024 
picture-point lateral resolution and 256 steps of grey-level resolution combine high speed, 
versatility and convenience for any type of automatic image data acquisition. Scaling, 
shading correction, scanning-table steering and focussing are all carried out automatical- 
ly, and all kinds of image manipulation (filtering, edge contrast enhancement, halo 
correction, etc. and image modification techniques based on mathematical morphology 
the principles of which are outlined in § 7.3 below, etc.) can be implemented if required. 
Figure 17a shows the grey-level histogram and fig. 17b demonstrates the effect of a 
special filter (pseudoplast filter) on the appearance of a light micrograph of a three-phase 
alloy. Phase discrimination (either by dedicated hardware modules or by software 
algorithms), parameter selection and evaluation, statistical treatments and many other 
routines can be activated by push-button commands or by a simple computer program 
designed individually for any specific problem. As an example, fig. 17c shows the 
appearance of a microstructural evaluation on the monitor or screen of an automatic 
image analyzer. “Interactive” devices — usually using a mouse or a light-pen to be used 
on the display monitor — allow the operator to use his judgment for excluding artefacts 
or adding missing details (e.g., parts of a grain-boundary network). 

In principle, any image-forming instrument can be used as input device; only light 
microscopes and scanning electron microscopes have been used in on-line configurations. 
Transmission electron micrographs and other printed images have been evaluated only 
occasionally for solving problems in materials science and technology. Software for 
automatic image analysis is now offered with most highly computerized scanning 
instruments mentioned in this chapter, particularly with energy dispersive X-ray analysers 
(electron microprobe). The combination of image analysis and energy-dispersive X-ray 
analysis was used for determining geometric parameters and composition of individual 
particles simultaneously (Ekelund and Hertzmann [1981]). 

Instrumentation for image analysis has developed rapidly over the past two decades 
(for historical and recent reviews see Fischmeister [1965], Gahm [1971], Exner 
[1972a], Hougardy [1976], Fischmeister [1981], Exner and Hougardy [1988], or 
Exner [1994]) and is still advancing at a rapid rate. In this context, the dramatic 


References: p. 1016. 
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Fig. 17. Typical procedures of television-equipped fully automatic image analyzers: (a) Microscopic image of 
a three-phase copper-cadmium alloy with superimposed grey-level distribution; (b) Pseudoplast filter producing 
a plastic appearance by showing gradient variation only; (c) Size-distribution analysis of nonmetallic inclusions 
in a dirty steel with superimposed counting flags indicating proper detection and area distribution histogram. 


developments in digital image processing must be mentioned, which not only are useful 
for quantitative image analysis but also for contrast enhancement and image restoration 
(Russ [1990], Jahne [1993] or Talbot [1994]). The state of the art has now reached a 
level which allows only minor improvements with respect to practical applications. 
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Though recent developments (e.g., processing of real colours and grey levels) may widen 
the scope to the treatment of even more complex problems in image analysis, the present 
rapid decrease of price levels at a given performance is certainly the best reason for a 
much more general utilisation of automatic image analysers. 

7.2. Planar characteristics and stereology 

Image analysis yields a number of geometric characteristics describing the geometry 
of planar images of cross-sections or thin-film projections, the most important of which 
are listed in table 7. They usually can be qualitatively interpreted in terms of the three- 
dimensional microstructures from which they originate. Quantitative three-dimensional 
characteristics are obtained by a body of mathematical methods based on integral 
geometry and statistics which is called stereology. There is an enormous number of 
original papers and formulae spread over the scientific literature of fields as far apart as 
mathematics, geology, materials science and biology. Useful monographs and standard 
textbooks (De Hoff and Rhines [1968], Underwood [1970], Saltykov [1974], Weibel 
[1979, 1980], Rhines [1986], Russ [1986, 1990, 1992], Exner and Hougardy [1988] 
or Coster and ChermaNT [1989]) review the literature and should be consulted 
whenever a quantitative evaluation of spatial microstructures is attempted, in order to 
avoid misinterpretations and redundant derivations. A few of the more important aspects 
of microstructural geometry (table 8), their experimental evaluation from surface images 
and some applications in physical metallurgy are discussed in the following sections. 

Standard nomenclature for the stereological terms as outlined in the classical 
textbooks by De Hoff and Rhines [1968], Underwood [1970] and Weibel [1980], see 
also Exner [1994] and as standardized by the International Society for Stereology 
(Exner [1987]) is used in the following and is recommended for general use. 


7.2.1. Volume-fraction analysis 

Simple plausibility proofs as well as sophisticated mathematical treatments show that, 
for a homogeneously dispersed phase, area fraction, lineal fraction and point fraction are 
statistically unbiased estimates for the volume fraction, i.e., in standard nomenclature 
(Underwood [1970], Weibel [1980] and Exner and Hougardy [1988]): 

V v = A a =L l = P p . (4) 

The experimental error of this estimate decreases with an increasing number of points 
counted or lines and areas measured and, at a 5% level of error probability, for point- 
counting using a wide-spaced regular point grid, is approximately (Hilliard [1976]): 

±AV = 2[P P (1 - P ? )/Pf\ (5) 

where P is the total number of points of the test grid. A more stringent evaluation of 
errors includes the variation of area fractions between different test fields (Nicholson 
[1978]). 

Volume fraction has been the parameter most frequently used in quality control of 
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Table 8 

Some important geometric characteristics of planar images and three-dimensional microstructural 

features 


Planar characteristic h 

Example 

Corresponding three- 
dimensional characteristic ‘ 

Microsiructural example 

Area fraction, A a 

Particle intersects 

♦Volume fraction, V v 

Amounts of phases in multi- 
phase material 

Line density, L A 

Intersect perimeters 

♦Surface fraction, S v 

Grain -boundary or phase- 
interface area 

Point density, P A 

Triple points in 
intersected grain- 
boundary network 

♦Line density, Ly* 

Length of dislocation lines or 
grain -boundary triple lines 

Number density, N A 

Number of intersected 
particles 

Numerical density, N v 

Number of particles 

Distribution by area 

m 

Intersect area 

distribution 

Distribution by number 
and linear size 

Number of spherical panicles as 
a function of diameter 

Distribution by length 

M 

Chord length 
distribution 

Distribution by volume /v) 

Number of grains as a function 
of volume 

Mean size 

Mean intersect area or 
mean linear intercept 

♦Mean linear size L 

Mean volume, V 

Mean linear grain size 

Mean particle volume 

Distances 

Nearest -neighbour 
distance 

♦Mean linear distance D 

Nearest-neighbor distance 

Surface-to-surface distance of 
particles in one direction 

Distance of panicles in 
dispersion- hardened materials 

Shape characteristics 

Area/perimeter ratio, 
lenglh/width ratio 

Shape parameters 

Shape changes of panicles or 
pores in processing 

Planar curvature 

Local and mean 

curvature 

♦Mean curvature 

Curvature of pore-solid 
interfaces 

Arrangement in plane 

Random, regular, 
clustered 

Arrangement in space 

Connectivity 

♦Contiguity 

Clustering of carbides in high- 
speed steel 

Number of closed loops in the 
pore space of a sintered material 
Amount of surface shared with 
other grains of the same phase in 
a multiphase material 

Orientation 

Preferred orientation of 

intersection lines 

♦Orientation of grain 
boundaries and interfaces 

Anisotropy of pore/solid 
interface, grain boundary area in 
rolled sheets or drawn wires 


* Three-dimensional parameters marked with an asterisk can be calculated from data obtained with planar 
images without any assumptions but randomness of sampling. All other three-dimensional characteristics are 
restricted to model geometries, to known shapes or known size distribution or both, or can only be assessed 
by three-dimensional measurements (stereometry, serial sectioning or chemical desintegration). 
b Subscript A indicates that quantity is per unit area. 
c Subscript V indicates that quantity is per unit volume. 
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materials and has found many useful applications in materials research. It is obviously 
closely related to composition and exerts a decisive effect on the properties of multiphase 
materials. In equilibrium phase diagrams, the lever rule allows phase boundaries to be 
determined to a high accuracy with a minimum number of samples. For example, 
Linden [1972] has redetermined the two-phase region in the copper-aluminium system, 
and Aldinger [1969] has shown excellent agreement of results obtained by classical and 
by image-analyzing methods in the four-phase region of the Ag-Cu-Cd-Zn diagram. 
Modem phase-diagram studies employ computer calculations to a high degree which are 
facilitated if accurate quantitative data are available. 

A typical field for volume-fraction analysis is the determination of nonmetallic 
inclusions, of carbides, and of the time-temperature dependence of phase transformations 
in steels. Figure 18 shows a typical isothermal time-temperature-transformation (TTT) 
diagram for a chrome-molybdenum steel which makes it possible to read not only the 
nature of phases obtained after a certain holding time at constant temperature but also the 
fraction of austenite transformed (Lafond and Moliexe [1975]). Studies of transfor- 
mation kinetics of discontinuous precipitation or recrystallization are further examples. 
A very interesting aspect of volume-fraction analysis is the possibility of determining 
local composition, which is straightforward if the equilibrium compositions of the 
individual phases present in the microstructure are known. Thus, segregation and 
inhomogeneity phenomena and other systematic compositional variations can be 
quantitatively determined, sometimes with as good or better accuracy than by micro- 
chemical methods. 

Good examples can be found in the investigation of surface and heat treatment of 
steels, e.g., the analysis of the carbide distribution in the surface of a hardened steel part 



Time 

Fig. 18. Isothermal time temperature transformation diagram for a chromium-molybdenum steel. (A = austenite. 
F = ferrite, C = carbide. M = martensite). The heavy lines give start and end of a reaction, the numbers at the 
thin curves give the volume fraction of austenite transformed. (After Laeond and Moliexe 1 1975].) 
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(Fischmeister [1972]), or the scanning electron microscopy study of the influence of 
heat treatment on the composition of the matrix in high-speed steels (Fischmeister et al. 
[1986], Karagoz and Fischmeister [1987]). 

7.2.2. Interface density 

Dispersity of a microstructural component (isolated particles as well as complex- 
shaped continuous or semi-continuous phases) can best be quantitatively described with 
a single number by measuring the density of its interface area per unit volume. Stereo- 
logy (Saltykov [1974], Underwood [1970] and Weibel [1980]) tells us that: 

S w = 2 P L , (6) 

where S v is the interface area per unit volume of the material and P L is the number of 
intersections (per unit length) between a scanning line and the intersections of a plane 
cross-section with the interface as seen in a metallographic image. If we relate the 
interface of a microstructural component (phase a) to the volume of this particular 
component, we get the specific interface density of the a-phase: 

5“ = S v /V w = 4 N a /L a = 2 PjP r , (7) 

where L a is the length of the line inside the a-phase (L„ = LV y , L is the total length of the 
scanning line) and N a is the number of a-regions transversed by the scanning line. Of 
course, the density of grain-boundary area can be determined in the same way. 

An analysis of statistical errors shows (Hilliard [1976]) that the standard error, A S v , 
is approximately 

AS V = kSy/yfP, (8) 

where P is the total number of intersections and k is approximately 1.2 for contiguous 
grains and 4 for randomly dispersed particles. The range S V ±AS V corresponds to 5% 
error probability (95% significance level). 

Prominent examples of application of surface-density measurements are found in 
powder metallurgy, where the pore-solid interface provides the driving force for sintering 
processes (ch. 33), in the study of coarsening processes (grain growth and particle 
growth, ch. 28) and in finding correlations between microstructure and properties. Figures 
19a and b show two simple examples of the latter, where linear relationships are 
observed between the hardness of various metals and their grain-boundary density 
(Saltykov [1974], Rhines [1976, 1986] and Ondracek [1978]) and between the 
coercive force and the interface per unit volume of magnetic cobalt phase in WC-Co 
hard alloys used in cutting and wear applications (Fischmeister and Exner [1966]). 

The concept of surface density is less popular for characterizing microstructural 
geometry in physical metallurgy than might have been expected considering its versatility 
combined with extreme ease of evaluation. While for the separate determination of 
number and size of particles simplifying assumptions about shape are necessary (see 
below), the only requirement for obtaining significant numbers for S v is statistical 
sampling, while shape effects as well as variations of the size of the individual features 
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Specific surface of Co-binder, m 2 /cm 3 


Fig. 19. Simple relationships between properties and microstructural geometry: (a) hardness of some metals as 
a function of grain-boundary density (data from Saltykov [1974] and Rhines [1976]); (b) coercive force of 
the cobalt phase in tungsten carbide-cobalt hardmetals as a function of interface density (after Fischmeister 
and Exner [1966]). 


have no influence. Surface densities can also be used for describing the contiguity, i.e., 
the degree to which grains of a phase are connected in space or the degree of preferred 
orientation (see below). In modelling microstructure-property relationships or processes 
of microstructural genesis during casting, heat-treatment, deformation, etc., interface 
densities are most useful quantities for averaging the complex geometric appearance of 
phases in most technical materials. The use of interface densities (instead of size) was 
suggested in theoretical modelling of coarsening by Exner and Fischmeister [1966a] 
and by Glicksman etal. [1992], of sintering by Aigeltinger and Exner [1977], Exner 
and Giess [1989] and Liu and Patterson [1993], and of recrystallisation by De Hoff 
[1972, 1989]. The latter author developed the framework for stereological theories of 
kinetic and dynamic evolution of microstructures (De Hoff [1972]). 

7.2.3. Size and distance 

In contrast to volume fraction and surface density or specific surface, three-dimen- 
sional size and separation (distance) are not related to two-dimensional data in a simple 
manner. Only if the shape of an object is known can its size be defined by a single 
parameter. The assessment of nearest-neighbour distances requires knowledge of the 
spatial arrangement of objects, which is often not available from planar images. Thus, 
models are needed to estimate size and distance and their distribution. 

The most useful model relates to specific surface and surface density. The size and 
distance can be described by the average linear distances between the surface points 
inside (mean linear size) and outside (mean linear distance) the volume covered by a 
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phase a. For multiphase materials we get the mean linear intercept ( mean linear particle 
size for a dispersed phase): 


K = K/K 



( 9 ) 


and the mean linear distance between the surface elements of phase a: 


D„ 


= {L-L a )/N a 


= 4 


l~Vy 

S v 


( 10 ) 


For space-filling grains (V v = 1), the mean linear grain size (mean linear grain intercept, 
Heyn’s grain size) is: 

L = 1/N l = 2S w . (11) 

This concept makes it possible to characterize size and distance independently of shape, 
arrangement, or size distribution. Mean intersect area (Jeffries’ grain size) A is an 
indirect measure related to the length of grain edges per unit volume (Fischmeister and 
Arzt [1981]). A has no concise meaning with respect to the size of isolated particles if 
shape is not uniform (and simple). Rather, as can be shown using derivations by De 
Hoff [1967, 1980] and Cahn [1967], see also Exner [1994], there is a direct correlation 
between the mean intersect area and the integral mean curvature C v 

C v = 2t rlV A = 2t tV w /A (12) 

where N A is the number of intersect area per unit area of the cross section. C v thus is 
another important global stereological quantity useful for describing microstructural 
changes (see fig. 21 below, and Exner [1994] for references). 

There are numerous other measures for characterising mean size and/or distance. For 
example, a measure of size frequently used is R = 3L/4, derived by simply relating the 
radius of a sphere to its mean linear intercept. It is obvious that the result obtained does 
not give much more useful information than do L or S v since R is simply the radius of 
equally-sized spheres with the same specific surface and volume fraction (but not average 
volume or number) as the (irregular) particles or grains. For space-filling polyhedra, 
similar size measures can be derived (Thompson [1972]) which, however, are similarly 
restricted in describing the geometry of real grain structures. Other size parameters 
discussed in literature (for references and derivations see Rhines and De Hoff [1968], 
Thompson [1972], Underwood [1970], Weibel [1980], Fischmeister and Arzt 
[1981], Coster and Chermant [1989] or Ohser and Lorz [1994]) rely on well-defined 
geometries as well. The assumptions made are usually clearly stated in the original 
papers but often neglected by users of the final results, which may lead to erroneous 
conclusions. 

Measures of distance other than that defined in eq. (10) usually are based on model 
assumptions concerning the spatial arrangement. For example, nearest-neighbour 
distances in three dimensions can be calculated from two-dimensional counts if the shape 
and size of all particles is identical and if the volume fraction is not too high (see, for 
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example, Bansal and Ardell [1972]) or if parallel platelets are considered (Roosz et 
al. [1980] and TOrrOnen [1980]). However, if the nearest-neighbour distances of 
irregularly shaped and distributed particles is to be estimated, no straightforward methods 
are available. As discussed recently by Ohser and Lorz [1994], the transformation of 
pair correlation functions (see e.g., Stoyan etal. [1987]) from two to three dimensions 
seems to be a possibility which, however, may prove too complicated for practical 
applications. Thus, computer simulations of sections through three-dimensional arrange- 
ments (for example based on the Poisson-Voronoi mosaique (see Ohser and Lorz 
[1994]) may be a viable alternative to serial sectioning. 

The situation becomes even more complex when information on the three-dimen- 
sional size distributions of grains or particles is required. Direct stereological procedures 
to derive the 3D distributions from distributions of linear intercepts or from intercept 
areas are available for idealised models in which the assumption is made that all features 
have constant shape and that this shape is extremely simple (sphere, cube, ellipsoid). 
Since this situation is rare or at least not typical in real microstructures, these models are 
not useful in most practical cases. The same is true for other analytical approaches. 

For example. King [1982] has derived a method for assessing three-dimensional size 
distributions (cumulative particle mesh size distribution) for irregular particles from 
planar data, i.e., area distribution of sections or projections, or from linear intercept 
distributions. No a priori assumptions on regular geometric shape, convexity, congruency 
or particular functional form of the size distributions are made in this widely neglected 
paper. However, a shape-dependent constant must be known or determined experimental- 
ly for a uniformly sized sample. Due to this complication, this or other unfolding 
procedures have not found much interest outside the stereological literature. Nevertheless, 
solutions to the so-called “tomato salad problem” have been published continuously over 
the years. A recent approach concerns polyhedric grains and may prove useful under 
some conditions for grain size characterization in single-phase materials (Zhao and Chen 
[1993], Ohser and MOcklich [1995]). As a general rule, rather than standard text books 
or original papers, special reviews (e.g., Exner [1972b], Simpson and Standish [1977], 
Cruz-Orive [1978], Fischmeistbr and Arzt [1981], Exner [1994] or Osher and 
Lorz [1994]) should be consulted for representative conversion procedures, their 
applicability and restrictions, and the related derivations should be followed up before 
deriving (or even worse, publishing) more of the already abundant solutions. 

In reality, microstructural elements vary not only in size but also in shape. The shape 
variation may be determined by growth processes and is often limited by crystallographic 
conditions. In such cases, simplifying assumptions may be justified and special solutions 
may be found to solve a particular problem. An interesting example was published 
recently (MOcklich etal. [1994], Ohser and Lorz [1994]) where a numerical solution 
was presented for establishing the size and shape distribution of hexagonal prisms as 
present in silicon nitride from the size and shape of the intersect areas. 

In view of these difficulties, serial sectioning has been used for establishing three- 
dimensional grain and particle size distributions. Alternatively, chemical disintegration or 
removing of the embedding matrix can be applied to separate grains (e.g., by attacking 
the grain boundaries of aluminium by liquid gallium as shown by Rhines and Patter- 
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son [1982]) and to size the isolated features microscopically or by weighing. It is 
interesting to note that logarithmic normal size distributions have been observed in most 
cases (see, for example, Exner [1972b] and Rhines and Patterson [1982] or Rhines 
[1986]) which make possible to describe the size distribution by only two parameters 
(e.g., the arithmetric mean and the standard deviation). 

There are a large number of problems in materials research where size, distance and 
size-distribution are relevant. One of the best-known relationships between microstruc- 
tural geometry and properties is the Hall-Petch relationship predicting a linear depen- 
dence of the yield stress of polycrystalline materials on the square root of reciprocal 
grain size. However, size distribution has a significant influence on this relationship 
(KOhlmeyer [1978], Gokhale and De Hoff [1984], Rhines [1986], see also Exner 
[1994]) since deformation starts at the largest rather than the average grain (Essmann et 
al. [1968]). Experimental results confirm the simple form of the equation owing to the 
fact that the shape of grain-size distributions in most annealed single-phase materials is 
fairly similar. Examples for the predicted linearity between the yield strength and the 
reciprocal square root of the grain size or the particle size are shown in figs. 20a and b. 
It is interesting to note that prestraining changes the parameters but not the general form 
of the Hall-Petch relationship (Wang and Murr [1980], see fig. 20a). 

However, geometric similarity is not a general rule and can not be assumed a priori 
but needs experimental proof for every material and processing schedule. Large 
deviations from geometric similarity are frequent. For example, the width of grain-size 
distributions in recrystallized metals depends strongly on deformation prior to annealing, 
even at long annealing times (Okazaki and Conrad [1972] and Rhines and Patterson 
[1982]). This has consequences for the relationships between properties and microstruc- 
ture which must not be neglected. 

For aluminium samples, it has been reported by Gokhale and Rhines [1993] (see 
also De Hoff [1984], Rhines [1986] and Exner [1994]) that the variation in size 
distribution has a much larger influence on the yield strength than the variation in mean 
grain size by a factor of 2 and that the same is true for the tensile strength. This example 
clearly demonstrates the importance of a full appreciation of microstructural parameters 
in theoretical modelling and the ambiguity of well-accepted relationships between 
microstructure and properties. Another typical example is the dependence of transverse 
rupture strength of WC-Co alloys on carbide particle size, spacing and contiguity (see 
Exner [1994]). 

Other useful equations involving mean size and distance are derived for simplified 
geometries, often taking a model in which grains or particles are assumed to be spheres 
of uniform size. For instance, the Zener relationship for dispersed particles limiting the 
grain size (ch. 28), 

L = klJV, (13) 

(where L is the equilibrium grain size in a material with particles of mean linear size L a 
and volume fraction V v ), is usually derived in this way (Porter and Easterling 
[1981]). k is a constant depending on the shape, the size distribution and the arrangement 
of the particles. The value usually taken (k = 4/3) relies on the assumption that the 
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Fig. 20. Experimental correlations between size and distance of microstructural features and the yield strength 
of metallic alloys, (a) yield strength of some metals as a function of square root of mean linear grain size. CR = 
cold reduction in percent (data from Wang and Murr [1980] and Raza [1982], (b) yield strength of unalloyed 
low-carbon steels as a function of the reciprocal square root of the mean particle spacing and of the mean linear 
grain size (data from Liu and Gurland [1968] and Anand and Gurland [1976] according to Exnf.r and 
Hougardy [1988]). 


particles are regularly dispersed single-size spheres. Therefore, deviating numbers for k 
are obtained experimentally. 

A more sophisticated treatment is needed including size distributions and it seems that 
Monte Carlo simulations provide a means for assumption-free modelling. A very 
promising approach for a generalisation of the Zener equation is to treat the interaction 
between grain boundaries and matrix/particle interfaces by stereological modelling 
(Aigeltinger and Exner [1977], Exner [1994], 

Equations describing the kinetics of grain growth and particle growth by Ostwald 
ripening or coalescence, the dependence of dendrite arm spacing and lamellar spacing on 
freezing rate, and several other processes determining the size of microstructural features 
have been derived on the basis of simplified model geometries some of which are 
discussed in more detail in other chapters (e.g., ch. 9, § 15). Experimental proofs given 
on the basis of quantitative microscopy will be statistically significant and free of 
systematic errors only if the geometric details of the theoretical models are compatible 
with the real microstructure. As an example, fig. 21 shows the comparison of size 
distributions of vanadium carbide particles of spherical shape with predictions made by 
the Wagner-Lifshitz theory for Ostwald ripening (ch. 15, §2.4.2). Perfect agreement for 
reaction-controlled growth is obtained for long annealing times while significant 
deviations exist for the as-sintered (i.e., briefly annealed) state (Exner [19731). This 
comparison is made on the basis of linear intercepts (Exner and Lukas [1971]) rather 
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Fig. 21. Si 2 e distribution of spherical VC particles in a nickel matrix sintered and heat-treated at 1500°C (liquid 
matrix). Histograms: experimental results; solid curves: theoretical prediction (after Exner [1973]). (a) Sintered 
1 h, mean linear intercept = 17.5 fim\ (b) annealed 8 h, mean linear intercept =45.5 fim. 


than of sphere diameters, since unfolding of the experimental data generally increases the 
experimental scatter pronouncedly (Nicholson [1978]). 

7.2.4. Orientation, contiguity, shape and other complex parameters 

In addition to volume fraction and interface density (and the related parameters size 
and distance for separate features in a microstructure) there is a variety of other, more 
complex aspects of microstructural geometry. Two of these aspects, both rather important 
in many practical problems, can be quantitatively characterised using measurements made 
in plane cross sections, namely orientation and contiguity (degree of neighbourship and 
skeleton formation) using the concept of weighted interface densities. 

The degree of orientation is quantified by relating the oriented part of the interface 
to the total interface of a microstructural component yielding 

F m = S w (oriented)/5 v (total) (14) 

which, as outlined in early literature (e.g., Underwood [1970], Saltykov [1974], see 
also Exner [1994]) can be calculated from directed point counts along perpendicular 
lines for linearly as well as planarly oriented interfaces (for counting recipes see Exner 
[1993]). 

Similarly, the degree of skeleton formation of a microstructural component can be 
derived by relating the grain boundary density to the total interface, i.e., 

C aa = S v (grain boundary )/S v (total) (15) 

as suggested by Gurland [1958, 1966], and the degree of neighbourship between two 
microstructural components, a and /3, by relating the interface density between compo- 
nent a and /3 (in a two- or a multiphase alloy) to the total interface of component a, i.e., 
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C" p = Sy (interface af})/S y (total a) (16) 

Useful extensions of the concept of contiguity are the continuous volume and the 
separated volume (Lee and Gurland [1978]). Recent reviews of these aspects, their 
assessment including practical recipes, and a review of applications in materials science 
are available (Exner [1993, 1994], Fan etal. [1993]). Particularly for characterizing the 
microstructure of cemented carbides, the concept of contiguity proved indispensible (see, 
for example, Exner and Fischmeister [1966], Gurland [1966], Exner and Gurland 
[1970], or Lee and Gurland [1978]). 

Most recently, a new parameter for characterising the degree of connectiveness in 
two-phase microstructures was proposed by Poech and Ruhr [1994]. Based on the 
skeletonizing procedure provided by mathematical morphology (see § 7.3) and available 
with most modem image analysers, the “matricity” of the two phases is defined by the 
length of the skeleton line in one of the phases divided by the sum of the length of 
skeleton lines in both phases. The application of this parameter was successfully 
demonstrated for nickel-silver and martensite-austenite model alloys and for modified 
aluminium-silicon castings (Poech and Ruhr [1994] and Pompe [1994]). 

The most difficult problem in quantification of microstructural geometry is shape. 
Though numerous attempts have been made to find parameters for “average” shape in 
three dimensions, a general solution is not available, and some proposed parameters (see, 
for example Underwood [1970, 1976, 1980], Fischmeister [1974], Medalia [1980], 
Exner and Hougardy [1988], or Exner [1994]) are very limited, while others are of 
not much use at all since they either cannot be assessed experimentally or are size- 
sensitive. There are two ways out: one is to use planar shape descriptors for sections or 
projections as a “fingerprint” for three-dimensional shape, the other is serial sectioning 
in order to get the three-dimensional reconstructions or to isolate individual grains and 
particles chemically (Rhines and Patterson [1982], De Hoff [1982]). While the latter 
techniques are time consuming and only useful for demonstration purposes or very 
specific applications, e.g., for proving or disproving a theoretical concept, planar shape 
characteristics are in widespread use. 

Classical shape parameters to describe various aspects such as elongation, bulkiness, 
waviness, or symmetry can be defined on the basis of combining metric properties like 
area, perimeter, Ferret’s diameter, moments of inertia, etc. (Beddow et al. [1980], 
Schwarz [1980], Exner and Hougardy [1988], Coster and Chermant [1989], see 
also Exner [1994]). Applications of planar shape descriptors to problems in physical 
metallurgy have remained scarce, but now — with the availability of shape parameters 
in most automatic instruments — they can be used for descriptive purposes or for 
establishing empirical correlations for shape-sensitive properties of materials. As an 
example, a close correlation between fracture strength of eutectic aluminium-silicon 
alloys and shape of the silicon precipitates has been demonstrated (Paul etal. [1982]). 

Other concepts of shape characterization are based on clear mathematical concepts, 
for example those using Fourier or Walsh coefficient (see, for example, Meloy [1980] 
or Floor [1987]), those based on mathematical morphology (Coster etal. [1981], 
Coster and Chermant [1989], see § 7.3) or, as a derivative of the presently fashionable 
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fractal analysis, fractal dimension. Provided that the experimental procedure for the 
assessment of fractals is sufficiently precise, a unique value characterising the irregularity 
of interfaces and nonplanar surfaces is obtained. Applications of this concept to problems 
in materials science are abundand and some interesting examples have been published 
including fracture surfaces, grain boundaries in deformed metals, phase interfaces after 
martensite transformations, among other (see reviews by Wright and Karlsson [1982], 
Hornbogen [1986a, b, 1987], or Exner [1994]). Considering the fact that these concepts 
are rather abstract from human perception and even more their limitation to planar 
geometry, it seems rather doubtful that they will be successful in the fields of materials 
science and technology (see also Exner [1994]). 

Serial sectioning has been used to evaluate topological parameters , mainly in 
connection with studying the grain shape in metals (Rhines etal. [1974] and Rhines and 
Patterson [1982]) and the development of microstructure during sintering (De Hoff et 
al. [1972] and Aigeltinger and Exner [1972]). In this way, the number of separate 
parts (or, in other words, the number of pores or particles) per unit volume can be 
assessed. Stereological equations relating numbers per unit area or unit length to numbers 
in three dimensions are usually not reliable since they depend on information concerning 
shape- and size-distribution. Another useful topological parameter is connectivity (or 
genus) which describes the number of connections (channels) between nodes (larger 
regions). Figure 22 shows the results of a study of sintered copper on which the 
geometric properties of the pore space have been followed up as a function of porosity, 
i.e., pore volume fraction (Aigeltinger and Exner [1972]). If chemical disintegration 
for isolating microstructural features is not feasible, serial sectioning, though extremely 
time-consuming even with automatic sample preparation, automatic instrumentation and 
computer processing, may be the ultimate answer to geometric analysis of irregularly 
shaped microstructural features in three dimensions (De Hoff [1982]). 

Another important aspect closely related to both shape and size is curvature. 
Variations of local curvature provide the driving forces for shape changes and coarsening 
of microstructural features (e.g., fibers, pore systems, precipitations, etc.) but are difficult 
to quantify. Integral mean curvature, on the other hand, can be measured quickly and 
unambiguously from cross-sections using equation (12), see §7.2.3. As a sensitive 
parameter for monitoring the path of microstructural changes during annealing (De Hoff 
and Isawaran [1982]), it should find much more widespread application. In fig. 22, the 
variation of curvature during the sintering process can be clearly related to the qualitative 
and quantitative changes of pore geometry. Curvature distribution, on the other hand, 
cannot be assessed from planar cross sections, rather, stereometric techniques, must be 
applied (Feijoo et al. [1990]). Though very tedious, the distribution of curvatures of 
dendrite surfaces was measured with interesting results on the development of dendrites 
during solidification of aluminium alloys, and a comparison with theoretical predictions 
was made (Feijoo and Exner [1991]). 

Finally, there remains the problem of characterizing the arrangement of micro- 
structural features. Again, three-dimensional characterization must rely on simplifying 
assumptions or on serial sectioning. The classification (random, regular, clustered) and 
quantitative description of planar arrangement of features defined by a characteristic 
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Fig. 22. Changes of metric and topological properties of pore space in sintered copper powder as a function of 
porosity varied by sintering at 1000°C. The spherical and the electrolytic (dendrite-shaped) powders show 
different pore structures at comparable porosity. The right part of the figure clearly shows the formation of new 
contacts (increasing genus) at the early sintering stage and the closure of pore channels (sharply decreasing 
genus) and formation of isolated pores and their disappearance by shrinking in the later stages. These 
topological changes are also reflected by the metric properties shown in the left part of the figure: Surface 
density and integral mean curvature decrease due to smoothening of the pore-solid interface but increase for a 
short period when the channels start to close and at the very late stages when isolated pores shrink. (After 
Aigeltingkr and Exner [1972].) 

point (e.g., centre of gravity) on the basis of nearest-neighbour distance distributions has 
been proposed (Schwarz and Exner [1983]) and a number of statistical procedures 
became available from other fields, e.g., from pattern recognition and cluster analysis. Up 
to recently, application of arrangement parameters to materials is near to non-existent. A 
very interesting exception is a study on the arrangement of carbides in high-speed steels, 
which has shown the feasibility of simple statistical concepts and the value of arrange- 
ment analysis (Werlefors et al. [1979]). Mathematical morphology (see §7.3) and, 
particularly, techniques based upon pair correlation functions and other statistical 
approaches based on stochastic geometry (see, for example, Stoyan et al. [1987], 
Stoyan [1990], Ohser and Lorz [1994] or Exner [1994], Bertram and Wendrock 
[1995] or Wendrock et al. [1995]) supply a wealth of concise descriptors which, 
however, are only developed for planar arrangement. As mentioned above (§ 7.2.3), most 
of these approaches seem not a practicability for 3D arrangements and seem rather 
insensitive to changes of arrangements which are often easily detected by visual 
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inspection, as, for example, lamellar patterns or particles lined up in rows which have a 
certain degree of irregularity. 

7.3. Mathematical morphology 

In addition to classical geometric parameters, a special methodology based on 
“structuring elements”, termed mathematical morphology has been developed, largely in 
France, into a consistent and practically useful framework for image analysis. A full 
theoretical account of this field, including many pertinent references, has been published 
by Serra [1982, 1988], There is no space here to describe these sophisticated concepts 
in detail. Attempts to give a comprehensible treatment of image analysis and of image 
manipulation based on mathematical morphology which, for the materials scientist, are 
easier to understand than Sena's mathematical theory, have been published by Fisch- 
meister [1981], Coster and Chermant [1989], Russ [1990, 1992], Chermant and 
Coster [1994] or Chermant [1994], among others. Some procedures which have been 
proven useful in practical work are outlined in the following. 

The microstructure or, rather, its binary image is treated as a set of points which can 
be transformed by various operations in order to extract characteristic parameters of this 
set, which then are used to describe the geometry of the microstructure. The effects of 
the most useful operations, erosion and dilatation, opening and closing are demonstrated 
in fig. 23. Erosion is achieved by moving the structuring element from picture-point to 
picture-point and removing all those points which do not fulfill a specified criterion, e.g., 
that all neighbour points are inside the dispersed features shown in fig. 23a, which, in 
other words, means removing all points at the edge of the features. By this operation, the 
two small features 1 and 2 and the bridge connecting two parts of the large particle are 
eliminated. The area of the large feature has obviously become smaller as shown by the 
dashed line in fig. 23a. By dilatation , i.e., by adding all those matrix points which have 
at least one neighbouring point inside the features, the original size is very closely 
reconstructed but the two small features are still gone as is the bridge between the two 
large features (fig. 23b). Therefore, this combined operation is called opening. Similarly, 
if dilatation is carried out first, followed by erosion (figs. 23c and 23d) the result is 
reversed and a bridge is formed by this combined operation called closing. 

It is obvious that this manipulation very effectively allows the elimination of small 
features or contacts arising from detection noise in automatic instruments or to separate 
relevant information from the image. Another operation for segmenting connected 
features is the grassfire transformation (Nawrath and Serra [1979]) allowing 
identification and counting of separate particles and grain comers by alternating or 
repeated erosions and dilatations. Another transformation procedure called skeletonizing 
reduces each feature (e.g., irregular intersect area) to a pattern of lines which then can be 
analysed with respect to the number of branches and nodes. The ratio of line lengths of 
the skeletons of two interdispersed phases was suggested by Poech and Ruhr [1994] to 
characterise the character of two-phase materials. The concepts of covariograms and star 
function, and a large number of other parameters were suggested for characterizing size, 
spacing, shape, arrangement and many other aspects of microstructural geometry by 
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Fig. 23. Two-dimensional image transformations by operations of mathematical morphology (Exnhr and 
Hougardy [1988]). E is the element of 9 picture-points (with the reference point in the centre) by which the 
following operations are carried out: (a) Original features and eroded outline (dashed) of feature 3; (b) dilated 
outline of eroded fig. 23a; (c) dilated outline of fig. 23b; (d) eroded outline of fig. 23c. Combined operations 
are called opening (a and b) and closing (c and d). 


Serra [1982, 1988] and Coster and Chermant [1989] and a large number of authors 
which tried to apply mathematical morphology to practical problems. 

Most information obtainable in this way should be strictly considered as fingerprint- 
type information on microstructural geometry (Fischmeister [1981]) owing to the lack 
of quantitative relationship with three-dimensional characteristics of the microstructure 
and to uncontrolled effects of image distortion. The stereological aspects of mathematical 
morphology have been discussed by Serra [1982, 1988] and Coster and Chermant 
[1989], among others. In general, mathematical morphology is very useful in all those 
cases where changes of microstructural geometry need to be monitored without seeking 
3D information. In cases where physical modelling of microstructural changes is 
attempted, stereological parameters should be preferred. This statement seems to remain 
valid in spite of the later attempts to clarify the stereological aspects of mathematical 
morphology culminating in the mathematically correct but unpracticable fact that all 
stereological equations could be derived from mathematical morphology and thus are a 
sub-set of this concept. 

More recently, mathematical morphology has been extended to the analysis of grey- 
tone pictures and certainly will further proceed to colour images and other applications 
not directly relating to microstructures. As mentioned above (§7.1), image modification 
by morphological processing (filtering, cleaning and many other amendments) are indis- 
pensible applications of mathematical morphology used in most modern image analysers. 
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7.4. Further aspects 

Generally, more than one single geometric aspect is of relevance in characterizing the 
microstructural geometry of multiphase materials, and a combination of parameters will 
be needed to give a full description. Such combinations have been proposed for studying 
the evolution of microstructure (for example, during sintering, see fig. 22 and ch. 31) or 
for empirical correlations with mechanical properties (for example, those of cast and 
annealed aluminium alloys (Paul et al. [1982]), of cemented carbides (Exner and 
Gurland [1970]) or of sintered steel (Exner and Pohl [1978]). In addition to these and 
other examples cited above, successful applications of quantitative characterization of 
microstructural geometry have been reported in the literature; some of these have been 
reviewed in overviews, for example by Fischmeister [1972], Saltykov [1974], 
Hougardy [1975], Lafond and Moliexe [1975], Rhines [1976, 1986], Mathy [1977], 
Exner [1978a, 1994, 1995], Ondracek [1978, 1982], Russ [1986, 1990, 1992], Exner 
and Hougardy [1988], or Coster and Chermant [1989], among others. Quite often, 
however, data are published not so much to aid the solution of a materials problem but 
rather to demonstrate the feasibility of a new technique or a new instrument. Considering 
the high standard of methodology now achieved, it seems timely to utilize quantitative 
analysis of microstructure more directly as a clue for understanding the behaviour of 
materials during processing and in application. 
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I. Introductory remarks 

In the fifties, when transmission electron microscopes became commercially available, 
their potentialities for enabling applied and fundamental research in physical metallurgy 
and materials science were realized soon. Within a few years the resolution limit for a 
direct imaging of structural details of solids — up to that time limited by the light- 
wavelength in optical microscopy to some fraction of a /xm at best — was reduced to 
about one nm; and in the course of this rapid development the horizon was opened for 
completely new aspects of research with a spatial resolution to nearly an atomic level. 
However, at that time it was quickly realized that transmission electron microscopy 
(TEM) differs from the classical optical microscopy significantly in the sense that TEM, 
in particular when applied to crystalline specimens, requires a much more profound 
knowledge of the interaction of the imaging (electron) waves with matter than in the case 
of optical microscopy. This holds for the imaging of crystal inhomogeneities (lattice 
defects, precipitates etc.) by TEM via elastic interaction of the imaging electrons with the 
specimen atoms. But it holds also for a local material analysis via various processes of 
inelastic interactions. 

The present article attempts to present a brief, and necessarily incomplete, introduc- 
tion (i) to a number of methods for imaging different kinds of lattice defects and precipi- 
tates by diffraction contrast, constituted mainly by elastic interaction, and (ii) to problems 
of the new field of analytical electron microscopy, based on inelastic interactions. 

Because of the restricted space available, the references quoted are necessarily 
incomplete and, perhaps, not altogether free of arbitrariness. Other, more special aspects 
of TEM such as, e.g., Lorentz microscopy of ferromagnetic domains and special aspects 
of high-voltage electron microscopy (e.g., the critical voltage effect) are not dealt with 
at all. 

In order to facilitate the step into the topics of this chapter the reader is recommended 
to study first the Appendix, Elements ofkinematical diffraction theory, which may provide 
him with the basic knowledge necessary for an understanding of the content of the 
present chapter. This appendix may also be useful in connection with ch. 12. 


2 . The instrument 


The technology of modem electron microscopes has been developed so far that 
nowadays commercially available instruments and the sophisticated attachments fulfill 
practically all requirements necessary for studying crystalline and non-crystalline thin 
films. Some essential properties of the instrument are sketched in this section. 

The resolution of an electron microscope is governed by errors of the magnetic 
lenses, in particular by the spherical aberration of the objective lens. The ultimate 
resolution r min can be reached for an optimum objective aperture angle a opl , with 


“op. = Ak • 


„ — d\3/4x-*1/4 

r min = C s • 


( 1 ) 


where A is the wavelength of the incident electrons and C s the constant of spherical 
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aberration. A and B are constants, their actual values depend on the way in which 
different contributions to lens errors are combined (Hirsch etal. [1977]). 

High-resolution instruments (with accelerating voltages of 200 kV or 300 kV) possess 
a point-to-point resolution of < 0.25 nm. If ultimate resolution has to be reached then 
stringent requirements are necessary for the stabilities of the high voltage and the lens 
currents. The pole piece of the objective lens possesses a narrow gap and small borings, 
so that in such instruments the specimens can be manipulated only in a very limited 
range. 

Usually, ultimate resolution is not necessary for electron microscopy studies in 
materials science. For these studies it is more important that the specimen can be shifted 
and tilted over large ranges and that different signals of scattered electrons and X-rays 
can be detected in analytical microscopy studies. The lenses of the standard instruments 
possess pole pieces with larger boring which allow specimen manipulations and the 
mounting of detectors for analytical purpose. Thereby, the resolution of the instrument 
is reduced to ~ 0.4 nm. This resolution is, however, sufficient for most TEM studies in 
materials science. 

An electron microscope can be split into three components according to its function: 
(i) the illumination system composed of the electron gun and the condensor lenses 
produces a fine electron beam to “illuminate” the specimen; (ii) the objective lens, which 
immediately follows the specimen, produces the diffraction pattern and a first magnified 
image of the specimen; (iii) the magnification system produces the final image. In 
addition to the optical components the microscope encloses also a specimen chamber 
(specimen handling system) and a recording system (Hirsch etal. [1977] and Hren et 
al. [1979]). 

The illumination system provides a beam of adjustable size, intensity and conver- 
gence angle at the specimen, resulting in a limited coherency. The term coherence refers 
to the range of phase differences in the illuminating beam as it approaches the specimen. 
If the electrons come from a single point source, then all the waves in the incident beam 
are in phase with each other and the illumination is coherent. On the other hand, if the 
source of electrons is so large that there is no phase relation between the incident waves, 
the illumination is completely incoherent. In reality, the filament in an electron micros- 
cope is somewhere between these two extremes and the incident illumination is defined 
as partially coherent (Spence [1981]). 

In TEM the dimensions of the region on the specimen, a, over which the illumination 
appears coherent is related to the angular aperture of the illumination, a it by (Geiss 
[1979]): 


Field emission guns possess a very small a, which results in a large coherence length. 

Charged electrons interact strongly with the transmitted specimen, the scattering 
cross-section <x e for electrons is rather large compared to the cross-section of neutrons or 
X-rays (see appendix). Specimen thicknesses are required in the range of 10 nm to 1 /im 


References: p. 1105. 
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depending on the imaging mode and on the voltage of the TEM. Recipes exist for 
electrochemical thinning of conducting materials (Thompson-Russell and Edington 
[1977]) as well as for ion beam etching of insulators (Tighe [1976]). 

Different kinds of electrons and electromagnetic waves are emitted from a specimen 
which is irradiated with high-energy electrons. The different waves result from elastic or 
inelastic scattering processes. Different signals (fig. 1) are used for different imaging 
modes. Information on the crystal structure and on defects in the specimen can be 
obtained by studying the elastically scattered electrons (in micrographs inelastically 
scattered electrons are also present and contribute to the background intensity) whereas 
investigations of inelastically scattered electrons and of other waves leaving the specimen 
allow the determination of chemical composition and topology of the specimen surface. 

The microscope can be operated in different modes as described briefly in the 
following. For more details the reader is referred to textbooks on electron microscopy 
given in the list of references. 

In standard or conventional transmission electron microscopy (TEM) mode the 
microscope is operated to form images by bright field (BF), dark field (DF), or lattice 
image (phase) contrast, see fig. 2. A BF (DF) image is formed when only the direct (one 
diffracted) beam is used for image formation. The objective aperture prevents all other 
beams to pass to the recording system. Usually, the specimen is oriented so that the 
Bragg condition is nearly fulfilled for a set of lattice planes. Then one reflected beam is 
strongly excited besides the incident beam (see § 4). A weak-beam dark-field (WBDF) 
image is produced if a weakly excited DF beam is used for imaging (Cockayne [1978]). 

A lattice image is formed by the interference of at least two beams in the image plane 
of the objective lens. Lattice fringes can be observed if a row of systematic beams 
(reflected at the lattice planes in question) is used for imaging, while a structure image 
is formed by using many beams present in a low-indexed Laue zone. Special adjustments 
of the microscope are required for the formation of the high resolution electron micros- 
cope images (Spence [1981] and Thomas and Goringe [1979]), see §9. 


Incident beam 


Cathodoluminescence 


Absorbed 

electrons 



Backscattered electrons 

Secondary electrons 
Auger electrons 


I —Specimen 


-Transmitted electrons 

Diffracted electrons-/*' \ — Energy Loss electrons 


Fig. 1. Electrons and electromagnetic waves emitted from a transmitted specimen as a result of elastic and 
inelastic scattering or diffraction of the incident electron waves. 
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Illumination system 

Q 



i- diffracted beam — — 




Bright Field Dark Field 


■ • • <•;— • • 
000 hkl t*k 000 
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Fig. 2. Ray diagrams (including Ewald sphere construction) for: (a) a conventional two-beam bright-field (BF) 
image; (b) dark-field (DF) image; (c) weak-beam dark-field (WBDF) image; (d) lattice image. The gun of the 
electron microscope is tilted by the appropriate angle in going from (a) to (b) or from (a) to (c). 


The objective lens produces a diffraction pattern of the specimen in its backfocal 
plane (see fig. 3). The first image of the object is rotated by 180° against the diffraction 
pattern. The diffraction pattern and the first image are magnified by the subsequent 
intermediate lenses and projector lenses. The information obtainable from the diffraction 
pattern is summarized in § 3. 

In the scanning transmission electron microscopy (STEM) mode, the electron beam 
is focused as a fine probe on the specimen by the prefield of the objective lens. The 
beam probe is scanned over the specimen (by scanning coils) and the transmitted 
intensity is recorded. 

The STEM mode is usually applied in materials science for analytical microscopy 
(see § 10), where the probe is fixed on a selected small area and either the energy losses 
of the transmitted electrons are studied (EELS) or the X-rays emitted from the specimens 
are investigated (usually by EDS) for the determination of the chemical composition of 
the specimen. The use of a very small probe size (< 10 nm) is often limited by a strongly 
enhanced contamination rate (Hren [1979]). 

In the STEM mode backscattered electrons can be collected as well as secondary 
electrons. These scanning micrographs are similar to those obtained by regular scanning 
electron microscopy (SEM). Information on the topography of the specimen surface can 
be obtained. 


References: p. 1105. 
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Fig. 3. Ray diagram in the area of the objective lens of an electron microscope. A diffraction pattern is formed 
in the backfocal plane F of the objective lens, whereas the (first) image I of the object O lies in the plane 1. 
The first image of the object is rotated by 180° relative to the diffraction pattern. 

3. Information from the diffraction pattern 

3.1. Diffraction spot pattern 

The diffraction pattern provides crystallographic information on a qualitative phase 
identification and on the orientation-relations between crystals and the direction of the 
incoming electron beam. Kikuchi patterns (see § 3.2) can be used for the exact deter- 
mination of the orientation, whereas the convergent-beam technique allows statements on 
crystal symmetry and determination of the foil thickness. 

The possibilities and the accuracy of analysis of diffraction patterns are discussed in 
many textbooks on electron microscopy and special papers (e.g., Andrews etal. [1971]). 
Important factors for the accuracy are: (i) the shape factor described in the appendix 
which determines the intensity distribution in the reciprocal space; (ii) instrumental 
alignment and beam divergence; (iii) specimen perfection; (iv) curvature of the Ewald 
sphere and its orientation with respect to the foil; (v) double diffraction. 

3.1.1. Double diffraction 

From the structure factor it follows that certain reciprocal lattice points are not 
present (zero intensity) for certain crystal symmetries. However, each diffracted beam 
(within the crystal) can act as an incident beam and can diffract electrons to a reciprocal 
lattice point forbidden by the structure-factor rules, especially in orientations where 
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several different reflections are excited simultaneously. For example, in the diamond cubic 
structure the (002) reflection is not allowed, but this reflection can be excited in a [110] foil 
via double diffraction: if a (ill) reflection is excited then this reflected beam can act as a 
primary beam for (111) planes; this gives a total reflection £, +£ 2 = (1 1 1) + (1 11) = (002). 

3.1.2. Patterns from ordered crystals 

The symmetries of ordered crystal structures are often changed compared to the 
disordered crystal structure. This results in the appearance of superlattice reflections at 
positions that are forbidden for the disordered structure. The intensities of the superlattice 
reflections correspond to the difference between the atomic scattering factors of the 
different atoms, in contrast to the intensities of the fundamental reflections, which are 
related to the sum of the scattering factors. As an example, the B2 superlattice is 
selected. It is based on the bcc structure of the CsCl lattice with one kind (A) of atoms 
at 000 and other (B) at \ The structure factors are (for complete ordering): 

F = /. + /„ for h + k + l = even: fundamental reflections, 

(3) 

F = f A - f B for h + k + l = odd: superlattice reflections. 

The corresponding diffraction pattern is shown in fig. 4a. In general, superlattices can be 
identified from their diffraction pattern, either by comparison with structure-factor 
calculations for different possible superstructures, or by an analytical method which can 
be viewed as the reversal of the structure-factor calculations (Khachatur y an [1974]). 
In this method the ordered lattice is represented by a superposition of concentration 
waves. Theoretically, it allows unequivocally the determination of the real space lattice 
from the complete set of experimentally determined superlattice diffraction vectors. 

This method by Khachaturyan can be used to identify superlattices in substitutional 
(Das etal. [1973]) and interstitial solutions. However, it must be carefully observed that 
superstructures cannot always be identified by diffraction patterns alone. If, as in many 



Fig. 4. Diffraclion patterns from ordered crystals: (a) diffraction pattern of an ordered /3-AlNi alloy (B2 
superlattice), [100] zone axis, superlattice reflections are visible; (b) Diffraction pattern of an ordered Ta-O 
alloy, [110] zone axis, variants of different orientations contribute to the pattern. 


References: p. 1105. 
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disordered cubic alloys, an ordered structure has non-cubic symmetry a number of 
orientational variants may exist. Figure 4b shows, as an example, a (110) pattern of an 
ordered Ta-0 interstitial phase. The superlattice spots arise from very small ordered 
domains. If the selected area contributing to the diffraction pattern is large compared to 
the domain size, then the different domain variants contribute to the diffraction pattern. 
It is then impossible to determine the superstructure from the diffraction pattern alone. 
The size, shape, and the number of the different variants of the ordered domains can be 
observed, however, from different DF images taken with superlattice reflections. 

3.2. Kikuchi lines 

Electrons can be scattered inelastically by interaction with the atoms of the specimen. 
Those electrons lose energies in the range of about a hundred eV. The inelastically 
scattered electrons can subsequently be diffracted coherently when Bragg’s law is 
fulfilled at a suitable set of reflection planes. Since the (primary) inelastic scattering 
process occurs in different directions, the loci of the different subsequent coherent 
scattering are cones with semivertex angles of (90° -6) to each side. The two cones are 
bisected by the reflection plane. The lines are, therefore, produced in pairs which are in 
contrast to the background. A deficiency line of less intensity than the background occurs 
nearer the origin of reciprocal space than its associated excess line with intensity above 
the background. Two conditions must be fulfilled so that these Kikuchi lines are 
observed. One is that the crystal is thick enough, so that enough inelastic scattering 
processes occur, the second condition is that the crystal must be nearly perfect, especially 
not bent. The width of the lines indicates the curvature range of the crystal planes over 
the thickness traversed. If this becomes excessive, the lines disappear into the back- 
ground as their intensity is spread over a larger angle. 

The Kikuchi line pattern can be used to determine the orientation of the crystal with 
respect to the incoming electron beam. There exist two ways for doing this. Either three 
pairs of Kikuchi lines have to be indexed and then the orientation can be calculated, or 
the observed Kikuchi pattern has to be compared to Kikuchi maps (Thomas and 
Goringe [1979]). The orientation of the specimen with respect to the electron beam can 
be determined with an accuracy of better than 0.3°. The excitation error s (cf. § 4) can 
also be determined from the relative position of Kikuchi lines compared to the diffraction 
spots as demonstrated in fig. 5. 

3.3. Convergent-beam diffraction 

The size of the area giving rise to the diffraction pattern can be substantially reduced 
if — similarly as in STEM — the electron beam is focused onto the specimen. Under 
this condition it is practically impossible to maintain the nearly parallel illumination 
condition. Some convergence of the beam is introduced. As a result, the diffraction 
“spots” become “discs”. For a convergence angle a < 2 d B this does not introduce 
problems since the discs do not overlap (0 B is the Bragg angle). 

In the intensity distribution of those diffraction “discs” much information is available 
(Steeds [1979]): 
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Fig. 5. Kikuchi lines on a diffraction pattern: (a) excitation error s=0; (b) s>0. (c) Sketch showing shift in 
Kikuchi lines produced by a tilt e. 
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(i) Each diffraction disc is produced from exactly the same small area (microdif- 
fraction) without any shift introduced by lens errors (Hirsch etal. [1977]). Therefore, 
these diffraction patterns can be used for the determination of the orientation and the 
crystal structure of the small illuminated area. 

(ii) Each disc contains the intensity diffracted by this same area of crystal for a range 
of incident orientations. That means that diffraction discs (convergent-beam electron 
diffraction pattern — CBED) are two-dimensional maps of diffraction intensities as a 
function of inclination between the incident electrons and a particular crystal direction. 

The lattice parameters of a crystal can be determined very accurately from the fine 
structure of CBED patterns. A change in Bragg angle A 9 results in a change of the 
positions of lines within a pattern. The method is very strong in measuring relative 
changes in lattice parameters. Absolute numbers do not have a high accuracy. 

If the change of lattice parameters of an alloy or compound is directly and unambig- 
uously related to its chemical composition, then the chemical composition may be 
deduced from lines observable in CBED patterns. As an example, for dual-phase steels 
the local concentration of carbon could be determined with an accuracy of <0.1 at% 
(Steeds [1981]). The spatial resolution (including beam broadening) is governed just by 
the geometry of the electron probe. 

Local strains (Steeds [1981]) and also the foil thickness (Kelly etal. [1975]) can be 
measured from CBED patterns. Furthermore, CBED patterns allow the determination of 
the symmetry and of the space group of the crystals (Steeds [1981]). 

3.4. Moire pattern 

Moire patterns (see fig. 8c for an example) occur from overlapping crystals as in 
composite films or in two- (or more) phase systems. Two general cases must be 
considered: 

(i) Parallel moire fringes are formed if two parallel planes of different spacings are 
reflecting. The lattice spacings (d ] = l/g, and d 2 = 1/ gf) differ only in magnitude. Fringes 
can be observed with distances d= 1/Ag= l/\g x -g 2 \- 

(ii) A rotational moire pattern is formed when planes with equal spacing d, but 
rotated through an angle a, diffract together. For this situation the moir6 spacing d t is 
given by d r - 1/gsina. 

Sometimes moire patterns have to be distinguished from other periodic defects in the 
crystals, e.g., a set of parallel dislocations. This can always be done best by imaging the 
same area with different diffraction vectors. The direction of the moire lines is usually 
perpendicular to the diffraction vector. 

4. Theory of diffraction contrast 

4.1. Introduction 

Diffraction contrast of defects in crystalline specimens is mainly a problem of high- 
energy electron diffraction in non-perfect crystals. It will turn out later that, by intrinsic 
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reasons, for the imaging of lattice defects (dislocations, stacking faults etc.) an electron- 
optical resolution of about 1 nm is in general sufficient. Accordingly, lens errors of the 
objective lens can be neglected. We assume that the objective lens is focused onto the 
lower specimen surface through which the electrons leave the specimen. Then it suffices 
for the purpose of this section to calculate the electron wave function at the lower 
specimen surface since the electron-optical imaging system is assumed to be perfect. 

In a first part, we deal with the diffraction in a perfect crystal specimen. In a second 
part, it will be outlined how the diffraction theory must be extended for specimens 
containing lattice defects. Electron refraction effects, due to the mean inner potential of 
the crystal, are neglected throughout. 

4.2. Specimen, reciprocal lattice and excitation error 

We consider a specimen of constant thickness t of the order of 100 nm. A Cartesian 
coordinate system is introduced with its origin in the upper specimen surface and with 
the z-axis (unit vector e z ) perpendicular to the specimen plane and pointing downwards. 
The lateral dimensions L x , Ly of the specimen are orders of magnitude larger than t. 
According to the appendix, this means that the intensity distributions IF(k)I 2 at the 
reciprocal lattice points g are rod-shaped or spike-shaped with the spike axis parallel to 
e z and a spike length of the order of l/t. This is indicated in fig. 6, where also the 
wavevector k 0 of the incident wave and the Ewald sphere are inserted. Note that this 
figure is not to scale: assuming a modulus of (0.2 nm) -1 for the low-order diffraction 
vector and k 0 = 1/A =(3.7 pm) -1 (100 keV electrons), we have k Q /g i = 54. Having this 
in mind, one realizes that near the low-order reflections the Ewald sphere is so flat that, 
if the sphere cuts through the spike of one of the reciprocal lattice points, this will 
happen also for the spikes of other adjacent reciprocal lattice points. Accordingly, in 
high-energy electron diffraction, in general several Bragg reflections are excited 



Fig. 6. The Ewald sphere and the reciprocal space. The intensity distributions IF'(k)I 2 around the reciprocal 
lattice points are spike-shaped with spike lengths inversely proportional to the specimen thickness. The 
excitation error s is positive for g=g, and negative for the other g. 
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simultaneously; besides the primary wave a number of diffracted plane waves leave the 
crystal. However, in general the specimen can be orientated so that only one diffracted 
wave is strong. Then, we speak about a two-beam case (primary plus diffracted beam). 
Images taken with either of these two beams are called strong-beam images. On the other 
hand, dark-field images taken with an extremely weakly excited beam are called weak- 
beam images. 

The direction of incidence of the primary beam with respect to the specimen can be 
characterized by the excitation error s g , or simply s, which is given by the distance 
between the reciprocal lattice point g considered, and that point on the intensity spike 
where the Ewald sphere cuts through it. s is positive (negative) if the point g lies inside 
(outside) the Ewald sphere. If 6 is the glancing angle between the direction of k 0 and the 
lattice planes belonging to g and if 0 g is the corresponding Bragg angle, then 

s = gA0, with A 0 = 0- 0 g . (4) 


43. Outline of the dynamical diffraction theory 

The essential points of the dynamical diffraction theory, in contrast to the kinematical 
theory as outlined in the appendix, are that diffractions between all plane waves involved 
have to be treated as being equivalent and that, in the absence of absorption effects, 
conservation of intensity must be fulfilled. There are several ways for formulating the 
dynamical theory for a perfect crystal. Here we refer very briefly to the classical way 
first developed by Bethe [1928]. Another way will be outlined later when crystals with 
lattice defects are considered. 

Following Bethe, the time-independent Schrodinger equation with a periodic potential 
is solved in terms of independent “eigen solutions” or Bloch waves ip B . Generally, Bloch 
waves can be written in the form 

'/'bW = H r ) exp(2iriJt ■ r), (5) 


where the wavevector K has to be found for a given electron energy and a given 
direction of K. The function b(r) is periodic with the crystal periodicity. In the two-beam 
case, to which we restrict ourselves, there are two independent Bloch waves belonging 
to a given tangential component of K parallel to the electron entrance surface of the 
crystal. We assume that the two-beam diffraction vector g is perpendicular to e 2 and that 
the origin of the coordinate system lies on a reflecting lattice plane. Before the solutions 
are presented we introduce some terms which will be used repeatedly later. One 
important diffraction parameter is the extinction length 


" m ’ 


( 6 ) 


where V e is the volume of the elementary cell and F s is the structure amplitude, see 
appendix. For low-order reflections £ g is of the order of some 10 nm. Besides £ g we use 
the “wavenumber” notations 
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1/fg = <t q , <ts(t( W ) = (t 0 .(1 + w 2 )‘ /2 , w = sg g . 


(7) 


w is the “normalized” excitation error and a - is the reciprocal of the effective extinction 
length £ geff as a function of w. The wanted solutions, expressed in these terms, are (e.g., 
Hirsch etal . [1977]): 


0b° = C ( 0 l) exp^rri^ 0 • r)], 
0 b* = c< 2) exp[27ri(^ 2) • r)]. 


- 0 ) _ r (2) - J_ 
0 8 ~V2 


-|l/2 


1 -■ 


w 


( 1+ 


W 


c 0) = _c< 2 > = 
c g L o 


V2 


-,1/2 


1 + 


W 


/ / \l/2 

|^/l + w 2 j 


= Jfe 0 + M (,) , 
M (,) = + <r)e z , 


=*o +S. / = 1. 2, 


A* (2) = 


(9) 


The M (1) are the “eigen values” and the c„ <0 (n = 0, g) are the components of the “eigen 
vectors” of the Schrodinger equation. The relation between eqs. (5) and (8) is easy to 
realize if the phase factor exp( InK® ) r) is put in front of the two terms of 0 ( b in 
eq. (8). 


If the i/» B are known, a linear combination of the ip ( £. 


( 10 ) 


has to be found which satisfies the boundary conditions at the specimen surface at z = 
0 where both partial waves with the wave vector K 0 have to add up to unity so that at 
z = 0 they fit to the incident primary wave: 

i P 0 ~ 0o exp[2m(V r)], 0 O = 1 for z < 0. (11) 


Further, both partial waves with wave vectors K g have to cancel at z = 0. In our case, this 
leads to 


„<■> - 


< P i2 ) = c «\ 


( 12 ) 


At the lower specimen surface at z = t, the Bloch waves decompose into their partial 
waves, and those having the same subscript, 0 or g, interfere and constitute a plane wave 
below the specimen. Thus, we obtain for z > t: 

0o = 0o (0 exp[277i(* 0 • r)\ 0 g = 0 g (t) exp[27ri(& g • r)], 
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with 

* g = * 0 + se z + g, |* g | = |* 0 1, (13) 

and 

<f>Jt ) = cos Trot - i W sin tt at, (f>(t) = i - l/2 - sin iron. (14) 

(l + w 2 ) (l + w 2 ) 


4.4. Normal and anomalous absorption 


High-energy electrons, when passing through a crystal, experience, besides the elastic 
scattering at the atom potential, also inelastic scattering, e.g., by interaction with the 
thermal vibration of the crystal atoms (phonons) or with the crystal electrons (plasmons, 
inner-shell excitation). By these events energy is transferred between high-energy 
electrons and the crystal, which leads to a loss of coherency of the wave fields of the 
elastically and inelastically scattered electrons. This effect can formally be described as 
an “absorption” (although, of course, the inelastically scattered electrons are not really 
absorbed). This “absorption” is different for different Bloch waves, depending on the 
high-energy electron density distribution li/f B l 2 with respect to the atom positions 
(Hashxmoto et al. [I960]). From eqs. (8) and (9) it is easy to derive that li/' ( B 1) l 2 reveals 
a maximum electron density at the lattice planes characterized by g and a minimum in 
between; for li// B l 2 the reverse is true: 


K‘ f = 1 + 7 cos 2ir(g ■ r), 

(1 + w ) 

Kf = 1 - 7 — 72 - cos 2 ir(g ■ r). 

(l + w 2 ) 


(15) 


Accordingly, i/Zg interacts more strongly with the crystal atoms and is thus more 
strongly absorbed than iAb* ■ We subdivide the absorption into normal absorption, which 
a high-energy electron would experience when travelling through the crystal far away 
from any Bragg reflection, and anomalous absorption, which takes the structure of I i/'bI 2 
into account. Normal absorption is accounted for by adding a common factor 
exp(-/Lioz/2), where yu, 0 is usually expressed by the “normal” absorption length £ 0 ', with 
/u 0 = 2 The anomalous absorption is introduced by adding a positive imaginary part 

to l/f„ 


iA g ~ 



(16) 


where typically £ 0 ' = (10-20)£ g (Radi [1970]). Accurate values of £ 0 ' are not well known 
since they depend, e.g., on the size of the objective aperture. Normal absorption acts only 
as a scaling factor, independently of the actual diffraction conditions. Therefore, accurate 
values of are not required, and often &=£ g is used for intensity calculations. 
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Working through the abbreviations introduced in eq. (7) leads in first order of 
to the substitution 


a a + i a', a’ 


5(i 


+ w 


This finally gives: 

< “ exp[- 1 pt! n z\ l = 1, 2, 


with 


(17) 


fi m = fi 0 + 2va\ n 2 = fi 0 - 277(7', (18) 

i.e., the absorption of i/'b > ( , /' < b ) ) is enhanced (reduced). (£ g is usually left uncorrected 
where entering into the c^.) 


4.5. Dynamical bright-field and dark-field intensities 

The same substitution as eq. (17) must be applied to the argument var in eq. (14). 
After some algebraic operations, we find for the bright-field (7 0 ) and dark-field (7 g ) 
intensities: 


h = ex P(~7V) 

h = ex P(-MoO 


cosh 27 ra't + 


w sinh 2va't sinh 2 va't + sinh 2 vert 


(l + w 2 )" 

sinh 2 77 a't + sin 2 vat 


1 + W 


(19) 


1 + W 


In fig. 7a, b we have plotted I 0 (t) and 7 g (r) as a function of t for w = 0 and w= 1 (with 
^ g = 10£ g ). We see that the total intensity oscillates between 7 0 and 7 g , a maximum 
in 7 0 corresponds to a minimum in 7 g and vice-versa (“Pendellosung”). These oscillations 
are due to the fact that the z-components of the wavevectors of the Bloch waves i/Zg and if/™ 
differ by a, which leads to a beating of the partial waves constituting ip 0 and t// g , 
respectively. The full oscillation period A t and the oscillation amplitude 

decreases with increasing Iwl. Further, the oscillations are damped with increasing t 
which is a consequence of the anomalous strong absorption of if has decayed, 
a beating between partial waves is no longer possible. On wedge-shaped specimens the 
Pendellosung-oscillations give rise to “thickness fringes” or “thickness contours” along 
lines of constant specimen thickness. An example is shown in fig. 7c, d. 

In fig. 8a, b we show 7 0 and 7 g as a function of w for some values of t. Both terms 
show oscillations with varying w (“bend contours”), which decrease in amplitude with 
increasing t. Whereas 7 g is symmetric in w, this is not true for 7 0 : the maximum in 7 0 (i.e., 
best transmittivity!) occurs at w>0. A complicated system of bend contours is visible in 
fig. 8c. 
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Fig. 8. (a, b) Two-beam bend contours calculated for £'=£□ = 10f g as a function of iv, with (a) I = 2.5£ g , and (b) 
/ = 5f g . (c) Bright field bend contours in copper. The dark lines correspond to specimen orientations where a 
certain set of lattice planes is in Bragg orientation. The moire patterns in the centre of the figure are due to a 
thin oxide layer on the specimen surface. 
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4.6. The column approximation 

We consider in fig. 9 the point C at the lower specimen surface. The lines through A 
and C and through B and C are parallel to the wavevectors k 0 and kg of the primary and 
diffracted wave, respectively. Triangle ABC is known as the Takagi-triangle. It has been 
shown (Takagi [1962], Howie and Basinski [1968]) that the wave functions ij/ 0 and i j/ g 
at point C are determined in very good approximation only by the specimen parameters 
within this triangle. With specimen parameters we mean local specimen thickness, local 
orientation, i.e., local excitation error, and local displacements of atoms from their 
positions in a perfect lattice. For high-energy electrons the Takagi triangle is very 
narrow: The angle at C is 2 0 g with the Bragg angle 0 g being about 1° or less. Taking 
t= 100 nm, it turns out that the distance A-B is some nm at most. Accordingly, if 
variations of the specimen parameters over such a distance are sufficiently small, then ip 0 
and i/'g at point C can be calculated under the assumption that the specimen parameters 
along the “column” above C, i.e., along the z-axis in our case, are the same for the entire 
specimen. This means that for calculation of ip 0 and i fi g the “column” coordinate is the 
only intrinsic variable, whereas the lateral coordinates (x, y in our case) act only as 
parameters. This is the column approximation introduced by Hirsch etal. [I960]. 

4.7. Diffraction at imperfect crystals 

4.7.1. The displacement field 

Lattice defects cause displacements R(r„) = Rn of the atoms from their positions m in 
the defect-free reference lattice*. In general, IIJJ is of the order of, or smaller than, the 
interatomic distances. Here we assume for simplicity that the displacements do not vary 
appreciably over the atom positions within the elementary cell so that the structure- 



Fig. 9. The Takagi triangle. The wave functions ip 0 and ip g at point C are determined by the specimen 
parameters within triangle ABC. 


Regarding the notation R, cf. the footnote in §5.2. 
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amplitude F s is unaffected by the lattice defects. Further we confine ourselves to cases 
where R is a continuous function in space, giving thus rise to “strain contrast”. Contrast 
due to stacking faults and other planar defects will be considered briefly in § 8. 


4.7.2. The kinematical approach 

A first insight in the intensities I 0 (x, y) and / g (x, y) for perfect and imperfect crystals 
may be obtained by the kinematical diffraction theory outlined in the appendix. We start 
from eq. (A.46), where we neglect, as usual, the term s»R in the exponent. Thus, the 
kinematical diffraction amplitude F(s ) follows as: 

H s ) = J v expj-27ri[g • R(r) + s • r]}d 3 r, (20) 


where the integration runs over the specimen volume V c = L x L y t with /«:L x ,L y . The 
diffracted intensity follows by inserting eq. (20) into the integral of eq. (A.35). We 
assume that g is perpendicular to the z-axis and make use of the fact that — because of 
the smallness of the Bragg angle — the Ewald sphere near g is (nearly) parallel to the 
x-y plane. Thus, the differential d/ E in eq. (A.35) can be replaced by dLy 1 dLy 2 [the 
coordinate s 3 in the appendix is identical with the excitation error s in eq. (4)]. Now 
having in mind that, within the column approximation, the terms x, y in R(x, y; z ) are not 
coordinates but only parameters, it turns out that all integrations in eq. (A.35) with F(s) 
according to eq. (20) can be carried out, except those over the z-axis. After dividing the 
result by the specimen area we get for the diffracted intensity per unit area: 


h( x ’ ^ = [^1 !j 0 W{-27ri[g • R(z) + *z]}dz 


( 21 ) 


Thus, with eq. (6), the amplitude <f> g follows as: 

(f> e (x, y,t) = j~ <f> o J o expj-2-7ri[g • R(x, y; z) + sz]} dz. (22) 


Here we have added the amplitude <f> 0 , of the primary wave, which is constant (!$ 0 I= 1) 
within the kinematical approach. Equation (22) is the kinematical contrast integral de- 
rived by Hirsch etal. [1960] who have shown that the imaginary unit i should be added. 
The argument in the exponent of eq. (22) can be re-interpreted as 


g • R(z) + jz = j o \(£)d£, 

with 


Mz) = s + 


d(g • R(z)) 
dz 


(23a) 


(23b) 


being the local excitation error experienced by the primary wave as due to the local 
orientation of the reflecting lattice planes. Diffraction from (f> 0 into $ g is especially strong 
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where s,(z) is small (Cockayne etal. [1969]). For /?=0 (perfect crystal) eq. (22) can be 
integrated and yields, with l$ 0 l= 1, 


/g = £ 


1 sin 2 7 rts 


(24) 


to which the “dynamical” solution eq. (14) degenerates if absorption is neglected and 
l.s£ g l = lwl» 1. Thus, the range of applicability of eqs. (21)-(24) is restricted to large 
values of Iwl, for which I g « 1, for instance to the weak-beam imaging mode (Cockayne 
et al. [1969], and Cockayne [1978]), where the diffracted beam taken for the image 
formation is excited with lwl» 1. 

Equation (22) can be rearranged in order to separate the defect-induced part of the 
contrast integral. By partial integration we get 

4> s =4>' g + 4>;, (25) 

with 


ivr , exp{-27ri[g ■ R(z) + sz]} 
-T<Po 


— 27rLy 


z=0 


(26a) 


4>g=-j-4>o~ ^ ex p{~ 27ri [s - R ( z ) + «]}<k- ( 26b > 

If g»R(t) and g»R( 0) are neglected in eq. (26a), then </>' represents the “background 
amplitude” with identical to (24). 4>" is the defect-induced part of 4> g - Since the 
integrand of eq. (26b) is proportional to the displacement derivative d(g»/?(z))/dz, the 
integration limits can in general be extended to ±°°. In the dark-field intensity 7 g =l$ g l 2 
the interference term between and 4>" is often neglected (cf. e.g., Hirsch etal. [1960] 
and Wilkens and Hornbogen [1964]): 

7 .M = M 2 + \^( x >yf- ( 27 > 

Then adjusting to conservation of intensity gives the bright-field intensity: 

/ 0 (x,y)-(l-|^| 2 )-|^(x,y)f, (28) 

i.e., the kinematical contrast in dark field (bright field) is always bright (dark). Later we 
will see that this is not true if Iwl is not » 1. 


4.7.3. Dynamical diffraction theory in terms of plane waves 
In §4.3 we have outlined that one of the prerequisites of a dynamical diffraction 
theory is that all plane waves involved must be treated equivalently, i.e., diffraction 
between all plane waves must be taken into account. Equation (22) suggests intuitively 
one way by which this condition can be met: Both sides of eq. (22) are differentiated 
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with respect to t, then we set t-z and concede that also <f> a may be z-dependent. In a 
next step we construct an equivalent equation, describing transition from the diffracted 
into the primary wave, i.e., k g and k 0 change their roles. This requires a change of the 
signs of g and s. Then we end up with a set of coupled differential equations: 

d0 g /d z = i(Vf g )0o(z) exp{-27ri[g • R(z) + .sz]}, 

(29) 

d^o/dz = i(Vf g )0 g (z) exp{+27ri[g • R(z) + sz]}. 

These equations are indeed one form of the differential equations established by Howie 
and Whelan [1961] in a more detailed way for describing strain contrast by dynamical 
diffraction in imperfect crystals. These equations have to be integrated down to z = t with 
the boundary condition tf> 0 = 1, <f> g = 0 at z=0. For R-0 (perfect crystal) they can be 
integrated analytically; the result is identical to that given in eq. (14). 


4.7.4. Dynamical diffraction theory in terms of Bloch waves 
A different approach for deriving dynamical diffraction in imperfect crystals starts 
from the Bloch-wave solution of the perfect reference lattice as given in §4.3 (Wilkens 
[1964], Haussermann etal. [1973], Wilkens etal. [1973]). A trial solution, in the two- 
beam case given by 

^ = <P m {zW+<p l2) {z)^\ (30) 


with z-dependent Bloch- wave amplitudes <p w is inserted into the Schrodinger equation 
which now contains a potential, the periodicity of which is perturbed by the atomic 
displacement field R(r). This leads in the column approximation to a set of ordinary 
differential equations for the <p {,) . In the two-beam case one possible representation is 
given by: 


d<p (1) /dz = i it 


d(g • R) 

dz 


d<p (2) /dz = iir 


d(g-*) 

dz 


(l + lf 


<p (2) (z) • exp 
<p w {z) ■ exp' 


|-27ri 

r s d(».*) + jj 

l 

<r dz JJ 

!+27ri 

r s d fe.*) + jj 

1 

ar dz JJ 


(31) 


These differential equations describe the defect-induced transition from Bloch-state (2) 
into Bloch-state (1) (first equation) and in the opposite direction (second equation). This 
transition is proportional to the displacement derivative d(g«J?)/dz and is essentially 
controlled by the phase factors exp(± 27ri<rz). Sufficiently far away from the defect, 
where d(g«/?)/dz vanishes, the differentia] equations obey trivial solutions, <p (l> and <p (2) 
are constant. Once eq. (31) is integrated down to z = t with <p„(0) = c 0 <0 the wanted 
amplitudes <p„(t) (n - 0, g) are obtained as outlined in § 4.3 with <p U) (t) inserted into eq. 
(10). Anomalous absorption is taken into account by substituting a in the exponents of 
eqs. (8) and (31) according to eq. (17). Normal absorption can be added subsequently. 

Both sets of differential equations, eqs. (29) and (31), are in principle equivalent 
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(Wilkens etal. [1973]). However, one consequence of the special form of the equations 
(31) is that they are especially suitable for deriving semiquantitative analytical solutions 
(e.g., Wilkens [1964], Chik etal. [1967], Wilkens and ROhle [1972] and Katerbau 
[1981]). 


4.7.5. Properties of strain contrast in strong-beam images 
Figure 10 shows strong-beam images in bright and dark field of dislocations 
traversing the specimen from the top to the bottom surface. Near the surface the 
dislocation contrast reveals characteristic bright-dark oscillations, which are “in phase” 
in bright and dark field near the top surface and which are “in anti-phase” near the 
bottom surface. Further, the bright-dark oscillations are reversed if the sign of g is 
changed. In the middle of the specimen the contrast is mainly dark in both bright and 
dark field. These phenomena, which are of quite a general nature, will be explained by 
a qualitative discussion of eq. (31). For this purpose we re-define the Bloch-wave 
amplitudes <p m into by incorporating the absorption terms of eq. (18), which are 
incorporated in the usual procedure when going from the ip m (t) to <p„(t), 

<p (l) {z) = <p (,) (z)exp — — z, l = 1,2, (32) 

and restrict ourselves to dynamical diffraction, i.e., w = 0. The results, however, are valid 
also for Iwl < 1. In a perfect crystal, where <p m and <p i2) are constant, <p (,) (z) decreases 
rapidly with z, whereas <p <2) is only weakly damped (£><£'). We assume a fairly thick 
specimen, so that <p (l) has already decayed significantly in the middle of the specimen, cf. 
fig. 11. We consider the three regions (1), (2) and (3), indicated in fig. 11, separately. 
Now concerning contrast formation by a defect (e.g., a dislocation segment in fig. 10), 
we consider one particular column close to the defect, and the depth position Zq of the 
latter is varied over the regions (1), (2), and (3). 

If Zq lies in region (1), where <p (l) and <p t2) are still of about equal strength, the 
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Fig. 10. Diffraction contrast of a single dislocation (S) and a narrow dislocation dipole (D) in copper traversing 
the specimen from the top (T) to the bottom (B) surface g = {220}. (The black-white dots in the background 
are due to ion damage (cf. §6.2.3) inside the microscope, as caused by an insufficient vacuum in the 
microscope column.) (a) Bright field (g*b) = 2; (b) dark field (g»b) = 2; (c) bright field (g*b) = 0; (d) dark field 
(£•*) = 0. 
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'0 — z t 


Fig. 11. Properties of strain contrast in strong-beam images taken with w=0, schematically. Shown are the 
moduli of the Bloch-wave amplitudes tp l ( z), /= 1, 2, for defect depth-positions Zo in either of the regions (1), (2) 
or (3), for details see text, (a): I0 <2 '(/)I, Zo in (1); (b): !<^> <2> (r)l bg ; (c): l$ (2) (/)l, Zo in (3); (d): l$ (2) (/)l, Zq in (2); (e): 
zo in (3). 

scattering within a narrow interval near Zq will increase (or decrease) l£ (1) l and at the 
same time decrease (or increase) l<p (2) l, depending on the sign of d(g»R)/dz and on the 
phase factors exp(± Iit'vtz^ on the right-hand sides of eq. (31). The phase factor is 
periodic in z 0 with period £ geff . Below the defect, the <p (t) in eq. (31) are constant again 
and the follow the decay as described by eq. (32). Then, going down to z = t, <p (l) has 
decayed irrespective of what happened to <p m at, or near to, Zq. Thus, the amplitudes <p 0 (t) 
and </> g (f) at z = t are both determined solely by <p a \t). And since the modulus of the latter 
oscillates with the period Az 0 =£ gefr we get bright-dark depth oscillations in both bright 
field and dark field, which are “in phase”: bright (dark) contrast in bright field corres- 
ponds to bright (dark) contrast in dark field. 

In region (2) we have already l<£> (1) l «: l<p (2) l. Accordingly, for defects with z 0 within this 
region the scattering goes essentially only in one way from <p i2) to tp ( '\ since there is 
(nearly) nothing to scatter from <p m to <p {2) . So l<p (2) l is decreased irrespective of the actual 
value of Zo- On the way down to z = l, <p (1) has again decayed, and the amplitudes <p 0 (t) 
and <f> g {t) are, as in region (1), solely determined by <p U) (t). Since now l$ <2) (/)l < l<p <2) (/)l bg , 
where the subscript bg refers to the corresponding value in the background far away 
from the defect, we obtain a dark contrast in both bright field and dark field, in striking 
contrast to the kinematical approach, cf. eqs. (27) and (28). 

If Zo lies in region (3), the scattering occurs even more than in region (2) only from 
(p (2) to <p m . However, now <p il \ on its enhanced level below z 0 , can “survive” during the 
short distance from Zq to t, along which absorption may be negligible. Consequently, the 
amplitudes <j> 0 and <p g at z = t are determined by the interference of partial waves as 
described in §4.3. The mutual phase shifts of these partial waves are given by exp 
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[+ 27riff(?-z 0 )], and since absorption can be disregarded over the distance t-z 0 , the depth 
oscillations in bright field and dark field must be complementary or in “antiphase”. Of 
course, in practice, these three regions are not sharply separated. Depending on the actual 
specimen thickness, we expect depth oscillations, which are “in phase” in bright field and 
dark field for defects at positions z 0 close to the top surface of the specimen. These depth 
oscillations are damped out with increasing Zo- For thick, or moderately thick, specimens 
they disappear, or nearly disappear, for z 0 in the interior of the specimen, resulting in 
only a dark contrast in bright and dark field. Finally, they reappear in “antiphase” for z 0 
close to the lower specimen surface, see fig. 10. 

4.7.6. Structure-factor contrast 

Coherent precipitates may reveal a mismatch between the lattice parameters in the 
precipitate and the surrounding matrix. This mismatch gives rise to a space-dependent 
displacement field and, accordingly, to strain contrast as described by eqs. (22), (29) 
or (31). If, in addition, the structure-amplitude F s inside the precipitate differs from that 
of the matrix, e.g., by a change of the chemical composition, an additional contrast, 
named structure-factor contrast may be caused. The reason is that now the extinction 
length £ g (proportional to 1 /F s ) becomes space-dependent, £ g = £ g (r). This can be incor- 
porated into eqs. (22) and (29) by setting l/£ g = 1/ £ g (x, y; z) and taking l/£ g in eq. (22) under 
the integral (cf. Ashby and Brown [1963] and Wilkens [1981]). For eq. (31) an approp- 
riate extension for including structure-factor contrast was given by Wilkens etal. [1977]. 

4.8. Practical applications of the differential equations 

Standard programs are available for the numerical integration of the differential 
equations (29) or (31) by electronic computers. The user has only to take care for 
subroutines for computing the displacement field R(x,y; z) or its derivative dR(x,y; z)/dz. 
If the diffraction contrast of a particular defect is to be calculated as a function of the 
specimen thickness t and the depth position z 0 of the defect centre, considerable computer 
time can be saved by making use of the method of “generalized cross-section” (Head et 
al. [1973]), referring to eq. (29), or of the scattering matrix method (Haussermann etal. 
[1973], and Katerbau [1980]), referring to eq. (31). In many cases computer time can 
additionally be saved by application of symmetry relations inherent in the particular 
diffraction contrast problem or in the displacement field of the defect (Howie and 
Whelan [1961], Ball [1964], Pogany and Turner [1968] and Katerbau [1980]). 

5. Dislocations 

5.1. Introduction 

Dislocations and dislocation distributions, e.g., in deformed crystal, are generally best 
investigated in the two-beam bright-field mode operated at a sufficiently positive 
excitation error, which ensures best transmittivity, cf. §4.5. This means a maximum 
specimen volume accessible by TEM and avoidance of pronounced black-white depth 
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oscillations for dislocations close to the specimen surfaces (§ 4.7.5), which may perturb 
the image. On the other hand, if details of the dislocation structure, e.g., splitting into 
partials etc., are of interest, the weak-beam technique Iwl » 1 is superior (Cockayne et 
al. [1969]), at the cost of transmits vity. 


5.2. The displacement field 

We assume elastic isotropy and introduce the displacement field R of a straight 
dislocation in its own (right-handed) coordinate system x, y, z. The dislocation line runs 
along the /-axis with its line unit vector u pointing from / <0 to / >0. The Burgers 
vector b lying in the x'-y plane can be split according to b=b s + b e , where the screw 
component b s is parallel, and the edge component b e is perpendicular to u. Then R is 
given by (e.g., Hirsch etal. [1977]) *: 

1? = l? a + l? s , (33a) 


R , = 


27T 


b artan — + A, 
x' 


_J xV 

2(1 - v) r 2 


(33b) 


R i= --(bxu)\ 


1-2 
2(1 - v) 


In r/r 0 + 


x 1 - z' 2 


4(1 _ v) 


(33c) 


with v= Poisson’s ratio, r-x 2 + z 2 , r 0 = inner cut-off radius. Note that Jf a is antisym- 
metric and R s is symmetric in x\z'. Further, R changes its sign if the sign of either b or 
u is changed. This is obvious for R s but must be noticed also for R a . 


5.3. Contrast profiles of single perfect dislocations 

We assume that b is a translation vector of the crystal structure (b is a “perfect” 
Burgers vector) so that g*b is integer. Regarding the contrast profiles of such dislocations 
we have to distinguish between the cases g»b £ 0 and g*b = 0. 


5.3.1. The g«b * 0 contrast 

In case of g*b # 0 the contrast is mainly governed by the component R a of eq. (33b). 
This component causes an S-shaped bending of the reflecting lattice planes as schematic- 
ally indicated in fig. 12. We assume and recall the fact that diffraction contrast is 
especially strong (weak) where the local excitation error s/z) is decreased (increased) as 
compared to the background value r, cf. §4.7.2, in other words where the reflecting 
lattice planes are bent by the dislocation displacement field towards (away from) the 
exact Bragg orientation. Consequently, for s * 0 the centre of gravity of a dislocation- 


* In the TEM literature it is customary world-wide to denote the displacement field vector by R and the 
dislocation line unit vector by u. Here we adopt the same notation, although in ch. 12 the symbol u is used for 
the displacement vector and R is used for denoting particle radii in various connections. 
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Fig. 12. Diffraction contrast of an edge dislocation for (g‘b)s* 0, i>0, schematically. Solid arrows represent 
the direction of incidence of the primary beam. Dashed lines represent local directions of incidence which 
would fulfill the Bragg condition. The angle between the two types of arrows is a measure of s,(z). Bright field 
intensity is denoted by I BF . 


contrast profile does not coincide with, but is shifted with respect to the image position 
of the dislocation line. Assuming a given direction of the line unit vector u the direction 
of this lateral contrast shift depends on the sign of (g»b)s , as will be demonstrated below 
in connection with the images of dislocation dipoles. 

The kinematical contrast integral <f>", cf. eqs. (26)-(28), was evaluated for different 
types of dislocations by Hirsch etal. [1960] and Gevers [1962], It turned out that the 
width of the kinematical dislocation contrast is about (l/3-l/2)f K , where £ K = s -1 denotes 
the kinematical extinction length. If dynamical diffraction has to be taken into account 
(M<1) this result remains essentially valid, with £ K substituted by £ geff =o- -1 [eq. (17)] 
(Howie and Whelan [1962]). It should be added that the contrast of a dislocation line 
is always single-lined for lg*Z>l = 1 but may be (asymmetrically) double-lined for \g»b\-2 
and 0<M<1. 

Under normal strong-beam (bright-field) conditions, f geff can be as large as several 
10 nm, whereas under weak-beam conditions ij K = s~ l can be kept as small as 5 nm 
(Cockayne etal. [1969] and Cockayne [1978]). Consequently, dislocation images are 
much sharper in the latter case, cf. fig. 13 where the same specimen area is imaged 
under strong-beam and weak-beam imaging conditions, respectively. Of course, the gain 
in resolution is paid for by a reduction of the useful specimen thickness. 

5.3.2. The g*b = 0 contrast 

In case of g«b = 0 the dislocation contrast is completely extinguished (g»R = 0) for 
screw dislocations and also for edge dislocations if in the latter g is parallel to the 
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Fig. 13. Dislocations in silicon: (a) strong-beam bright field image, n' > 0; (b) the same area imaged under dark- 
field weak beam conditions, tv» 1. 


dislocation line. For g»b = 0 but \g*b x u\ * 0 a “residual” contrast due to the displacement 
component R s in eq. (33c) is produced, which is symmetric with respect to the image 
position of the dislocation. This residual contrast is controlled in strength by the modulus 
\g»b x u\ and is in general significantly weaker than the g»b * 0 contrast and can thus be 
distinguished from the latter. Depending on the strength of \g»bxu\ the contrast may 
consist of a single, a double or a triple contrast line, cf. e.g., Howie and Whelan [1962] 
and Hirsch etal. [1977], 

Typical dislocation images for g»b * 0 and g»b - 0 are shown in fig. 10. Note that the 
depth-oscillations where the dislocations approach the specimen surfaces (§ 4.7.5) show 
different symmetries for g»b * 0 and g*b = 0. These differences are occasionally helpful 
for discriminating between the two cases. 

For dislocation of mixed type, complete contrast extinction is not possible at all, since in 
such a case the terms g*b, g*b x u and g»b t cannot be zero simultaneously. If the first two 
terms are zero but£»6 c ;£ 0 [cf. eq. (33b)] an asymmetrical residual contrast is produced. This 
is demonstrated in fig. 14 where a curved dislocation segment, which changes in type from 
screw to edge, is imaged under g*b-2 conditions (fig. 14a, strong symmetrical double- 
line contrast) and g»b = 0 conditions (fig. 14b). In the latter case the dislocation is out of 
contrast in the pure screw and the pure edge orientation but reveals an easily detectable 
asymmetric contrast around the “45°-orientation”, where g»b and (g*b x u) = 0, but 
g»b e * 0. Here b c refers to the component of b perpendicular to the local line direction u. 

5.4. Contrast of dislocation pairs 

Two parallel dislocations having opposite Burgers vectors b constitute a dislocation 
dipole. Another description is that b is the same for both dislocations but their line 
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Fig. 14. Strong-beam images of a curved dislocation segment of Burgers vector fc = j[110]: (a) g = (220), 
(g‘b) = 2, dislocation is in strong contrast; (b)g = (220), (g‘b) = 0, the dislocation contrast is extinguished only 
at the edge- (E) and the screw (S) orientations. (From Korner and Karnthaler [1980].) 


vectors u are opposite. The most common representative is a dipole constituted by two 
pure edge dislocations. If such a dipole is imaged with s * 0 and g*b * 0, then (g»b)s > 0 
for one and (g*b)s < 0 for the other dislocation. Consequently, the contrasts of the two 
dislocations are displaced into opposite directions: The distance between the two contrast 
lines is either smaller or larger than the true projected distance between the dislocations. 
If g is changed into -g , while leaving the sign of s unchanged, the two cases are 
interchanged. This phenomenon is called inside-outside contrast. An example is shown 
in fig. 15a, b. 

In other cases, the two parallel dislocations may show up the same sign of g*b. This 
may occur in an ordered alloy where two dislocations with the same perfect Burgers vector 
b of the disordered lattice constitute a split superdislocation (2b- perfect Burgers vector of 
the ordered alloy) with an antiphase boundary between them. Another example refers to a 
perfect dislocation which is split into partial dislocations (Burgers vectors b p ) and where 
the signs of g*b p are the same for both partials. If in such a case the distance between the 
two dislocations becomes of the order of, or smaller than, £ g eff (which may be = 1/s in 
the weak-beam case) the contrast of one of the two dislocations increases in strength and 
the contrast of the other decreases, depending on the signs of the (g*b p ) (Wilkens and 
Hornbogen [1964], Stobbs and Sworne [1971] and Cockayne [1978]). This is 
demonstrated in fig. 15c, d, showing the weak-beam images of an edge-dislocation dipole 
in Ag where the individual dislocations D, and D 2 are split into partials. For both dislocat- 
ions the two partials give rise to the same sign oig*b r which, however, is opposite for D, 
and D 2 . Accordingly, the sequence “strong-weak” is opposite for D, and D 2 . Further, the 
sequence is changed when changing the sign of g (indicated by the arrows). 

Below a critical distance the contrast of the weakly imaged dislocation disappears in 
the contrast tail of the strongly imaged one. This effect limits the resolution of dislocat- 
ion pairs having the same sign of (g*b) even on weak-beam images to about 2 nm. This 
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Fig. IS. Diffraction contrast of dislocation pairs taken with # = ±(202): (a,b) Strong-beam bright-held images 
of dislocation dipoles in copper, taken with w>0. Dipole 1 is clearly resolved in (a) but unresolved in (b). The 
distance between the dipole dislocations 2 and 2' is narrow in (a) and wide in (b). (cs d) weak-beam images 
of dislocations in silver which are split into partials. The dislocation at the top (bottom) is named D, (D 2 ) in 
the text. (From Korner and Karnthaler [1981].) 


lower limit is important for the evaluation of stacking-fault energies from the measure- 
ment of equilibrium distances of split dislocations, cf., e.g., Stobbs and Sworne [1971], 

5.5. Determination of the dislocation Burgers vectors and the dislocation densities 

The determination of the Burgers vector b of individual dislocations requires a 
number of images of the corresponding specimen area taken under different twobeam 
conditions. If a given dislocation shows contrast extinction or only residual (g*b- 0) 
contrast for two (non-parallel) diffraction vectors, say g, and g 2 , then its Burgers vector 
b is perpendicular to g, and g 2 (regarding the sign of b see below). The Burgers vector 
analysis is considerably facilitated if the general crystallographic direction of b can be 
anticipated, for instance A = j<110> in a fee lattice. In this case three images taken with 
three diffraction vectors g = { 1 1 1 } are sufficient for indexing all dislocations with 
ft=j<110>, since the six significantly different vectors b of this kind give rise to a 
different contrast-extinction behaviour for the three g = { 1 1 1 } (Hirsch et al. [1977]). The 
series of micrographs shown in fig. 16 stems from a deformed and subsequently annealed 
Ni crystal and was taken from a specimen parallel to the primary slip plane (111) with 
g a = ( 1 1 1), g b = (1 1 1 ), g c = (lll), where the subscripts a, b, c refer to figs 16a, b,c. 
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Fig. 16. Dislocation network in nickel. Two-beam bright-field images of the same specimen area, specimen 
normal (111). Three types of segment directions (see text) of the network are indicated by open circles (1), 
crosses (2), and dots (3). (a)g = (lll), (b) g = (1 1 1), (c)g = (lll). The small arrows point to the same position 
on the three images. 


respectively. In the central part of figs. 16b, c a network is visible constituted by three 
types of dislocation segments, as indicated in the figures. The segments (1) give rise to 
a g»b z 0 contrast on fig. 16c and to a residual (g*b - 0) contrast on fig. 16b and (scarcely 
to see) on fig. 16a. Accordingly, we have 6,=±j[101]. The segments (2) show g»b# 0 
contrast on figs. 16b and c, but not on fig. 16a. This leads to = — i [110]- The segments 
(3) are in g»b#0 contrast only on fig. 16b, thus A 3 = ±j[ 011]. Taking always the upper 
sign, we have b 2 + b i =b l as it is required for dislocation networks. Other dislocation 
segments in figs. 16a-c can be indexed in the same way. 

For a full indexing of a dislocation a discrimination between +b and -b (for a given 
choice of the line direction u) is required. This can be achieved by making use of the 
fact that the lateral shift of (g»b)s* 0 contrast lines is indicative for the sign of the 
S-shaped bending of the reflecting lattice planes from which the signs of u and b can be 
determined by physical arguments. In this context, certain contrast peculiarities due to 
surface-stress relaxation effects may also be helpful, cf„ e.g., Siems et al. [1962], 
Tunstall etal. [1964], Wilkens et al. [1967] and Hazzledine et al. [1975]. 

The dislocation density p is defined as the (average) dislocation length per unit 
volume or, in a somewhat misleading way, as the number of dislocations per unit area. 
A full and reliable evaluation of p requires rather extensive experimental work. Here we 
stress the attention to one aspect only: On a given set of micrographs taken with a given 
two-beam diffraction vector g dislocations of some slip systems may be fully in contrast 
(g»b# 0). Dislocations of other slip systems may be visible in residual contrast (g»b = 0) 
only or may be completely invisible. This fact suggests immediately that any reliable de- 
termination of p requires a careful evaluation of different sets of micrographs of the same 
specimen area taken with different g-vector, cf., e.g., Steeds [1966], Essmann [1966]. 

5.6. Elastic anisotropy 

Experience has shown that regarding the fundamental properties of the dislocation 
contrast of dislocations, elastic isotropy is a useful approximation even for moderately 
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anisotropic crystals like copper. However, for strongly anisotropic materials such as 
/8-brass the contrast rules, described above on the base of elastic isotropy, break down. 
This holds in particular for the discrimination between g*b * 0 and g»b - 0 contrast. In 
such cases a Burgers vector analysis requires the comparison of observed contrast 
figures, in particular of those parts showing depth oscillations, with computer-simulated 
images for modelled dislocation types, cf. Head etal. [1973] and Humble [1978]. 

6. Point-defect agglomerates, radiation damage 

6.1. Introduction 

Irradiation of crystals with energetic particles (e.g., electrons, neutrons, or ions) leads 
to the formation of interstitials and vacancies. Point defects may also be created by 
plastic deformation or by quenching-in of thermal vacancies. If these point defects are 
mobile, they may cluster together, forming point-defect agglomerates such as dislocation 
loops. In fee metals they may also form stacking-fault tetrahedra. This holds especially 
for vacancies, but in principle also for interstitials. Vacancies may also agglomerate into 
cavities (bubbles, voids). One of the problems to be solved by TEM is the determination 
of the crystallographic nature of such agglomerates and the determination of their type, 
vacancy (V) or interstitial (I), i.e., whether they are produced by an agglomeration of 
vacancies or of interstitials. In this section some of such methods are briefly outlined. 
For a review see Wilkens [1978]. 

6.2. Dislocation loops 

6.2.1. Formation of loops 

Dislocation loops are formed by agglomeration of point defects into plates on densely 
packed lattice planes. For instance, a monolayer (diameter D ) of agglomerated vacancies 
on a { 111 } -plane in a fee metal gives rise to an intrinsic stacking fault surrounded by a 
dislocation loop (diameter D) with a partial Burgers vector Z> f =j< 111> of Frank type, 
perpendicular to the loop plane. Agglomeration of interstitials on {111} gives rise to an 
extrinsic stacking fault over the loop area. If the loop size exceeds a critical value, it 
becomes energetically more favourable to eliminate the stacking fault area by sweeping 
a Shockley partial b s =g<112> over the loop area to convert the Frank partial into a 
perfect Burgers vector b, e.g., 

b F +b s =b, (34a) 

i[lll]+i[H2] = i[110]. (34b) 

There is evidence that similar two-step mechanisms for the formation of loops with 
perfect Burgers vectors are existing also for bcc (Eyre and Bullough [1965], Jager 
and Wilkens [1975]) and for hep metals (Foll and Wilkens [1977]). 
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6.2.2. Analysis of large dislocation loops 

If the loop diameter D is sufficiently large compared to £ geff then the loop nature is 
clearly visible on the micrograph and the Burgers vector b of the loop can be determined 
by the contrast-extinction rules outlined in § 5.5. Approximate information about the habit 
plane of the loop can be obtained by following the change of the projected shape and 
width of the loop when tilting the specimen around an axis parallel to the operating 
g - vector (for details see, e.g., Maher and Eyre [1971]). 

Segments of a dislocation loop which are opposite to each other have the same 
direction of b, but opposite line direction u. Their contrast behaviour is therefore similar 
to that of a dislocation dipole, cf. §5.4. Accordingly, if the loop is imaged with 
(g»b)s* 0, the loop is imaged either as inside contrast (loop contrast inside the true 
projected loop position) or as outside contrast, depending on the sign of (g*b)s. The 
question, inside- or outside contrast, can best be clarified by comparing images taken 
with g and -g and leaving the sign of s unchanged, cf. fig. 17 where dislocation loops of 
interstitial type in electron-irradiated molybdenum are shown; the apparent sizes of the 
loop contrasts change remarkably when going from g to -g. Then, the loop type, V or I, 
can be determined if the inclination of the loop plane with respect to the primary beam 
(direction of k 0 ) and the operating g - vector is known (Hirsch etal. [1977]). 

In application to practical cases the inside outside method suffers from a number of pit- 
falls which may lead to the wrong answer regarding the loop type. Recipes have been worked 
out in order to avoid these pitfalls (Maher and Eyre [1971], and FOll and Wilkens [1975]). 

6.2.3. Small dislocation loops 

If the loop diameter D becomes smaller than £ geff , dislocation loops are no longer 
imaged “as loops”. On kinematical strong-beam images the loop contrast degenerates to 



Fig. 17. Dislocation loops of interstitial type in electron-irradiated molybdenum. Two-beam bright-held images 
taken with h’> 0. Note the apparent change in the loop size when changing the sign of g. For details see text. 
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a black-dot contrast containing very little information on details of the loop (ROhle 
[1967]). Even under weak-beam conditions the inside-outside contrast method breaks 
down for D<1 nm. (Haussermann etal. [1973]). For dislocation loops below this limit 
the black-white contrast method becomes applicable when the specimen is imaged under 
two-beam dynamical conditions with vv = 0, in bright or dark field (ROhle [1967]). If 
under such an imaging condition a small dislocation loop is located within the surface- 
near regions of depth-oscillation (§4.7.5) it gives rise to a characteristic black-white 
contrast figure. An example is shown in fig. 18 referring to a Cu specimen containing 
mainly small Frank dislocation loops of V-type (produced by irradiation with 30 keV Cu- 
ions) in a very surface-near layer close to the bottom surface of the specimen: Such 
black-white contrast figures may be characterized by a black-white vector /, pointing 
from the centre of the black to the centre of the white lobe. Because of the very nature 
of the depth oscillation contrast, the black-white contrast of a loop is inverted, i.e., bright 
is changed into dark and vice versa, if either the loop type, V or I, is changed or if the 
loop depth-position Zq is shifted towards Zo ± l£ g .eff- Accordingly, in order to discriminate 
between loops of V or I type we need information about the sign of (g»l) and of the 
depth position Zq of the loop. The latter information can be obtained by careful stereo 
measurements (e.g., ROhle [1967] and ROhle and Wilkens [1967]). 

Additional information about the direction of the Burgers vector and the loop plane 
normal n can be obtained from the fine structure in the centre of a black-white contrast 
figure (Katerbau [1976] and Eyre et al. [1977]) and from the outer shape of the 
black-white contrast figures by comparison of experimentally obtained and calculated 
contrast figures. For elastically isotropic, or nearly isotropic, materials the shape of the 
contrast figure and in particular the angle between / and g depend in a characteristic 
manner on the direction of b and n with respect to k 0 and g, respectively. These 
dependencies have been worked out by numerical integration of the differential equations 



Fig. 18. Black-white contrast of small dislocation loops (Frank loops) in a copper specimen irradiated with 30 
keV copper ions. Dynamical dark field image taken with w = 0, specimen normal (110). g = (002). The inserted 
triangles indicate small stacking-fault tetrahedra. 
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(29) or (31) (e.g., Ruhle [1967], Katerbau [1976], Eyre etal. [1977] and English et 
al. [1980]) or by an analytical first-order perturbation integration of eq. (31) (e.g., 
Wilkens and Ruhle [1972]). 

The latter was successfully applied to the analysis of small dislocation loops in ion- 
irradiated tungsten (Haussermann et al. [1972] and Jager and Wilkens [1975]) and 
cobalt (FOll and Wilkens [1977]). More recent analytical calculations by Wilkens and 
Kirchner [1981] have shown that the shapes of the black-white contrasts of small loops 
are sensitive to elastic anisotropy. 

6.3. Stacking-fault tetrahedra 

Frank dislocation loops in fee metals can dissociate into stacking-fault tetrahedra 
(SFT) constituted by stacking faults on the four {111} -type tetrahedra faces intercon- 
nected by stair rod dislocations with 6=£<110> at the edges of the tetrahedra. This 
transformation is favoured in particular in metals of low stacking-fault energy such as 
gold, silver or copper. 

One consequence of the small strength of the stair-rod dislocations bounding the SFT 
is that their strain contrast is rather weak. Accordingly, the contrast of a SFT is 
predominantly determined by the stacking-fault areas (which may give rise to a fringe 
pattern if the SFT is larger than the extinction length (Chik [1966]), i.e., the contrast 
shape of a SFT is in general close to the shape of the SFT projected onto the image 
plane. Some SFT, as produced by ion-damage in Cu, are indicated in fig. 18 by a symbol 
reflecting their outer shapes, as expected for a specimen orientation close to (110). 

In spite of the weakness of the strain field, SFT may reveal black-white contrast. For 
large SFT with edge lengths of the order of, or larger than, £ geff the black-white contrast 
degenerates to the so-called Ashby-Brown contrast (Ashby and Brown [1963]) where 
the depth oscillations are suppressed due to surface-stress relaxation (McIntyre and 
Brown [1966] and Chik etal. [1967]). Under suitable conditions even small SFT may 
reveal black-white contrast similar to that of small dislocation loops (Saldin et al. 
[1978]). However, in contrast to small loops, black-white vector l points always parallel 
or anti-parallel to g (because the strain field does not show up a pronounced preferential 
direction). The latter property is sometimes helpful in order to discriminate between 
small loops and small SFT. 

6.4. Cavities 

Cavities (voids, bubbles) are best imaged under two-beam conditions with s = 0. Then 
cavities act like a local reduction of the specimen thickness t. Thus, in a specimen with 
foil thickness t in that range where thickness fringes are observable the contrast of a 
cavity is dark on the front flank of a bright thickness contour and bright on the rear side 
(Van Landuyt etal. [1965]). If t is beyond the region of thickness fringes, the contrast 
is bright but decreases rapidly with decreasing diameter d of the cavity. Finally, if d is 
below a critical value of some nm, cavities are no longer visible on in-focus images. 
Then the contrast is considerably improved by an appropriate defocusing of the objective 
lens (Ruhle and Wilkens [1975]). 
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6.5. Displacement cascades and disordered zones 

Energetic ions (in the case of fast neutron irradiation the primary knocked-on atoms) 
deposit their kinetic energy in a crystal in so-called cascades, within which a large 
number of atomic replacements takes place. In an ordered alloy these replacement events 
lead to a zone of reduced long-range order which reflects size and structure of the 
cascade. Such disordered zones can be made visible on dark-field images taken with a 
superlattice diffraction vector g (Jenkins and Wilkens [1976]). An example of disorder- 
ed zones in Cu 3 Au irradiated with Cu + -ions is shown in fig. 19. 

7. Precipitates 

The precipitation of a second phase can be pursued by TEM, which was done 
successfully since the early days of TEM of materials (Hirsch etal. [1977]). Depending 
on the structure of the interface, we talk about coherent, partially coherent or incoherent 
precipitates. A partially coherent particle may possess one coherent interface and one 
where coherency is lost (i.e., is semi-coherent or incoherent). This can be caused either 
by different structures in the precipitate and the matrix or by a large displacement along 
the interface (e.g., Olson and Cohen [1979]). An incoherent particle has a crystal 
structure different from the matrix. Often there exist no orientation relationships between 
the two phases. A precipitate can give rise to TEM contrast for two reasons: (i) due to 
the alteration of the electron waves passing through the particle (precipitate contrast ) and 
(ii) due to alteration of the electron waves passing through columns near the particle 
where the crystal has been distorted due to the presence of the precipitate (matrix 
contrast or strain contrast). 



Fig. 19. Dark-field image taken with a superlattice reflection g =(110) of a fully ordered Cu 3 Au specimen 
irradiated with 30 keV copper ions. The dark dots represent zones Of reduced long-range order as produced by 
the displacement cascades of the incident ions. The dark bands are due to antiphase boundaries. 
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The interpretation of the matrix contrast is straightforward and can be done in a 
similar way as for radiation-induced defects or dislocations. The strain contrast has to be 
observed for different foil orientations (and different diffraction vectors) under well- 
defined dynamical two-beam or well-defined kinematical conditions (see § 4). If a strain 
contrast can be observed then contrast calculations have to be performed for different 
strain distributions surrounding the precipitate in a thin foil. Matching the results of the 
calculations with the observations gives a qualitative and sometimes even quantitative 
model of the precipitate (Ashby and Brown [1963]). 

McIntyre and Brown [1966] and Chik et al. [1967] showed that black-white 
contrasts (BW contrasts) can be observed if certain conditions for the size and magnitude 
of the strain are fulfilled. The strain is thereby expressed in terms of a dimension- 
less quantity, 

Q = ^g~, (35) 

s* 

where r 0 is the radius of the particle, g the modulus of the diffraction vector, the 
extinction length, and s a parameter describing the constrained strain of the particle 
(Ashby and Brown [1963]). BW contrasts are expected for certain combinations of r 0 
and Q (Chik et al. [1967]). Small values of r 0 result in BW contrasts with depth 
oscillation similar to those of radiation-induced defects (§ 6), whereas for large values of 
r 0 and Q the depth oscillations of the BW contrasts are suppressed by the stress 
relaxations at the foil surfaces (Ashby-Brown contrast). In this case the sign of the 
displacement field can be determined by DF images unambiguously (Ashby and Brown 
[1963]). Very large defects produce no BW contrast at all (Chik et al. [1967]). The 
calculation of TEM contrast was performed for inclusions of different shapes and 
included also elastic anisotropy (see e.g., Lepski [1974]). The main problem in doing 
such computer simulations is the determination of the displacement field of the precip- 
itate placed in a thin foil. 

Spherical precipitates form in Cu-Co alloys (fig. 20) and the sizes depend on the 
annealing treatment (McIntyre and Brown [1967]). The typical coffee-bean contrast 
can be observed. A “line-of-no-contrast” is perpendicular to the diffraction vector g. The 
width of the lobes measured perpendicular to g is a measure of the size of the pre- 
cipitate. 

The small plate-like precipitates in Al-Cu also produce a strain contrast. The visi- 
bility depends on the operating diffraction vector since the displacement field is strongly 
anisotropic. Coherent Ag precipitates in Al-Ag alloys do not form a strain contrast, since 
there exists (nearly) no constrained strain around the particle (see fig. 21a). 

Semicoherent precipitates may produce a strain contrast which can be evaluated 
similarly as for coherent precipitates. This was demonstrated for small Mo 2 C precipitates 
in Mo by Lepski and Burck [1981, 1982]. Hirsch et al. [1977] showed that coherent, 
semicoherent, or incoherent precipitates can be studied by utilizing the strain contrast 
(DF and BF techniques). If the symmetry of the precipitate is different from the 
symmetry of the matrix, then additional reflections in the diffraction pattern can be 
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Fig. 20. Contrast from spherically symmetrical strain fields due to precipitations in a Cu-Co alloy. The coffee- 
bean type contrast is visible. 

observed (fig. 21e). The shape of the precipitate is revealed by DF images taken with 
such a reflection belonging to the precipitate (fig. 21c, d). 

The precipitates themselves can be made visible (i) if difference in the structure 
factor in the two phases (“structure-factor contrast”), of which an example is shown in 
fig. 21, (ii) if the orientation of the foil is such that certain planes in the precipitate are 
near the exact Bragg condition (strong reflection) while no planes in the matrix are in 
such a position (“orientation contrast”), (iii) if the matrix displacements induced by the 
precipitate cause an abrupt phase-change at the precipitate (“stacking-fault contrast”), or 
(iv) if a moire pattern is formed due to different lattice parameters in the matrix and 
precipitate. The contrast caused for the different cases is already extensively described 
by Hirsch etal. [1977]. 

Particles of a second phase change the diffraction pattern of the pure matrix because: 
(i) there are extra reflections due to the diffraction pattern from the precipitate itself, (ii) 
the second-phase particles exert an influence on the matrix diffraction spots, (iii) there 
is the possibility of double diffraction (if the particle is embedded in the matrix), and (iv) 
since the precipitates are usually small, the shape of the diffraction spots is reciprocal to 
the shape of the (small) particles. 

8. Structure of grain boundaries and interfaces 

Internal surfaces determine many properties of materials. For an understanding of the 
properties it is essential to know the structure of the defects, which are actually the 
regions in space at which two crystals meet. Interfaces between dissimilar materials are 
called heterophase boundaries (or just interfaces). Interfaces between crystals differing 
only in relative orientation and/or translation are called homophase boundaries. Homo- 
phase boundaries include grain boundaries, stacking faults, twins, and antiphase 
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Fig. 21. Contrast of precipitates, (a) Structure-factor contrast of small coherent precipitates in Al-6% Ag alloys; 
no strain contrast is visible, (b) Structure-factor contrast of incoherent (large) precipitates in Al-6% Ag alloys 
annealed for 1000 min at 400°C ( 6 " precipitates); foil orientation — ( 1 10); interface dislocations are visible 
between the A1 matrix and the Ag precipitates, (c) Contrast of incoherent Al-4% Cu alloys; structure-factor 
contrast; foil orientation (106), BF image, (d) DF image with a reflexion of the Cu precipitate. Only Cu 
precipitates of one variant are in contrast, (e) Diffraction pattern of figs, (c, d). 
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boundaries in ordered alloys. In the first part of this section the possibilities for revealing 
the structure of homophase boundaries will be described. The second part deals with 
some observations on heterophase boundaries. 

Kalonji and Cahn [1982] manifested that ten parameters are required for the geometri- 
cal description of an interface, see fig. 22. In this concept one assumes two interpenetrating 
crystals which are then misoriented (four parameters) and shifted against each other (three 
parameters). Three additional parameters are required to specify the location of a plane 
interface in the interpenetrating crystals. Finally, atoms of one crystal are removed from one 
side of the surface and those of the second crystal from the other. In this way, the symmetry 
of the interface can be described very elegantly by means of group theory (Gratias et al. 
[1979] and Kalonji and Cahn [1982]). Stacking faults and antiphase boundaries are pure 
translation interfaces, whereas grain boundaries represent general homophase boundaries. The 
geometric parameters are potentially capable of undergoing relaxation. 





Rejection of atoms of 
inappropriate colour 

Fig. 22. The idealized construction of a crystalline interface: (a) superposition of the two crystals; (b) 
imposition of rigid body rotation (four parameters); (c) imposition of rigid body translation (three parameters); 
(d) insertion of surface S of the interface (three parameters); (e) rejection of inappropriate atoms at the 
interface. 
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8.1. Transmission electron microscopy of pure translation interfaces 


At a pure translation interface (stacking faults, antiphase boundaries) two perfect 
crystal parts are shifted against each other. Usually, the plane of the interfaces coincides 
with a close-packed plane. For example, the translation vector is r =g<l 12> for a 
stacking fault (STF) in fee materials, whereas r is a lattice vector of the disordered 
crystal structure for antiphase boundaries in an ordered structure. TEM contrast is 
expected in the projected area of the (usually) inclined translation interface. The depth 
position of the interface may be Then the crystal part lying below the interface is 
shifted against the upper reference crystal by the translation vector r which defines also 
the sign of r. 

The diffraction contrast can easily be calculated by utilizing the so-called “matrix- 
method” introduced by Amelinckx and Van Landuyt [1978]. With this method the 
contrast can be calculated for a general n-beam situation. We restrict ourselves to the 
two-beam case which takes into account only one scattered beam 0 g besides the 
transmitted beam 0 O . The amplitudes of the scattered beam 0 g and of the transmitted 
beam 0 O are represented by a column vector 


0 = 


0o 


(36) 


Amelinckx and Van Landuyt [1978] described the changes of the amplitudes after 
passing through a perfect crystal of thickness t [normalized excitation error w, cf. eq. (7)] 
by: 


f 0o ('> w )^ 

<f>M, w) 


Af(/, w) 


/ out 


( 0o 


(37) 


where the subscripts “in” and “out” refer to the incoming and outgoing waves for a 
perfect crystal, respectively. The initial values are of course 0 O = 1, 0 g = 0. M(t,w) 
represents a “response matrix” given by: 


M(t, w) 


( .0 . 0 - \ 

0o 0 g 

jO i0- 

I0 g 00 ) 


(38) 


with [cf. eq. (14)]: 


,o/ \ . wsmTT<Tt ,o/ \ isin7ro7 

0o w ) = cos 7TO-/ - l -iry, 0 g (t,vv) = 


( 1 + W T 


(i-r 


(39) 


anomalous absorption is included, which means a is complex, whereas normal absorption 
in neglected. The normalized excitation error w has to be replaced by -w in eq. (39) to 
form 0“ and 0® of eq. (38). 

The influence of a planar defect included in a transmitted specimen on the amplitude 
distribution of the different waves can be described by a response matrix of the defect 
M d . This matrix is called Af“ for a pure translation interface, with 
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M a 


0 > 

,0 e / 


(40) 


and a =2-7TgT (g = diffraction vector, r = translation of the perfect crystal below the 
defect with respect to the perfect crystal in which the electrons enter). 

The amplitudes of the waves transmitted through a specimen (total foil thickness t ) 
containing such a translation interface (depth position r,(jc), see § 4) are determined by 
three contributions: (i) the contribution of the perfect crystal [foil thickness r,(jc)] lying 
above the planar defect, described by a response matrix Af(r 1 ,w), [eq. (38)]; (ii) the 
contribution of the planar defect itself [response matrix AT, eq. (40)]; and (iii) the 
contribution of the perfect crystal (thickness t 2 = t-t^) lying below the defect [response 
matrix M(r 2 , w)]. 

The amplitudes of the waves at the lower foil surface are just expressed by 


'4>o( t T\ 


f 

= M(t 2 ,w)-M a 

V 




(41) 


The contrast caused by an inclined defect can be calculated by an evaluation of eq. (41) 
for different depth positions 

The following results can be obtained for an inclined STF (for details see Amelinckx 
and Van Landuyt [1978]): 

(i) Bright and dark fringes are expected in the projected area of the STF. 

(ii) With increasing thickness new fringes are created in the centre of the foil. 

(iii) The fringes are parallel to the closest surface. 

(iv) The BF fringe pattern is symmetrical with respect to the foil centre, whereas the 
DF image is similar to the BF image close to the top surface but complementary close 
to the bottom surface. 

Figure 23 shows an example of the contrast of a stacking fault in a Cu-10 at% A1 
alloy. The image of a stacking fault (STF) is characterized by the value of a = lirgT. In 
fee materials stacking faults are predominantly produced on [111] planes by a shear of 
t=5<112> or by removal or insertion of a plane of atoms. This may happen when 
vacancies or interstitial atoms condense on close-packed planes. In fee materials the STF 
are classified as intrinsic or extrinsic. 

The determination of the type of STF is identical with the determination of the sign 
of a. We know from the symmetry rules that the sign of the contrast fringe in BF and 
DF at the top of the transmitted foil is positive (bright fringe) for a > 0 (thick foil). From 
the BF and Df images of a STF (see fig. 23) we can determine the top surface and the 
bottom surface. Since we know the sign of a from the edge fringe of the BF image, we 
have the information necessary to determine the sense of r. This method was first given 
by Hashimoto etal. [1962]. Gevers etal. [1963] pointed out that the type of the fault 
can be determined from a DF image alone; the sense of inclination need not be known. 
Gevers et al. showed that r can be determined by the following simple method. The 
diffraction vector g is drawn as an arrow with its origin at the centre of contrast fringes 
on the DF image. The nature of the stacking fault is intrinsic (extrinsic) if the limiting 
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Fig. 23. Images of a stacking fault in Cu-10 at% Al, intrinsic stacking fault: (a) BF, g = (l 1 1); (b) DF, 
g = (Ill); (c) BF, g = (lTI); (d) DF, g = (lll). The rule of Gevers el al. [1963] is fulfilled. (Courtesy A. 
Korner and H. P. Karnthaler.) 


fringe on the side of the arrow of g is dark (bright). This rule is valid in fee materials for 
{111}, {220} and {400} reflexions while it is reversed for {200}, {222} and {440} 
reflexions. The contrast of the STF vanishes for g*T = integer. The direction of t can be 
determined from two images taken with different diffraction vectors on which the STF 
is out of contrast. 

The contrast of domain boundaries, twins, antiphase boundaries can also be calculated 
with the matrix method, see Amelinckx and Van Landuyt [1978]. 
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8.2. Transmission electron microscopy of grain boundaries 

Grain boundaries (GBs) have been the object of intensive TEM studies for many 
years, and in that time many papers and review lectures have appeared (eg., Chadwick 
and Smith [1976], Clarebrough and Forwood [1978] and Balluffi [1980], It is well 
recognized that the structure of GBs is important for the understanding of the problems 
of segregation, recrystallization texture and intergranular embrittlement, and of course the 
bonding across the boundary also governs those properties. In the papers quoted above 
it is shown that much information can be obtained by TEM diffraction contrast studies, 
diffraction studies and recently also by direct lattice imaging. 

By TEM diffraction contrast studies, information on the intrinsic structure of GBs can 
be obtained; mainly dislocations can be analyzed. For special situations the geometrical 
parameters can be determined by TEM work. Diffraction studies allow the determination 
of the “width” and symmetry of GBs, whereas direct imaging by means of high- 
resolution TEM allows an insight into the atomic structure of the boundary. 

In ch. 9 it is explained that GBs may be described in different ways: the dislocation 
model, the plane-matching model, and the coincidence-site model. All models will 
summarize the relaxation phenomena at the GB in different configuration of defects. The 
models assume that there exist certain misorientations which possess a low GB energy, 
and these misorientations are characterized by a low £ value of the coincidence-site 
lattice *. 

If the symmetry of a boundary and the included dislocations are determined, then the 
GB is completely specified. The parameters included in the description can be determin- 
ed by TEM in principle by comparing experimental micrographs taken under well 
defined diffraction conditions with results of computer simulations carried out using the 
dynamical theory of electron diffraction. For the computer simulations the displacement 
of the atoms (lattice planes) due to relaxation as well as many experimental parameters 
such as the thickness, orientation of the interface, and extinction- and absorption lengths 
must be known. In addition, experimental diffraction parameters, such as the number of 
beams excited and their s value, must be established accurately. 

The contrast calculations can be very complicated. Each beam excited in the upper 
crystal is incident on the lower crystal and can give rise to further beams in that crystal. 
Thus, the total number of beams propagating in the lower crystal can be substantial, and 
the coupling of the beams is strongly influenced by the crystallographic relationship of 
the two crystals. There are three experimental conditions which allow a more straight- 
forward interpretation: 

(i) I\vo-beam diffraction in one crystal and negligible diffraction in the other. Either 
the upper or lower crystal can be oriented for two-beam diffraction, and the non- 
diffracting crystal is regarded as a block in which normal but not anomalous absorption 


* The CSL can be obtained by allowing the two misonented crystal lattices adjoining the boundary to 
interpenetrate and translate so that lattice points of each crystal coincide. The space lattice made up of the 
coincident lattice points is called the CSL. The fraction of lattice points (in one crystal) in good coincidence 
is defined as 1/2. Coincidence models of GBs are discussed in ch. 10B, §2.2.1.3. 
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occurs. The contrast behaviour is similar to thickness fringes. This diffraction condition 
can only be obtained approximately, since weak beams are inevitably excited in the 
“non-diffracting” crystal, and these may be coupled to some extent with the beam 
selected for image formation. This was demonstrated by Humble and Forwood [1975] 
who compared microdensitometer traces taken across experimentally obtained GB 
thickness fringes to computer profiles. Humble and Forwood observed that the “non- 
diffracting” crystal does not behave simply as an absorbing wedge, and that no simple 
function could be found which would adequately describe the attenuation in this grain. 

(ii) Simultaneous two-beam diffraction in the two crystals. In this mode the specimen 
is exactly oriented so that only one diffracted beam is strongly excited in the upper 
crystal — and in such a way that this (excited) beam does not excite further strong 
beams in the lower crystal. At the same time, only one beam is strongly excited in the 
lower crystal by the incoming beam. Usually, only BF images are taken (Humble and 
Forwood [1975]). 

(in) In cases where the crystallography permits, two-beam or systematic diffraction 
by sets of planes with identical spacing and orientation in the adjacent crystals may 
occur. Such sets of planes are referred to as common, and this case is a special instance 
of condition (ii). This orientation has been used for the measurement of the relative shift 
of the two adjacent crystals. 

Experimentally, the orientation relation between the two adjacent grains of an 
interface and the normal on the interface can be obtained by three micrographs and 
accompanying diffraction patterns taken under different, well established orientations. 
The lateral shift r can be measured for special configurations (Matthews and Stobbs 
[1977] and Pond [1979]). 

The TEM contrast of GB dislocations can be treated similarly as for lattice dis- 
locations. However, for the GB dislocations further parameters, including the geometry 
of the interface and the diffraction conditions pertaining to both crystals must be 
considered. The direction of the Burgers vector b of the GB dislocation can be determined by 
using the g*b = 0 criterion. However, the accuracy of this semi-quantitative method is small, 
since the magnitude of the Burgers vector of secondary GB dislocations is usually small. 

Boundaries between two crystals are assumed to have a low-energy structure for 
certain misorientations (e.g., Brokman and Balluffi [1981]). This low-energy structure 
can be thought to exist of finely spaced arrays of dislocations, the so-called “primary” 
dislocations. Any deviation from such low-energy orientation relations is accommodated 
by a network of dislocations, the “secondary” dislocations. Experimental observations on 
those dislocations were carried out by Balluffi and coworkers (e.g., Schober and 
Balluffi [1969]) mainly on “artificial” (001) twist boundaries in Au. The TEM studies 
reveal that a grid of undissociated secondary dislocations is present, see fig. 24. The 
distance of dislocations with the same Burgers vectors depends on the deviation A 6 from 
the exact coincidence position. The observations are in agreement with the predictions of 
the theory (Brokman and Balluffi [1981]). Secondary dislocations can split into partial 
secondary dislocations and stacking-fault-like structures. This is observed not only for 
metals but also for semiconductors (Bacman etal. [1981]). 

Lattice dislocations can interact with grain boundaries during yielding, creep and 
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Fig. 24. “Artificial” twist boundary in a gold bicrystal, twist angle near 1 = 5 (36.9°). Square grids of secondary 
screw grain-boundary dislocations can be observed, Burgers vector ft = (a/10)<310>. The dislocations 
accommodate the deviations from the exact 1 = 5 misorientation. (Courtesy T. Schober and R. W. Balluffi.) 


recrystallization (Smith [1976]). Many TEM observations on reactions of lattice 
dislocations with GB dislocations can be explained on the basis of the dislocation model 
for GB structures. For the reactions of the lattice dislocations with grain boundaries one 
has to observe that Burgers vectors are conserved and dislocation lines can end only on 
other dislocations or at a free surface. 

8.3. Diffraction studies on the structure of grain boundaries 

Sass and Bristowe [1980] demonstrated that diffraction techniques also in the TEM 
can answer questions concerning the atomic structure of GBs. Each GB represents a 
periodic arrangement of strains (misfit) and good coincidences. Electrons and X-rays are 
scattered by this periodic grid which possesses a unit cell which corresponds to the CSL. 
If the (periodic) displacements at the GBs are known, then the scattering factor of such 
boundaries (mostly twist boundaries) can be calculated for the different reflections caused 
by the GBs. The reciprocal lattice, e.g., of a twist boundary, exists of thin rods, the 
integral intensity in the rod depends on the scattering factor (see appendix). The length 
of the rod is proportional to the inverse of the “ thickness” of the disturbed region near 
the GB. This grain-boundary “ thickness” approximately equals the distance of the 
dislocations present in the GB. 

8.4. Direct imaging of grain boundaries 

High-resolution electron microscopy (HREM) allows the direct imaging of certain 
GBs. In HREM the structure of the specimen must be periodic in the direction of the 
transmitted electron beam. Therefore, only tilt boundaries can be studied where the tilt 
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axis is parallel to the beam direction. Since the strong periodicity is disturbed in the core 
region of the boundaries, the point-to-point resolution of the instrument must be good 
enough so that information on the relaxation of atoms is transferred through the objective 
lenses despite the still rather high spherical aberration of the best lenses available to date. 

Tilt boundaries in silicon (Bourret and Desseaux [1979]), germanium (Krivanek 
et al. [1977]) and molybdenum (Penisson et al. [1982]) were studied by HREM. For 
these studies the specimen thickness has to be below 15 nm. Figure 25 shows a HREM 
micrograph of a tilt boundary in molybdenum. For the determination of the positions of 
the atoms close to the GB, contrast simulations have to be performed. In the simulation 
work one assumes a certain atomic configuration near the tilt boundary. The simulated 
micrographs have to be compared with the experimentally observed pictures for different 
defocusing values. The atomic arrangements have to be modified until a complete fit 
between the observed and calculated images is reached. 

8.5. TEM contrast of heterophase boundaries 

Heterophase boundaries are formed when two materials of different structure and/or 
different chemical composition meet. If, for example, any type of phase transformation 
occurs then a heterophase boundary is present where the different phases meet. This is 
true for the interface of a martensitic phase transformation where two materials with the 
same chemical composition but different structures meet, as well as for an interface of 
thermally grown precipitate. 



Fig. 25. High-resolution image of a 1 = 1 pure tilt boundary in Mo. Foil orientation [001] (for both grains). 
(Courtesy A. Bourret and J. M. Penisson.) 
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The importance of the structure of these heterophase boundaries for the nucleation 
and growth process of the precipitate is recognized. However, only few systematic TEM 
studies were performed investigating the structure of such interfaces. The studies would 
require experimental imaging of the inclined interface under different diffraction 
conditions under which reflections from both crystal structures are excited simultaneously 
or one after the other. The observations have to be compared with contrast simulations. 
General expressions were derived even in the framework of anisotropic elasticity theory 
for the elastic displacement and stress fields by Bonnet [1981]. The results, however, 
have as yet not been applied to contrast simulations of an interface. 

The geometry of the heterophase interfaces can be described by a generalization of 
the geometry of grain boundaries (Balluffi etal. [1982]). These models are essentially 
fit-misfit models in which the regions of a good “fit” are patched where partial lattice 
matching across the boundary is achieved, and the regions of ‘misfit’ are boundary line 
defects which possess the character either of a dislocation or of a boundary step (facet). 
(See also ch. 9, § 2.3 for a discussion of heterophase [“intetphase”] boundaries). 

The model has been applied to interfaces between two cubic crystals — copper and 
silver (Laffont and Bonnet [1982]) — and for hexagonal materials (Bonnet eta!. 
[1981]). A metal-metal-carbide interface was studied by different authors (Dahmen etal. 
[1981] and Florjancic etal. [1982]). Florjancic etal. studied in detail the interface 
between Mo and Mo2C precipitates which forms after cooling from a supersaturated 
solution. In TEM micrographs (fig. 26) a dislocation-type contrast can be observed. The 
determination of the Burgers vector by means of the simple g*b = 0 rule is not unambig 
uously possible. The contrast behaviour suggests that the Burgers vector is of type 
j< 111>, diffraction patterns yield the orientation relation between Mo and Mo 2 C. Dense 
packed planes of the Mo lattices are about parallel to dense-packed planes of Mo 2 C 
(Burgers relation). Careful diffraction studies were performed for an edge-on con- 
figuration of the interface, fig. 26. It was found that besides the reflections of Mo and 
Mo 2 C, streaks can also be observed lying between them. The formation of such 
additional streaks can simply be explained with the assumption that an interface crystal 
is present between the (pure) Mo and Mo 2 C. From the maximum streak length the 
minimum thickness of the interface crystal can be determined to within nine lattice 
planes of the Mo lattice. 

9. High-resolution TEM 

9.1. Introduction 

By high-resolution TEM we understand all attempts of utilizing modem electron 
microscopes up to the ultimate resolution limit. The latter is determined by the wave 
aberrations of the objective lens — spherical aberration, chromatic aberration, astig- 
matism etc. — under optimal adjustments of size and position of the objective aperture, 
of the direction of the incident electron beam with respect to the optical axis and of the 
actual focusing of the objective lens, cf. also §2. Under such optimum conditions a 
point-to-point resolution of about 0.2 nm and a lattice-fringe resolution of <0.1 nm are 
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Fig. 26. Interface between Mo and Mo 2 C: (a) dislocation contrasts are visible, the Buigers vector cannot be 
analyzed unambiguously; (b) diffraction pattern of the same interface in an edge-on configuration, additional 
“streaks” are observable due to the “interface” crystal. 

achievable by modem microscopes specialized for high-resolution work. At the present 
time, high-resolution TEM plays only a marginal role in physical metallurgy, but this 
may change in the near future when the modem generation of electron microscopes 
becomes widely available in laboratories concerned with physical metallurgy. Therefore, 
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we present here only a brief outline of the basic principles of high-resolution microscopy. 
In the following we restrict ourselves to spherical aberration as the most important lens 
error in the high-resolution work. For more extended introductions to high-resolution 
TEM we refer to the references in the bibliography. 

9.2. The optical transfer function 

We assume that the specimen is orientated perpendicular to the optical axis of the 
microscope and introduce a Cartesian coordinate system in the lower specimen surface 
with the z-axis (unit vector ej pointing downwards, i.e., towards the objective lens. Axial 
illumination is assumed, i.e., the incident beam is parallel to e L . A space vector in the 
plane perpendicular to e L will be denoted by r. The objective lens may be focused onto 
the plane z = 8 where 8 > 0 (8 < 0) means overfocusing (underfocusing) with respect to the 
lower specimen surface (z = 0). We denote the electron wave function in the plane z = 0 
by *. By propagating from z = 0 to the plane z = 8, i/» 0 (r) is modified by Fresnel 
diffraction to tp t (P). Note that for 8 >0, the wave function i is really existing at the 
plane z = 8, whereas for 8 <0, tp t (P) means a virtual wave function, which would merge 
into i/'ofr) at z = 0 when propagating from z = 8 <0 to z = 0 in an empty space, i.e, in the 
absence of the specimen. 

The propagation of the electron wave through the imaging system of the microscope 
is considered in terms of partial waves or Fourier waves of amplitudes ip (p), where the 
Fourier waves i// 0 and lp 6 are assumed to emerge either from the plane z = 0 or from the 
plane z =8, respectively, 

<Pj{p) = j >P j (f) exp(-27ri p • r) d 2 r, j = 0, 8. (42) 

p is a “space frequency” vector perpendicular to e L , starting from the optical axis. Since 
p has the dimension of a reciprocal length, it is a vector in reciprocal space. tp 0 and ip s 
are interlinked by the Fresnel diffraction theorem, 

ipj{p) = iPo{p) ■ ex P- %(/>)’ ( 4 3) 

with 

Xe(p) = TrkSp 1 , (44) 

with A = electron wavelength. 

The partial wave, belonging to the space frequency vector p and making an angle 
a = p/k ( k= 1/A) with the optical axis, suffers when passing through the objective lens 
an additional phase shift x,(P) caused by the spherical aberration of this lens, 

A s (p) = fC s AV, (45) 


* In this section the subscript of ip refers to the z-coordinate. 
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where C s is the spherical aberration constant, typically of the order of some mm. Then, 
in the backfocal plane of the objective lens (subscript “b”), we get the amplitude 
distribution 

<A b (/») = 'Ao (p) ' ex P~ i*(p). (46) 

with the wave aberration function 

x{p) = Xi + X, = A V + 1 C s A 3 / ]• (47) 

The objective aperture in the backfocal plane (assumed to be circular in shape) cuts off 
all partial waves making an angle a >a mm with the optical axis. With a =p/k, see above, 
we introduce an aperture function A(p ) by 

A(p) = 1 forp < Pmax = ka raax , 

(48) 

A {p) ~ 0 for P>/W 

Thus, immediately behind the objective aperture we are left with 

#,(/») = 4>{p) ' T { P\ t {p) = a {p) ■ exp - i*(p). (49) 

T[p) is called the optical transfer function. The subsequent imaging system (intermediate 
lenses, projector lens) can be assumed to be aberration-free. Thus the final image 
amplitude on the viewing screen (or on the photographic plate) is found from >p h (p) 
according to eq. (49) by an inverse Fourier transformation, 

•I'Jf) = J iA 0 (p) • T(p) exp^-2-Tri (50) 

where the index “m” stands for “magnified” image, M= electron-optical magnification. 
[The same sign of the exponents of the first, eq. (42), and of the inverse, eq. (50), 
Fourier transformations accounts for the 180° rotation between the diffraction pattern and 
the image, cf. § 2.] 

9.3. Consequences of the wave aberration 

By the objective aperture all partial waves </i 0 (p) with p>p nax are cut off. These 
partial waves are produced by structural details of ip a (r) of “wavelengths” < 1/ p max . 
Accordingly, these structural details are completely suppressed in Moreover, the 

phases of the partial waves ij/ Q (p) with p < p nwx are shifted by the wave-aberration term 
X(p) so that, after recombination of these partial waves in the image plane, the “struc- 
ture” of tf/Jt) may be significantly different from <p 0 (r) even for structures of wavelength 
> l/Pmax- F° r illustration, we calculate x(p) assuming C s = 3 mm, A =3.7 pm, 8=0 
(exactly focused objective lens), and p=Pi = ( 1 nm) -1 and p 2 = (0. 1 nm)” 1 . From eq. (47) 
we find x(Pi)~ 13-7° and ,y(/j 2 ) = 57 0 . Accordingly, in this case the wave aberration is 
probably negligible for the imaging of structures of dimension of 1 nm and above, but 
it becomes important near 0.7 nm or below. Obviously, the deteriorating influence of 
X(p) on the image formation can be reduced for a certain p-interval if both terms of x(p) 
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act against one another: Since C 5 is always > 0, it is expedient to operate the microscope 
with 5 <0, i.e., with an appropriately underfocused objective lens. For a special case this 
will be quantified in the next section. 

9.4. The weak-phase object approximation 

At present, high-resolution images with an intensity distribution / m (?) =liA m (?)l 2 , which 
are directly interpretable in terms of structural details of the specimen, are only obtain- 
able from extremely thin specimens of thickness /< 10 nm depending on the composition 
of the specimen. In such cases the weak-phase object approximation may be applicable. 
It can be shown that, assuming axial illumination, the wave function i/^ 0 (?) of sufficiently 
thin specimens can be represented by 

= exp[+itu 0 (r)] = 1 + itu 0 (r), (51) 

where tu 0 (f), being real and of modulus » 1, is proportional to the electrostatic potential 
V(r, z) in the specimen projected along the electron beam direction (=c-axis, origin in the 
lower specimen surface!): 

"o(0 “= z)dz- (52) 

Then, the formalism outlined in the preceding section leads to: 

'I'J?) = 1 + + (53) 

with 


w„(r) = J G> 0 (p) ■ A(p) ■ sin *(/>) ■ exp|-27ri ^ jd 2 /?. (54) 

For the same equation is applied with sin* substituted by cos*. From *(p) =*(-/>), 
A(p)=A(-p), and tu 0 (?) real it follows by symmetry arguments that ^ and are real 
as well. Accordingly, in linear approximation of the “perturbation” term o 0 (r) we get the 
image intensity 

/ m (f) = l + 2<V), (55) 

i.e., the only phase-modulated wave function i/i 0 (?) at the specimen [eq. (51)], which 
would give rise to no contrast in a perfect microscope (\tp 0 (f , )\ 2 = 1), is converted by the 
wave aberration term sin*(p) into an amplitude-modulated wave function iA m (r) at the 
image plane, eq. (55). 

In order to keep an utmost similarity between the observable structure of / ra (?) and the 
unknown structure of i/» 0 (F), we have to adjust the focusing parameter and the size of the 
objective aperture in such a way that the partial waves of a maximum range of space 
frequencies p are transferred through the imaging system by the sin*(/j) term with a 
reasonable value close to 1 or -1 and with the same sign of sin*(p). This is achieved by 
the socalled Scherzer focus , 5 0 , which correlates S to C s and A by: 


References: p. 1105. 



1084 


M. Rtihle and M. Wilkens 


Ch. 11, §9 



Fig. 27. The transfer function sin^(p) for C s = 3 mm. Solid curve: 5 =5 0 = Scherzer focus; dashed curve: 
S = 0.85 o , dashed-dotted curve: 5 = 1 ,2S 0 . 


So =-(K s A) 1/2 . (56) 

Figure 27 displays the term sin x(p) for three values of 8 assuming C s = 3 mm and 
A = 3.7 pm. In the case of 8 = 8 0 (solid curve) we find a broad band-pass with sin^(p) 
close to -1 (the shallow dent near p = 1.8 (nm) _l is tolerable) and with the first zero at 
p=p, =2.5 (nm)' 1 . This means that all partial amplitudes of space frequencies with p<p, 
are transferred with the same sign, i.e., they may contribute to a correct reconstruction 
of the image. The partial amplitudes associated to space frequencies p>p, change in sign 
due to the sin^(p) term more and more rapidly with increasing p. So their contribution 
to the final image may be confusing rather than constructive (e.g., they may be washed 
out by a finite beam divergence). Therefore, the size of the objective aperture should be 
chosen so that p max (eq. 48) coincides approximately with p„ thus cutting off the oscillat- 
ing part of sin^(p). In this way a resolution limit of 1/p, (= 0.4 nm in fig. 27) is 
achieved. 

If 8 is reduced by 20% to 0.8S 0 (dashed curve) we find a reduction of the first zero 
to p, = 2.2 (nm) -1 which corresponds to a reduction of the resolution limit to 0.45 nm. If, 
on the other hand, 8 is increased by 20% to 1.2 8 0 (dashed-dotted curve) the shallow dent 
near p= 1.8 (nm) -1 becomes so deep that it crosses the abscissa. As a consequence, the 
broad band-pass of the solid curve is completely disturbed and the resolution becomes 
drastically worse. We learn from these examples that fully utilizing the resolving power 
of an electron microscope requires a precise knowledge of the spherical aberration 
constant C„ and a precise control over the actual defocusing value 5. 

In the meantime several methods have been developed in order to make the partial 
waves of space frequencies beyond p, also useful for the image interpretation. Here the 
key-word “image reconstruction” must be sufficient, cf., e.g., Hawkes [1980]. 
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9.5. Some remarks to the high-resolution images of crystalline specimens 

The weak-phase object approximation, originally developed for the study of non- 
crystalline specimens, is valid also for crystalline specimens, provided they are sufficient- 
ly thin. The space frequency spectrum of a perfect crystalline weak-phase object 
consists of a discrete set of sharp Bragg diffraction spots. If such a specimen is carefully 
orientated with the incident beam along a low-order zone axis and if the lattice constant 
is large compared to that of pure metals or alloys, a large number of diffraction spots 
may pass through the objective aperture. In the frame of the weak-phase object approx- 
imation they form an image which reveals many details on an atomic scale of the 
(projected) crystal structure. An example is shown in fig. 28. In particular, complicated 
stacking or shear variants of otherwise comparatively simple crystal structures, which are 
hardly, or not at all, determinable by X-ray or neutron diffraction techniques, may 
become easy to analyze on high-resolution TEM. Regarding stacking variants of ordered 
alloys (antiphase structures) we refer as an example to Van Tendeloo and Amelinckx 
[1977, 1978]. Various applications of high-resolution TEM to crystal-structure research 
in mineralogy and inorganic chemistry may be found in a book edited by Wenk [1976]. 

Silicon is a substance especially suitable for high-resolution TEM because of its 
simple structure and because of its comparatively large lattice constant. Detailed 
information on an atomic scale of split dislocation configurations and of the structure of 
dislocation cores have been obtained, cf., e.g., Spence and Kolar [1979] and Bourret 
etal. [1982]. 

One point of special interest in physical metallurgy concerns the imaging of a local 
variation of lattice-plane distances as a consequence of a local variation of the chemical 
composition, e.g., in the course of a spinodal decomposition. Sinclair et al. [1976], 



Fig. 28. High-resolution image of Ba,Nb 4 0 I5 . (hexagonal unit cell, a = 0.572 nm, c= 1.172 nm). Projection 
along <10l0>. The insert shows the projected structure: solid circles indicate positions of Ba atoms, hatched 
diamonds indicate Nb0 6 octahedra. (Courtesy J.L. Hutchison; inset from Shannon and Katz [1970].) 
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studying a spinodally decomposed Ni 3 Au alloy, found indeed a periodic modulation of 
the lattice plane fringes, in accordance with the expectation. 

However, subsequent model calculations of Cockayne and Gronsky [1981] have 
clearly demonstrated that, due to the wave aberration x(P)> the spatial modulation of the 
lattice fringes observed on the image may be significantly different from the spatial 
modulation of the lattice plane spacings in the specimen. Even the number of lattice 
fringes per modulation period may be different from the corresponding number of lattice 
planes in the specimen. 

In conclusion it is worthwhile to emphasize that at the present state of the art all 
kinds of high-resolution work, in particular that concerning lattice defects, must be 
accompanied by extended theoretical work, i.e., by corresponding model calculations. 
This holds even more if one crosses the border of the restricted regime of applicability 
of the weak-phase object approximation. 

10. Analytical electron microscopy 

10.1. Basic considerations 

Electron scattering can be divided into two categories: elastic and inelastic scattering. 
When elastic scattering occurs, the direction of the electron velocity v is changed, but the 
magnitude Ivl remains constant, so that the kinetic energy, E=^m e v 2 (m e = electron mass) 
is unchanged. 

During an inelastic scattering event, energy is transferred to the target atoms and 
electrons, and the kinetic energy of the beam electron decreases. 

Inelastic scattering can happen by: (i) plasma excitation; (ii) excitation of conducting 
electrons leading to secondary electron (low-energy) emissions; (iii) deceleration of the 
electron beam in the Coulomb field of an atom ( Bremsstrahlung)\ (iv) excitation of 
phonons; (v) ionization of the inner shells of atoms. Signals arising by the latter process 
can be used for analytical studies (fig. 29). 

In the ionization process of inner shells, a sufficiently energetic electron can interact 
with an atom and cause the ejection of a tightly bound inner-shell electron, leaving the 
atom in an ionized and highly energetic state. The energy transferred to the bound 
electrons must be at least the energy of the bound state of the electron in a shell of the 



Fig. 29. Inelastic scattenng of incoming electrons at electrons of inner shells (schematic drawing). The 
characteristic X-rays and the energy losses are used for analytical purposes. 



Ch. 11, § 10 


Transmission electron microscopy 


1087 


atoms. Subsequent decay of this excitation state results in the emission of characteristic 
X-rays and Auger electrons. The X-ray yield is plotted in fig. 30 for different elements. 
Qualitative and quantitative information on the presence of the different elements can be 
obtained from the wavelength of a characteristic X-ray and from its intensity. A detailed 
treatment of the properties of characteristic X-rays is found in the literature (e.g., Bertin 
[1975] and Goldstein el al. [1981]). 

In electron energy loss spectroscopy (EELS) the inelastic scattering of the incident 
beam is studied directly. The intensity of the inelastically scattered beam is measured as 
a function of energy loss by spectral analysis. At characteristic energy losses the 
intensities of the inelastically scattered electrons increase due to the ionization of inner 
shells. From the position of the edge and the increase in intensity, information is again 
available on the chemical composition. EELS is most powerful for light elements (atomic 
number Z< 12), whereas the analysis of the characteristic X-rays is possible for Z>9. 

The cross-section for the inelastic processes (i) to (v) has been calculated (see 
Goldstein et al. [1981]) for different metal targets as a function of energy. All of the 
cross-sections are observed to decrease with increasing energy. Considering a range of 
atomic numbers, inelastic scattering is favoured at low atomic numbers and elastic 
scattering at high atomic numbers. 

Inelastic scattering occurs by a variety of discrete processes, with a variable amount 
of energy transferred to the solid, depending on the strength of each interaction. Cross- 
sections for the individual processes are difficult to obtain for all targets of interest. It is 
useful in many calculations to consider all inelastic processes grouped together to give 
a “continuous energy loss” (Bethe [1933]). 

It is found experimentally that an appreciable fraction of the beam electrons which 
strike a target subsequently escape where the electrons enter the solid (backscattered 



Fig. 30. X-ray yield and Auger-electron yield for K-shell excitation for different elements. 
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electrons). Backscattering electrons provide an extremely useful signal for imaging in 
scanning electron microscopy and scanning transmission electron microscopy. For proper 
image interpretation, it is necessary to understand the properties of the backscattered 
electrons as a function of the parameters of the beam and the characteristics of the 
specimen (for a detailed review see Niedrig [1978]). 

It is well established by experiments and by Monte Carlo calculations that the volume 
of interaction between a high-energy electron beam and a solid possesses the shape of a 
pear (fig. 31). The width of the neck of the pear is mainly determined by the shape of 
the electron beam, while the length of the neck depends on the energy of the electrons 
as well as on the material investigated. The diameter of the pear can exceed several 
microns (/xm). Therefore, studies with high spatial resolution can only be performed in 
thin foils. 

Those studies are preferentially performed in a conventional TEM fitted with a STEM 
unit, appropriate detectors for X-rays and energy loss spectra. 

10.2. Quantitative analytical electron microscopy of thin foils: analysis of X-rays 

In X-ray microanalysis the detector is as close as possible to the specimen. Usually, 
the detectors are energy-dispersive devices, which offer the advantages of speed of 
operation and ease of attachment to the system. Crystal spectrometers (wavelength- 
dispersive devices) are usually preferred for quantitative studies. However, it is very 
cumbersome to attach those to a transmission electron microscope. An example of an 
EDS X-ray microanalysis spectrum in the STEM mode of a TEM is shown in fig. 32. 


electron 




thin 

foil 


Fig. 31. Schematic representation of the interaction of a high-energy electron beam with a bulk sample and a 
thin foil, respectively To the first approximation absorption and fluorescence effects can be ignored in the thin 
specimen. 
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Fig. 32. STEM X-ray spectrum from a low-carbon steel. The Cu peak is caused by the Cu grid of the TEM. 


10.2.1. Cross-section for inner-shell ionization 

Numerous cross-sections for inner-shell ionization can be found in the literature; this 
has been reviewed by Powell [1976]. The number of X-ray photons produced per 
incident electron in a thin foil of thickness Ax can be predicted from 


n x = (twNq — pAx, 
A 


(57) 


where cr is the ionization cross-section, vv the fluorescence yield (see fig. 30), N 0 
Avogadro’s number, A the atomic weight, and p the density of the material. Experimental 
measurements of the X-ray yield (absolute efficiency) give a range of - 0.11 x 10" 4 for 
Mg to - 0.933 x KT 4 for Ni in units of X-ray photons per electron and per angular unit. 


10.2.2. Thin-film approximation 

If the sample is sufficiently thin to carry out quantitative transmission electron 
microscopy at > 100 kV, the activated volume is approximately a cylinder equal to the 
beam diameter (see fig. 31); that means a high lateral resolution can be achieved. This 
high lateral resolution, however, results in a disadvantage in the use of thin specimens in 
that activated volume for X-ray production is small compared to a bulk-sample; x-ray 
counting rates are low and hence the potentioal accuracy is inferior to that generally 
accepted for bulk specimens in scanning electron miscroscopy. 

Cliff and Lorimer [1975] showed that for a thin foil, to a first approximation X-ray 
absorption and fluorescence in the specimen can be neglected. Therefore, the ratio of two 
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observed X-ray intensities / A // B can be related to the corresponding weight fraction ratio 
c A /c B by the equation 

c a/ c b = *ab/aAb. (^) 

where is a constant at a given accelerating voltage and is independent of specimen 
thickness and composition. A normalization procedure, e.g., 2c„ = 1, must be used to 
convert the ratios of the weight fractions into weight percentages. In mineral specimens, 
which are frequently used for calibration, assumptions must also be made concerning 
oxidation states, e.g., it is impossible to differentiate between Fe 3 0 4 and Fe 2 0 3 if ratios 
are measured, and oxygen cannot be detected (Lorimer etal. [1981]). 

For quantitative analysis the parameters in eq. (58) have to be determined 
experimentally and also compared to theoretical results. There are discussions in the 
literature on the reliability of different kAB values available (e.g., Geiss [1981]). 
Furthermore the effects of absorption of the X-rays and fluorescence in the specimen 
have to be considered for a reliable quantitative analysis (see § 10.2.4). 

10.2.3. Beam-spreading in the specimen 

Beam-spreading in thin foils is currently an active field of research in both exper- 
imental investigations and theoretical analysis (Lorimer etal. [1981] and Geiss [1981]). 

A simple approach to the problem of beam-spreading has been proposed by Gold- 
stein etal. [1977] who assumed that a single elastic scattering event occurs at the centre 
of the foil, and defined the X-ray source as that volume in which 90% of the electron 
trajectories lie. Experimental observations surprisingly show good agreement with the 
simple theory by Goldstein et al. [1977]. The beam-spreading limits the spatial 
resolution of analytical TEM, as shown in table 1. 

10.2.4. Errors limiting the data of X-ray analysis 

X-ray counting statistics are usually assumed to obey Gaussian behaviour and at the 
2d confidence level the relative error in the number of counts I is 2 \[T . Using eq. (58) 


Table 1 


Spatial resolution for X-ray microanalysis at 120 kV for incident electron probe sizes defined by d=0 nm 
and d = 5 nm (after Lorimer etal. [1981]). 
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a ratio of two counts is converted into a weight-fraction ratio via the constant k AB . Thus 
it is necessary to add the total relative errors in / A , I B and k AB to obtain the relative error 
in the weight fraction c\/c B would be 6%. Unfortunately, X-ray counts often a premium 
in thin specimens, and it may be necessary to accept only a few hundred counts for one 
element. If, for example, / A contained only 900 counts and / B stilled equalled 10000, 
then the relative error in the c A /c B ratio would be ± 10%. The statistical error was often 
not taken into account in quantitative analytical electron microscopy (AEM). Therefore, 
materials-science interpretation of AEM work is bases on weak or incorrect foundations. 

In order to make corrections for absorption, fluorescence and beam spreading, it is 
necessary to know the thickness of the sample. Various parallax techniques can be used, 
including the contamination spots formed on the top and the bottom of the foil during the 
analysis. However, it seems that the most accurate technique is that of convergent-beam 
diffraction as described by Amelinckx [1964] and Kelly et al. [1975]. This enables 
specimen thickness to be determined with an accuracy of ± 2%. 

Many investigators have been aware that the measured X-ray spectrum may contain 
X-rays generated from other than the sample region excited by the focused electron 
beam. These additional X-rays are called “spurious” or “extraneous” radiation. This 
problem has received much attention (cf. Goldstein and Williams [19811). The 
spurious radiation can be avoided by a correct experimental arrangement of the specimen 
in the analytical electron microscope. The absence of spurious radiation can be checked 
by a hole count. 

10.2.5. Examples 

Since about 1979 reliable results obtained by AEM appeared. Ciiampness et al. 
[1982] determined the partitioning of various alloying additions during the austenite- 
pearlite transformation in eutectoid steels. The composition of different precipitates in a 
variety of steels was obtained and first results on segregation of impurities at grain 
boundaries and interfaces were reported. More results are published in the proceedings 
of recent workshops and conferences on AEM (Hren etal. [1979], Lorimer etal. [1981] 
and Geiss [1981]). 

10.3. Quantitative analytical electron microscopy of thin foils — electron energy 
loss spectroscopy 

A transmission microscope fitted with an electron spectrometer can be used to 
produce energy-filtered images (Krahl et al. [1981]) or for electron energy-loss 
spectroscopy (EELS) (Joy [1981]). The latter can be used for measurements of inner- 
shell loss for quantitative microanalysis of light elements. 

There are two instrumental principles used nowadays. They are based on (i) the use 
of a magnetic analyzer below the camera chamber of a conventional TEM or a TEM 
with STEM attachments (Krivanek and Swann [1981]) and (ii) the use of an analyzer 
between the objective and the intermediate lens in a conventional TEM (Zanchi et al. 
[1977]). Figure 33 shows schematically the energy spectrum of 100 kV electrons after 
transmission through a specimen of about 25 nm thickness. The elastically or (by 
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Fig. 33. Schematic electron energy-loss spectrum of the signal intensity 1(E) versus energy loss E. The hatched 
area close to the energy E K is proportional to the number of atoms present in the specimen. 


phonons) quasi-elastically scattered electrons give rise to a zero-loss peak whose width 
provides an indication of the instrumental energy resolution. For small energy losses 
(< 100 eV) surface and bulk plasmon losses are observed (Jouffrey etal. [1978]). For 
higher energy losses E, inelastic scattering is caused at electrons of the outer atomic 
shells (e.g., valence electrons). The intensity of the inelastically scattered electrons 
decays rapidly, approximately as AE *, where A is a constant and r is between 2 and 6. 
At an energy loss equal to the ionization energy E k of an inner atomic shell k (where 
&=K, L, M) the energy loss intensity increases rather abruptly, followed by a slower 
decay. By determining A and r from a region just preceding the ionization edge, the 
outer-shell background can be subtracted (by a computer, or manually using a log-log 
plot). The area I k (a, A) above the background is measured up to an energy loss E k + A 
(see fig. 33). The number of atoms per unit area of the corresponding element is then 
given by (Joy [1981]) 

N = [/* {a, A )//, (a, A)]/c r* (a, A), (59) 

where cr k {a, A) is a partial cross-section for the ionization of shell k within an energy 
range A and for scattering angles up to a. The angle a is determined by the collecting 
angle of the spectrometer. For a quantitative analysis cr k (a. A) can be determined 
experimentally or by calculations (Egerton [1979] and Joy [1981]). 

Unlike X-ray or Auger microanalysis, energy-loss spectroscopy therefore provides a 
convenient standardless method for determining the absolute number N of atoms of light 
elements. In addition, the ratio of two (or more) elements may be determined from the 
appropriate ionization edge, provided an equal integration range A is chosen for both 
elements. For that case I { (a, A) will cancel in eq. (59) so that the low-loss region needs 
not to be measured. 

As the specimen thickness increases, multiple-scattering contributions to the back- 
ground beneath an inner-shell edge cause the background to increase faster than the 
inner-shell “signal”, so that the signal/background ratio falls, which reduces the visibility 
of an edge and may make the background-fitting and subtraction process insufficiently 
accurate (Joy [1981]). Therefore, judging by the criterion of signal/background ratio, the 
specimen should be as thin as possible within the analyzed region. 
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However, by making the sample very thin we reduce the amount of valence-electron- 
and inner-shell scattering, so that the relative noise level in both signal and background 
is increased. Consequently, the signal/noise ratio (due to the finite number of electrons 
detected within the energy window chosen for analysis) initially increases with thickness 
and then goes through a maximum at a sample thickness of the order of A„ the total 
mean free path for inelastic scattering. This last statement however ignores the practical 
problem of separating the signal from the background (i.e., background-fitting and 
subtraction) which generally becomes more difficult as the thickness increases and the 
signal/background ratio deteriorates. In practice it is generally found that the optimum 
specimen thickness is closer to A/2, i.e., 60 nm for 100 keV electrons (Egerton (1979, 
1981]), 50-100 nm for 200 keV, and 100-200 nm for 1000 keV incident energy 
(Jouffrey et al. [1978]). 

The sensitivity of a microanalytical technique is an important factor in determining 
its usefulness. For EELS, as for EDS, the limiting condition will be the statistical 
requirement for a minimum signal-to-noise ratio between the edge and the background. 
Because of the dominance of the background in the spectrum the, sensitivity will depend 
very much on the experimental parameters such as beam current, recording time and 
acceptance angle. Consequently, detailed calculations are necessary in order to obtain 
realistic results. Joy and Maher [1980] calculated the sensitivity for different atoms 
distributed in a carbon matrix, under a variety of experimental conditions typical for 
STEM operation. 

Although the absolute detection limits are very good, typically 10"' 9 g, this repre- 
sents, however, a mass- or atomic fraction of only between 0.5 and 5% in most cases. 
This indicates that EELS is best suited for studies of precipitates, segregation effects 
and similar situations where the local mass fraction is high or can be made so by 
choosing the probe diameter. It can also be seen that much of the potential gain in 
microanalytical sensitivity derived from EELS operation is sacrificed when the normal 
sequential mode of spectrum recording is used, since the analysis time is only of the 
order of a few hundred milliseconds per channel compared with 100 s or more for EDS. 
However, recent developments in parallel detector arrays which acquire data simulta- 
neously in many channels will ultimately remove this disability and yield a substantial 
gain in sensitivity. 

10.3.1. Examples 

EELS is applied mainly to ceramic specimens since it is the only technique for the 
determination of the distribution of light elements; an example is shown in fig. 34, 
below. Grain-boundary phases could be detected and analyzed chemically (Geiss [1981], 
RCthle and Petzow [1981]). Ion implantation is sometimes employed to introduce 
light-element dopants into semiconductors. EELS could be used to measure local dopant 
concentrations, if the latter exceed -0.1% (Joy and Maher [1980]). Light elements 
occur in certain metal specimens in the form of nitride or carbide precipitates; these 
materials have also been analyzed by the EELS technique (Rossouw et al. [1976], 
Geiss [1981]). 
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Fig. 34. Electron energy-loss spectrum of a 25 nm thick amorphous area in a silicon nitride ceramic. Edges in 
the spectrum are found at characteristic energy losses. The concentration of the different elements can be 
determined from the hatched areas: a) Energy losses 250-850 eV. b) Energy losses 1350-1950 eV. (From 
Bischoff [1983].) 
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Appendix. Elements of the kinematical diffraction theory 

A.l. Introduction 

In materials science, scattering or diffraction experiments using some kind of waves 
of wavelengths X comparable to, or smaller than, interatomic distances play a key-role 



Ch. 1 1 , Appendix 


Transmission electron microscopy 


1095 


for the study of the atomistic structure of solids *. This holds in particular for solids of 
more or less perfect crystalline structure. Such experiments are carried out either by X- 
rays, as provided by conventional X-ray generators or, more recently, by synchrotron 
radiation of electron- or positron storage rings, or by the de Broglie waves of thermalized 
neutrons, as provided by nuclear reactors om more recently, by spallation sources or by 
the de Broglie waves of high-energy electrons as used in a transmission electron 
microscope (TEM). 

One of the basic theories which describe the interaction of the above-mentioned 
radiation with matter is the so-called kinematical diffraction theory which can be derived 
as a first Bom approximation, for X-rays from the Maxwell equations, and for neutron 
and electron waves from the Schrodinger equation. The kinematical diffraction theory is 
applicable under the following conditions: 

(i) The scattering at individual atoms is elastic, which means that the wavelength of 
the scattered wave is the same as the wavelength of the incoming wave. 

(ii) The scattering at atoms occurs coherently, i.e., the phase relationship between the 
scattered wave and the incoming wave is always the same for a given kind of atoms. 

(iii) The scattering at individual atoms is so weak that a second scattering of the 
already scattered wave at other atoms can be neglected. 

(iv) The total diffraction at an ensemble of atoms remains so weak that the intensity 
of the incoming wave is practically not affected, i.e., all the atoms of the ensemble 
(specimen) are irradiated with the same intensity. 

Under these assumptions a fairly sophisticated kinematical diffraction theory has been 
developed and applied — using X-ray and neutron waves - to various problems in 
crystallography and materials science, as set out in ch. 12. However, it turns out that this 
kind of theory is only of restricted use for TEM investigations of crystalline material: 
Conditions (iii) and (iv) are in general strongly violated. Therefore, in TEM the theory 
must be extended to the so-called dynamical diffraction theory. Nevertheless, a number 
of elements of the kinematical diffraction theory will be needed also for understanding 
TEM of crystalline materials. Therefore, we give here a brief introduction to the general 
concept of the kinematical diffraction theory, which should be understood as an 
introduction to the more detailed reports in this chapter and ch. 12. 

The amplitudes of X-rays are vectors, because X-rays are electromagnetic waves, 
whereas the amplitudes of neutron and electrons are scalar. This gives some marginal 
differences in the formulation of the kinematical diffraction theory for these two kinds 
of radiation. For simplicity we consider here waves of scalar amplitudes only. 

A.2. Fundamental equations 

The incident plane wave A(r, t ) is given by 


* In the following the term scattering is used for interaction of individual scattering centres, as e.g., individual 
atoms, with radiation. On the other hand, we speak about diffraction if the coherent superposition of scattered 
waves of more than one scattering centre is considered. 


References: p. 1 105. 



1096 


M. Rtihle and M. Wilkens 


Ch. 11, Appendix 


A(r, /) = A 0 exp[27ri(fc 0 r - tf)], (A. 1 ) 

where r is the space vector, t the time coordinate, k 0 the wave vector with l&J = k= 1/A, 
A the wavelength and v the wave frequency. Further, A and A 0 are scalar terms with 
IA 0 I 2 = intensity per unit area perpendicular to k Q . From 

fc 0 • r -vt - const. (A.2) 

we obtain the phase velocity v in direction of k u , 

v = v/k = v ■ A. (A. 3) 


Since we consider only elastic scattering, where the wavenumber k and the frequency v 
remain unchanged by scattering events, we can omit the time-dependent part of A. Thus, 
in the following we start with 

A(r) = A^ exp(2-7ri£ 0 • r) (A.4) 

as the primary wave incident on a number of N atoms. For the moment these atoms are 
assumed to be point-like. They are characterized by atomic scattering amplitudes /„ and 
are located at positions r„ near the origin r = 0; /? = 1 N. At all these atoms, the 
primary wave excites the emission of spherical waves in coherency with the primary 
wave. Thus the sum of the scattered waves can be represented by A s (r), with 


A(r) = -XA(r„)./„ 


exp[27ri£|r - r n | 

k - / -»l 


(A. 5) 


On the righthand side the term A(r„) represents amplitude and phase of the incident wave 
at the positions of the atoms. The term behind /„ represents a spherical wave emerging 
with phase tt from the atom positions. It has the dimension of a reciprocal length. Thus, 
since A, and A must be of the same dimension, the atomic scattering amplitude /„ must 
have the dimension of a length. Accordingly, in neutron diffraction /„ is called the 
“scattering length”, b„. 

A detector for receiving the resultant intensity of the scattered, or diffracted, waves 
may be placed at a position r= R so far away from the N atoms that in the denominator 
of eq. (A. 5) the approximation 

\R - r| = \B\ = R (A. 6) 

is applicable. This is the condition for Frauenhofer diffraction. Further, we assume that 
near the detector all spherical wave fronts can be approximated by plane waves with a 
common wave vector k i parallel to R. Thus, near the detector, we have 

K • { R ~ r n) = k \ R ~ 'J cos y, |* s j = k. (A. 7) 

where y is the angle between k % and R-rn. Since y — » 0 for R — » °°, we set cosy = 1. 
Then we can rearrange eq. (A.5) with eqs. (A.6) and (A.7) to the form 
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A( R ) = ^ f ( k ) exp(27rik, • 7?), 

(A.8) 

with 


k - k s -k 0 , |k = 2k a sin 6 

(A.9) 

and 


f (k) = Y,f» exp(-27riKT„). 

(A. 10a) 


k is called the diffraction vector (in neutron diffraction, k is often used for 4irk sin 6). 6 
is half the angle between k 0 and k s , and F{k) may be called the diffraction amplitude of 
the ensemble of scattering atoms. Note that information about this ensemble (r n ,f n ) is 
only contained in F(k ) and that F depends only on k but not explicitly on the moduli and 
directions of #c 0 and #c s . Equation (A. 10a) may easily be extended to a continuous 
distribution of the atomic scattering amplitude. If pfr) denotes the scattering amplitude 
per unit volume at the point r then eq. (A. 10a) is modified to 

F(k) = f p r (r) exp(-27riK • r)d 3 r. (A. 10b) 

The detector may receive a diffracted intensity 7 S via a window of area Q, with Q/R 2 « 1 . 
Then 7 S is given by: 

7 s (K) = |A s (/?)| 2 e = |A 0 j 2 |F(K)| 2 ^-. (A. 11) 

Q/ R 2 = A ft is the solid-angle interval under which the detector receives radiation from 
the scattering atoms. Thus, IF(ic)l 2 , which has the dimension of an area, will be called the 
differential diffraction cross-section of the ensemble, i.e., the cross-section per unit solid 
angle, d a/ dfl, which is a function of k only. Accordingly, the significance of IF(ic)l 2 is 
rather general, and it is in particular free of “instrumental” parameters of the experiment 
such as the actual values of A 0 , R and Q. Therefore, in the following we concentrate 
mainly on a more detailed evaluation of F(k) or IF(/c)l 2 . 

A.3. Real space and reciprocal space, description of perfect crystal structures 

We assume now that the atoms are periodically distributed, i.e., they form a perfect 
crystal lattice. The elementary cell of the lattice is given by three non-coplanar basis 
vectors a„ a 2 , a 3 , or briefly a p y = 1 , 2, 3,. For simplicity we assume here that the crystal 
is shaped like a parallel epiped, i.e., in the direction of a } the elementary cells are 
repeated Nj times so that the lattice consists of N i N 2 N 3 unit cells. The elementary cell 
contains M atoms, associated with atomic scattering amplitudes/,,, m= 1 M. Then 
the positions of the lattice atoms are given by: 

r = r + r, (A. 12) 

nm n m 7 v ' 

where r„ denotes the position of the elementary cell, 
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r *='L n i a i. 1 ( A - 13 ) 

j 

and r m denotes the position of the with atom within the elementary cell. We express r m in 
dimensionless coordinates ^ related to the a t as: 

r *=l,tl' a r 1 (A. 14) 


Besides the basis vectors a l we introduce so-called reciprocal basis vectors which 
constitute the reciprocal lattice : 


b _ x a 3 

1 V. 


b <h_^i 

z 1/ 


b 3 = 

3 1 / 


(A. 15) 


where x means the vector product and V c the volume of the elementary cell. The bj are 
related to the by: 


a ■ ■ b , 
> j 




(A. 16) 


where 


8# = 1 f ory = f, 

= 0 for j * y. 

Note that the b i have the dimension of a reciprocal length, as is also true for the wave vectors 
k 0 , k % and the scattering vector k. So k can be expressed by a linear combination of the bj, 

k = X* A . (A. 17) 


where the hj are dimensionless numbers. Within the ic-space, or reciprocal space, the 
reciprocal lattice points given by = integers play a special role. We denote them by g 
or, if necessary, by g hu , with 

g = guj = hb l +kb 2 + lb 2 , h,k,l = integers. (A. 18) 

The integers h, k, l are the well-known Miller indices of a given set of lattice planes. In 
standard text books of crystallography it is shown that g hk , is perpendicular to the set 
{ hk[) of lattice planes and that 

kw/l = V^hu’ (A. 19) 

where d M is the repetition distance of the lattice planes (hkl). 

In the following we will denote reciprocal lattice points, depending on the context, by 
g M , or g, or simply by ( hkl ). If the h, k, l refer to the ensemble of crystallographically 
equivalent reciprocal lattice vectors we use the symbol {hkl}. If a particular one is meant 
we write (hkl). For instance, concerning a crystal of cubic symmetry, {100} comprises 
six g-vectors, namely ±(100), ±(010), and ±(001). In quite a similar way we will denote 
vectors in real space in an abbreviated form by r = [mvw] with the meaning 
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r = ua [ + va 2 + wa y (A.20) 

If the ensemble of all crystallographically equivalent vectors r is meant, we write 
r = <uvw>. For instance, in a cubic crystal the symbol <100> means all vectors of the 
kind ±[100], ±[010], and ±[001]. Sets of different lattice planes {hid) with a common 
direction in real space, e.g., r v are called a zone, and r z is the zone axis. The corres- 
ponding vectors g ul constitute a plane in the reciprocal lattice with g hk />r z = 0. In a cubic 
crystal, r z = [100], for instance, is the zone axis for all lattice planes given by (0 Id). Note 
that for crystals of cubic symmetry a real space vector r = [nvw] is parallel to a reciprocal 
lattice vector g = (hid) with u = h, v = k, w = l. For crystals of lower symmetry this is, in 
general, not the case. Considering, for instance, a crystal of tetragonal symmetry (lo 3 l = c 
* Irql =la 2 l=a), it turns out that the real space vector r = [101] is not parallel to the 
reciprocal lattice vector g = (101). Accordingly, if crystallographic elements of crystals of 
lower than cubic symmetry have to be denoted, a careful distinction between vectors r 
in real space and lattice plane normals g in reciprocal space is necessary. 

A.4. The kinematical diffraction amplitude F(K) of a perfect crystal 

Applying the coordinates of the crystal atoms as given by eqs. (A.12)-(A.14), eq. 
(A. 10a) can be written as 

F(k) = F l (k)F s (k) (A. 21) 

with 

f l ( k ) = X ex P(-' 7ri * • T, )- ( A.22) 

n 

where the summation extends over all n„ n 2 , n 3 , i.e., over all unit cells, and with 

^(*) = £ L exp(-27riK ■ r m ). (A.23) 

The subscript “L” stands for “lattice amplitudes”. Accordingly, F L reflects the outer 
shape of the crystal and is independent of details of the structure of the elementary cell. 
The subscript “s” stands for “structure amplitude” by which the structure of the 
elementary cell is taken into account. Both terms have to be considered separately. First 
we derive F L . For simplicity we assume an orthogonal elementary cell where the a t are 
parallel to bj and IZ^I = 1/10,1. (This restriction is not necessary, but facilitates the interpret- 
ation of the final result.) We consider the environment of a particular diffraction vector 
g and write: 

K = g + s, s = '£s 1 b j /|Z> y | = £ Sj a j b j . ( A.24) 

J ) 

Then, with (A.13)-(A.18), 

k • r„ = \n x h + r^k + n^l] + rtjSjOj . (A.25) 
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The term in square brackets is integer and can be omitted in the exponent of eq. (A. 22). 
Thus: 


F d K ) = F l ( s ) = X ex P ~ 2nl 'L n i s i a i 


n r i 

>=1.2.3 


(A. 26) 


For SjOj 1, the argument in the exponent varies smoothly with varying the integers «■. 
Thus, the summations can be replaced by an integration 


^lOO = ^ J^ 0 ex p(- 2 ™X V;) 11 -*! dx 2 dx i 


(A.27) 


sin^L, sin7rr 2 i^ sin 77 X 3 !, 
TTS ] a i Trs 2 a 2 r rrs 1 a 1 


(A. 28) 


with Lj = Njaj denoting the length of the crystal in the direction of a, and V e = a,a 2 a 3 . In 
eq. (A.28) a phase factor was omitted, since only IF L I 2 is accessible by experiment. 

An indication of the “structure” of the lattice factor IFJ 2 in the reciprocal space is 
obtained by considering the square of one of the three factors of F h , e.g., 




sin 2 77 XJL, 

7T 2 X, 2 fl, 2 


(A. 29) 


The following properties of /, can be recognized: /, (x,) is sharply peaked at x, =0 where 
/, reaches the value /V, 2 . In the tails /, oscillates and decreases proportional to x,~ 2 . The 
first zero on both sides occurs at lx,l= 1/L, = \/N l a i . Accordingly, the half-width of /, is 
about l/N l a l . Thus, one may suppose that 

J A(x,)dx, = — , (A. 30) 

a i 


which is indeed the case. (If eq. (A. 26) is summed exactly, it turns out that the terms 
ttsjUj in the denominator of eq. (A.28) have to be replaced by sin itsp-. This small “error” 
in (A.28) is, however, without any practical relevance in general.) 

Having understood the structure of /,(x,), it is easy to understand the three-dimen- 
sional structures of IFJ 2 , 

\ f l \ 2 = /.(*.K(*2K(4 (A. 31) 

Around every reciprocal lattice point g the intensity of IF L I 2 is sharply peaked, and the 
shape of the peak is reciprocal to the shape of the crystal: parallel to a direction where 
the crystal is thin ( Lj small), IFJ 2 is broad, and parallel to a direction where the crystal 
is large (L ; large) IFJ 2 is narrow. For instance, a plate-like crystal gives a rod-like 
structure of the peaks of I.F L I 2 and vice versa. 

Now we proceed to the term /\.(#c), cf. eq. (A.23). We recall that for N ]t N 2 , N 3 a> 1, 
the lattice factor IF L I 2 is essentially * 0 only for #c close to g, or for (h { h 2 h 3 ) close to ( hkl ). 
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Further, we realize from eqs. (A. 14) and (A.23) that F s is, comparative to Fl, a smooth 
function of k [or {h y h 2 hjj\. Therefore, considering k close to g it is allowed to approx- 
imate F s by 

M 

F^fhfh) - F s (hkl) = £ f m exp[-27ri(^r + k& + l &) ]. (A.32) 

We consider F s (hkf) for some simple cubic lattices. 

(i) face-centred cubic (fee) lattice; four equivalent atoms in the unit cell are located 
at r — [000], {[Oil], {[101], {[110], Thus: 

FJJikl) =/{ 1 + exp[-i7r(£ + /)] + exp[-iir[/i + /)] + exp[-iir(/i + £)] } 

= 4 f for h, k, l unmixed, i.e. all even or all uneven 
= 0 for h, k, l mixed. 

(ii) body-centred cubic (bcc) lattice; two equivalent atoms at r= [000], {[111]. Then: 
F s (hkT) = 2/ for h + k + 1 = even, 

= 0 for h + k-l= uneven. 

For these two types of Bravais lattices, F s takes either full value (f multiplied with the 
number of atoms per unit cell) or vanishes. 

(iii) Ordered lattice of type Ll 2 (e.g. Cu 3 Au); one atom of kind A in r= [000], three 
atoms of kind B in r ={[01 1], {[101], {[110]. This gives: 

FJjikl) =/ A + 3/ b for hkl unmixed, 

-/a _ /b f° r hkl mixed. 

Lattice plane of the first category are called fundamental lattice planes and those of the 
second category are superlattice planes. The /^-values of the latter are in general much 
smaller than those of the former. 

A.5. The Ewald sphere and Bragg’s law 

In order to decide whether a crystal gives rise to a diffracted beam for a given 
direction and a given wavelength of the incident radiation we introduce the so-called 
Ewald Sphere in the reciprocal lattice. Figure 35 gives a two-dimensional representation. 
A reciprocal lattice is schematically drawn and the wave vector k 0 of the primary beam 
is inserted in such a way that k 0 ends at the origin of the reciprocal lattice. A sphere, the 
Ewald sphere, is drawn around the starting poine of k 0 (point C) with l& 0 l = k as radius. 
All vectors starting from C and ending on the Ewald sphere represent possible wave 
vectors k. of diffracted waves of intensity IF(#c)l 2 , with k =k i -k 0 . If the Ewald sphere 
cuts a reciprocal lattice point g where !F(#c)l 2 takes high values a strong intensity will be 
diffracted in the direction of the particular wave vector k s , named k g , pointing from C to 
g. We denote the angle between k 0 and k g by 2 Q m . Then we have: 

\g hkl \ = 2ksmd hU , (A. 33) 

or with eq. (A. 19): 
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Fig. 35. The Ewald sphere and the intensity distribution IF(k)I 2 in the reciprocal space. 

A = 2d hkl sin 6 m . (A.34) 

This is Bragg’s law with 0 ^ being the Bragg angle of the set of lattice planes (hkl). 

These Bragg peaks (or “Bragg reflections”) are not infinitely sharp but rather reveal 
a certain finite width (reciprocal to the crystal dimensions). Therefore, appreciable 
intensity will be diffracted not only for one discrete direction of k 0 but for a (more or 
less) small interval of directions. In the same way the wave vectors k s of the diffracted 
beam will cover a similarly small interval of directions. Consequently, for a particular 
direction of k 0 the diffracted intensity I hkl is given by a two-dimensional integration, 

(A.35) 

over the Ewald sphere in the neighbourhood of g m . d/g/k 2 is a differential solid angle dft 
with d/ E = differential area on the Ewald sphere. 

In a practical diffraction experiment the crystal will be rocked with respect to k 0 over 
an angular interval so that the Ewald sphere is swept over the peak at the reciprocal 
lattice point under consideration. If effective source size and detector aperture are 
sufficiently large to capture all the intensity belonging to g, the diffracted intensity 1^ is 
proportional to the integral over the diffraction spot, 

~ J d 3 * = y Hhklf< (A.36) 

'e 

cf. § A.4, where the integration runs over the neighbourhood of g hU , N is the number of 
unit cells and \F i (hkl)\ 2 is the structure factor of the Bragg reflection (hkl). 

A.6. The atomic scattering amplitudes and the Debye- Waller factor 

In §A.l we had assumed that the atoms are point-like scattering centres and that, 
consequently, the atomic scattering amplitudes, or atomic scattering lengths, are constants 
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which are characteristic for the kind of atoms considered. 

This assumption is a very good approximation for nuclear scattering of thermal 
neutrons having wavelengths of the order of 0.1 nm. Nuclear scattering occurs at the 
atomic nuclei the diameters of which are of the order of some 10~ 14 m, i.e., smaller by 
a factor of - 10 4 than the neutron wavelengths. In neutron diffraction physics it is 
customary to denote the atomic scattering length by the symbol b. In general b is positive 
and of the order of 10 14 m; however, for some isotopes b may be negative. Considered 
as a function of the atomic number Z or the atomic mass A, no smooth dependence of b 
on Z or A exists. The values of b may even vary considerably for different isotopes of 
a given element. However, apart from these variations the general trend is that b 
increases roughly proportional to A Ui , i.e., to the size of the nuclei. 

X-rays are (nearly) exclusively scattered by the electrons constituting the electron 
cloud of an atom. From Maxwell’s equations it follows that a single electron scatters X- 
rays of wavelength of the order of 0.1 nm like a point-like scattering centre with a 
scattering amplitude f 0 with 

f 0 - £ 2 /(tf! 0 c 2 ) = 2.8 x 10" l5 m, (A. 37) 

where e is the elementary charge, m Q is the rest mass of the electron and c is the velocity 
of light. Around an atomic nucleus, located at r = 0, the electrons are distributed 
according to a density function p c (r). If the atom of atomic number Z is neutral, 

Jp,(r)d 3 r = Z. (A. 38) 


Applying eq. (A. 10b) we get for the atomic scattering amplitude 

/ X =/(/>(*). 

with 

/,(«) = Jp e (*)exp(-K • r)d 3 r. (A.39) 


The index “X” stands for X-rays. The term f t is named the atomic form factor and is 
tabulated in general as a function of licl/2 = sin 0/A . Obviously, we have / a = Z for 
sin 9/A =0. With increasing sin 0/A,/ decreases monotonously. In a very rough approx- 
imation it decreases to about half its maximum value for sin 6/ A ~ 1 /d A where d A is the 
atomic diameter. 

Transmission electron microscopes operate at accelerating voltages U of 100 kV or 
higher. Taking U = 100 kV, we get the corresponding de Broglie wavelength from 



( eU \ 

Ml 

2m 0 eU 




t 2m 0 c ) 



(A.40) 


giving A = 3.7 pm (h = Planck’s constant). The wavelengths of such electrons are thus 
smaller by a factor of about fifty than the wavelengths normally used in X-ray and 
neutron diffraction work. Because of Bragg’s law [eq. (A.34)] the Bragg angles 0 hkl are 
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also smaller by about the same factor. Electrons are scattered at an atom by the atomic 
electrostatic potential V(r) built up by the charge +Ze of the atomic nucleus and by the 
charge distribution -ep e (r) of the surrounding electron cloud. Since charge distribution 
and potential are linked by Poisson’s equation, 

V 2 V(r) = Aire ■ p e (r), (A.41) 

(V 2 = dVD.r 2 + d 2 /dy 2 + dVdz 2 ), the atomic scattering amplitudes for electrons are closely 
related to/ a . The result is 


/ e (sin 0/A) 


m 0 e 2 Z-fX sine/A) 
2/i 2 (sin 0/A) 2 


(A.42) 


with m 0 e 2 /2h 2 = 2.39 x 10 s /m./ e decreases rapidly (more rapidly than / a ) for small values 
of sin 0/A but only slowly, «Z/( sin 0/A) 2 , for large sin 0/A where / a becomes small 
compared to Z. The value / e (0) will not be discussed here. 

Diffracted intensities are proportional to the square of the atomic scattering ampli- 
tudes. In order to compare the different scattering strengths for neutron-, X-ray- and 
electron waves, we choose as a representative example the 200 Bragg reflection of a 
copper crystal. With sin 0 2O o/ A = 1/2^200 we get for copper sin 02,*/ A = 2.76 nm" 1 . Taking 
tabulated values for b, / a and / e we obtain: 

neutron waves: b 2 = 5.8 ■ 10~ 29 m 2 . 

X-rays: f£ = 3.10“ 27 m 2 , 
electron waves: / e 2 =8.7 ■ 10 _2 ° m 2 . 

It is obvious that neutrons are least effectively scattered and that electron scattering is 
orders of magnitude stronger than X-ray scattering. For Bragg reflections of high order 
OShJ l^rge or d hkl small), the difference between neutron and X-ray scattering becomes 
less spectacular, because f x decreases, but not b, with increasing sin 0/A. 

There is another effect which reduces the intensities of Bragg reflections with 
increasing Igl (or decreasing d) even for “perfect” crystals: The atoms are not at rest 
because of thermal vibrations. This leads to an additional “smearing-out” in space of the 
scattering power of the individual atoms. If (u 2 ) is the mean square displacement of an 
atom from its ideal lattice position by thermal vibration the atomic scattering amplitudes 
/ have to be multiplied by a factor e“' u , with 

M = 8ir 2 <u 2 )^ r --. (A.43) 

A 

Of course, (u 2 ) increases with temperature T. For T> 0 U , where 0 D is the Debye tem- 
perature (for copper, @ D = 320 K), <u 2 >, and thus M, increases about linearly with T. 
The diffracted intensities are damped by the factor e _2Af . Because g 2 , this damping is 
particularly effective for high-order reflections. The factor eT M is named Debye-Waller factor. 


A.7. Imperfect crystals 

The Debye-Waller factor as an unavoidable consequence of thermal vibrations is but 
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one example for the effect of displacements of atoms from ideal lattice positions. Apart 
from thermal displacements, “static” displacements are usually encountered with lattice 
defects, e.g,, stacking faults, point defects, dislocations (including solute atoms), clusters 
and coherent precipitates. Displacements will generally vary with the position in the 
crystal. Let r n denote the position for the nth unit cell of the ideal reference lattice and 
Rn=R(r„) the displacement of the first atom in this cell, R nm are the corresponding 
displacements from the other ideal positions r nm in the nth cell. A factorization of l/ r (/c)l 2 
as in eq. (A.21) is now no longer possible. Instead, we can write: 

F(k) = X ( K ) exp(-27riic • r„ ) exp(-2mK • R n ), (A.44) 

n 

with 

(*) = X f<m exp(-27riK • r nm ) exp[-2mK • (R im - R„ )] (A.45) 

in 

for the structure amplitude of the nth cell. 

Equation (A.45) shows that positional disorder alone leads to different structure 
amplitudes for different cells. Furthermore, eq. (A.45) also shows the general effect of 
compositional disorder (chemical disorder), as f nm reflects exactly the occupation of the 
various sites by different atoms. In the special case, where compositional disorder is not 
considered and where the displacement vector R„ m can be regarded as constant for all 
atoms of the elementary cell, the structure amplitude F s (k) remains unchanged, and using 
eqs. (A.24) and (A.25) the lattice amplitude F h can be written in the integral form: 

f l( k ) = f v exp{-27ri[g • R(r) + s • r + s ■ J?(r)]}d 3 r, (A.46) 

where the integration runs over the sample volume V s . In many cases the term s*R can 
be neglected. These general expressions are the basis for more detailed evaluations in this 
chapter and ch. 12, where several special cases are discussed. 
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Addendum 

Since the release of the third edition of Physical Metallurgy (R. W. Cahn and P. Haasen, 
eds.) enormous progress has been made regarding instrumention as well as further 
development in transmission electron microscopy of materials. The description of the 
basic physics of the scattering processes of electrons within a specimen still holds as 
described in the third edition. The main applications of transmission electron microscopy 
in materials science are still concentrated mainly on conventional transmission electron 
microscopy; however, high-resolution transmission electron mciroscopy and analytical 
electron microscopy are becoming powerful tools, too. This addendum will focus on 
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progress made in different areas of transmission electron microscopy. 

Transmission electron microscopy (TEM) is becoming more specialized and can be 
divided into conventional transmission electron microscopy (CTEM), analytical electron 
microscopy (AEM) and high-resolution transmission electron microscopy (HRTEM). In 
CTEM the morphology of materials as well as the distribution of different phases can be 
determined as described in sect. 4. Information on structure can be obtained with great 
success. In addition, the analysis of defects such as interfaces (phase boundaries, etc.), 
dislocations is possible (sect. 5 to 8). Recently, major progress has been made in 
determining the dislocation structure in intermetallics with emphasis on detailed analysis 
of the splitting of superdislocations into partial dislocations using the weak beam 
technique (Yu and Karnthaler [1993], Karnthaler et al. [1993]). 

In addition, in situ experiments in the microscope allow heating, cooling and 
deformation of the specimen. The latter technique revealed dynamic processes (Louchet 
and Saka [1993] and Fujita et al. [1991]). Interesting results could be obtained for an 
understanding of deformation mechanisms in metals and ceramics, radiation damage, 
crystal growth and corrosion. 

In AEM the main emphasis of present studies is on extracting information on chemical 
composition with high spatial resolution (Egerton [1986], Joy etal. [1986] and Reimer 
[1993]). The technique is applied especially for the determination of the composition at 
or close to the interface and gradients in materials (Krivanek [1991] and Disko et al. 
[1992]). Recently, special techniques of electron energy loss spectroscopy (EELS) have 
come to allow the determination of bonding and electronic structure of materials. 

HRTEM allows the determination of the structure of materials, especially the analysis 
of specific defects in materials (Spence [1988], Buseck etal. [1988], Cowley etal. 
[1992] and Horiuchi [1994]). These defects are dislocations in specific orientations in 
the thin film and interfaces. With great success the structure of complex high-temperature 
superconductors could be revealed by TEM (Horiuchi [1994], Mitchell and Davies 
[1988], Matsui and Krakow [1995]). 

This progress was possible because of major advances in instrumentation as described 
in the first section below. 

B.l. Instrumentation 

Over the last ten years enormous progress has been made towards better instrumen- 
tation and auxiliary parts. In conventional TEM it is now quite common that near ultra- 
high vacuum conditions can be reached close to the specimen in a microscope. The 
conditions of the specimen environment are extremely well-defined and the resolution 
power of conventional TEMs (operating at 200 kV to 400 kV) approaches 0.2 nm or 
better depending on the high-voltage of the instrument. This enormous progress was 
made through middle-voltage instruments where the high voltage is in the range of 
300 kV to 400 kV resulting in a resolution of better than 0.2 nm. 

The specimen stages of the different instruments are mechanically extremely stable. New 
developments in fine mechanics allow a stability of the specimen which often results in a drift 
rate smaller than 0.001 nra/s. Large progress has therefore been made in the analysis of 
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defects as described in sect. 5 since extremely long exposure times are now possible. 

The acquisition of data is now also possible with slow-scan CCD (charge-coupled 
device) cameras (Reimer [1993]) which possess a large linear detection range. 

Field emission guns are available that operate at high voltages between 100 kV and 
300 kV. These guns possess an extreme brightness and coherence (Reimer [1993] and 
Spence [1988]). 

In AEM, the high-voltage stability and sensitive X-ray detectors (Newbury et al. 
[1986]) together with very sensitive energy parallel detection systems for energy loss 
spectroscopy, result in shorter exposure times (by an order of magnitude compared to ten 
years ago). Of course, the same physical principles apply; however, the detection limit 
in AEM was drastically reduced by more sensitive EDS detectors and by parallel EELS 
detection systems. 

Since 1990, energy-filtering electron microscopes or imaging filters are available 
which allow imaging with elastically scattered electrons only (Reimer [1995]). All 
inelastically scattered electrons are filtered out of the image, resulting in micrographs of 
much higher contrast. The filtering can be done either by a system of magnetic sector 
field segments built into the microscope column or by an imaging filter mounted at the 
bottom of the instrument (Reimer [1995]). 

In HRTEM the resolution power of the instruments has been pushed towards 0.1 nm. 
This resolution is reached by using high-voltage high-resolution electron microscopy 
instruments operating at 1 MeV or beyond. A resolution of 0.1 nm can be reached 
(R0heec/«/. [1994]). 

B.2. Conventional transmission electron microscopy including weak beam 

In conventional TEM the resolution of weak beam techniques has reached its 
theoretical limits. The limitation is caused by the width of the contrast lines which in 
turn is correlated to the extinction length of the operating reflection. The observation of 
splitting of dislocations down to the 1 nm level could be achieved and valuable infor- 
mation on the splitting, especially, of superdislocations in intermetallics could be 
obtained (Liu et al. [1992]). 

B.3. Analytical electron microsopy 

In AEM the energy-filtered convergent beam electron diffraction (CBED) (Spence 
and Zuo [1992], Reimer [1995]) has become an important tool for the analysis of crystal 
structures. A convergent beam with a converging angle in the order of 10” 2 is falling on 
a specimen and scattering occurs in different directions. From the structure within the 
diffraction disc of a CBED information on the structure of the material can be obtained 
with high spatial resolution. Scattering factors for different reflections can be calculated 
by comparing the calculated intensity distributions within a disc of the CBED with 
corresponding experimental energy-filtered micrographs. Using this method, the scattering 
factors of reflections in silicon could be reproduced with high accuracy. Recently, :he 
scattering factors of inlermetallic alloys could also be determined and the Debye-Waller 
factor could be determined for the different elements existing in the ordered alloy. 
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The electronic structure of the material with high spatial resolution can also be 
determined by spatially resolved valence electron energy loss spectroscopy (Srkels) 
(Mullejans et at. [1994], Bruley [1993]). Recently, it was shown that the spatial 
variation of the electronic structure of interfaces can be determined from valence 
spectroscopy. This value is critical to both interatomic bonding at atomically abrupt 
interfaces such as grain boundaries and also for the development of van der Waals 
attraction forces at wetted interfaces (Hoffmann and Petzow [1994]). The physics for 
the quantitative analysis of Sreels requires a series of corrections of transformation of 
data including single scattering deconvolution, convergence correction and Kronig- 
Kramer analysis (MCllejans et at. [1993] and French et at. [1995]). With this 
technique the bonding across clean interfaces and wetted interfaces could be determined. 
The near-edge structure of EELS gives some information on bonding across the 
interfaces and the atomic environment at defects. The experimental technique is 
conducted as follows. First a spectrum is taken at the bulk material (next to the interface) 
and then another spectrum at the interface itself. The difference of both spectras results 
in a interface specific contribution. For this analysis a linear addition of the different 
components is assumend. This technique was applied for grain boundaries in alumina 
(MCllejans et at. [1993]) as well as metal/ceramic interfaces (Bruley et at. [1994]). 
The type of bonding and the environment of the bonding could be determined with high 
precision. 

B.4. High-resolution transmission electron microscopy 

Similar advances have also been made in the analysis of micrographs obtained by 
HRTEM. Methods and programs have been developed that allow the simulation of 
HRTEM images of any given atom arrangement. These simulated images are used to 
interpret experimentally obtained micrographs. The recent developments enable us to use 
HRTEM as a method for solving problems in materials science. Atomic structures of 
different lattice defects, such as phase boundaries, grain boundaries and dislocations, can 
be determined by HRTEM (Finnis and RChle [1993], Sutton and Balluffi [1995]). 
The difficulties in the interpretation of HRTEM are similar to those in conventional 
transmission electron microscopy of lattice defects in crystalline materials: the exper- 
imentally obtained micrographs do not usually present a direct image of the object. 

The extraction of information on the structure of lattice defects from HRTEM 
micrographs is complicated (Spence [1988] and Buseck et al. [1988]). Despite this, very 
useful information has been obtained on the structure of lattice defects using HRTEM, 
particularly for semiconductors. The new generation of instruments also allows the 
characterization of defects in ceramics and metals by HRTEM. 
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1. Introduction 

There are many textbooks and monographs on X-ray and neutron scattering (see the 
selection at the end of this chapter), and the number of publications is immense. This short 
chapter cannot give a complete account of the history or the current state of the field. The 
basic theory, standard methods and many of the more classical applications are well 
described in various textbooks (a brief compendium of fundamentals of scattering and 
crystallography is given in the Appendix to ch. 11). The main purpose of the present 
chapter will be to demonstrate that there is much more we can “learn from scattering 
experiments besides the average structure” (Schwartz and Cohen [1987], ch. 7). 

One immediate question concerns the range of stability of a given structure, and 
scattering experiments are helpful in locating phase transformations. There are classical 
methods (e.g., powder diffraction) and very sophisticated recent techniques (e.g., quasi- 
elastic neutron scattering) to study transformations and phase separation in metallic 
systems. Some examples will be discussed. Much space, however, will be devoted to 
studies of inhomogeneities, i.e., deviations from the average structure (point defects, 
clusters, short-range order, precipitates, etc.). 

In §2, some remarks on scattering from crystalline material that contains defects are 
followed by a discussion of X-rays and neutrons as the two types of radiation now 
commonly used for scattering studies of essentially bulk materials. Scattering of low- 
energy electrons, important in the investigation of surfaces and surface phenomena, will 
not be covered. High-energy electron diffraction is discussed in ch. 11. Throughout this 
chapter, scattering intensities are expressed in terms of scattering cross-sections (differen- 
tial cross-sections without energy analysis, double-differential cross-sections for scattering 
experiments with energy analysis), mostly applicable to both X-rays and neutrons. The 
reader should not underestimate the difficulties in converting measured intensities to 
absolute cross-sections. Calibration, absorption, background, polarization and other 
corrections may introduce considerable errors. These problems and questions of 
instrument design and optimization can only be mentioned occasionally. 

§§ 3-6 cover recent applications, and the subject matter has been divided according 
to simple criteria. Diffraction and elastic scattering at and near Bragg peaks, between 
Bragg peaks, and near the incident beam are treated in §§ 3-5, and § 6 gives a few 
examples for inelastic and quasi-elastic scattering. Theoretical expressions are kept to a 
minimum but some are required to interpret the main effects. There are several sys- 
tematically elegant and quite general schemes to describe the same scattering phenomena 
(starting, e.g., from lattice sums, correlation functions or convolutions). No formal 
elaboration of this type will be attempted. Results of the kinematical theory and of first- 
order Born approximation adapted to simple but manageable cases will mostly be relied 
upon. Dynamic diffraction theory as the basis of transmission electron microscopy is 
discussed in ch. 11, and related X-ray and neutron applications can be found there. 
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2. Scattering from real crystals 


2.1. General predictions of the kinematical theory 

As can be seen in ch. 11, eq. (A.29), the kinematical scattering theory for an 
otherwise perfect, but finite crystal yields Bragg peaks whose width Ak s equals 1 /L K 
where L k is the linear dimension of the crystal parallel to k. As Ax g is independent of 
the magnitude of g, this “finite size effect” broadening can be recognized by comparing 
the width of different Bragg peaks (at Bragg angles 9 llkl ). For powders, we thus have a 
method of determining crystallite sizes, since a constant Ak means: 


L cos 6 hk i ’ 



with L as the average crystallite diameter. However, experimental conditions will 
severely limit the range over which peak widths are controlled by the crystallite size 
according to eq. (1). If L is too large (> 1 /rm), we may reach the resolution limit of the 
experiment, and if L is too small (< 100 A = 10 nm), it becomes difficult to separate the 
diffraction lines (see, e.g., Barrett and Massalski [1980J for a detailed discussion). 
For large bulk samples, the size broadening may be ignored, and the differential cross- 
section for Bragg scattering may be written as [cf. eq. (A. 36), ch. 11]: 


do-^ 

d?2. 





( 2 ) 


where S(k -g) is the (three-dimensional) delta function and V c is the unit-cell volume. 
Nevertheless, a line or peak broadening is frequently found in real crystals — caused by 
imperfections. 

We discuss the consequences of imperfections on the scattering pattern starting from 
the general expressions given in § A. 7 of ch. 11. If defects are present in a crystal, the 
displacements u n and the structure amplitudes F sn will be different in different cells. For 
specific defects it is useful to express these quantities in terms of the properties of 
individual defects and their mutual arrangement. If t denotes all possible positions for a 
defect, and u ln is the displacement vector at position n related to the defect at /, the total 
displacement u n may be expressed as 

=X C ' U '"' (3) 

t 

where 

f 1 if a defect is present at site t , 

c. = \ (4) 

[0 if no defect is present at site t. 


This simple superposition is certainly justified for low defect concentrations but may be 
questionable in the case of concentrated alloys. 

The corresponding expression for the structure amplitude of the nth cell reads 
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F sn = F sO+H C A F *'n> ( 5 ) 

/ 

where F s0 is the structure amplitude of the defect-free crystal. Replacing u n and F M in eq. 
(A.44), ch. 11 by eqs. (3) and (5), the mean value for IF(#c)l 2 , i.e., the scattering cross 
section dcr/dJl of the scattering ensemble, calculated for a random distribution of defects 
of concentration c, is (see Krivoglaz [1969]): 



where 

= c X{l ‘ ex p[“ 27r ‘ K '{ u m ~ «,„')]} 1 + 4r {A F s,n + AF m .) , (7) 

t L "so 

and F is the average structure amplitude of the crystal with defects: 

= (8) 

! 

It is now possible to split dcr/dJl into two terms, a Bragg-like term with sharp maxima 
at the Bragg positions, and a second term describing scattering for all other scattering 
vectors. Taking into account that for the Bragg peaks, dtr/dfl from the double sum of 
eq. (6) is controlled by terms corresponding to large distances/) =lr n -r„J, we may write: 

where 2 M is the real part of 9" according to eq. (7) for p — » 

2 M = c lim jl - cos[27TK*(u m - «,„')]} 1 + + . (10) 

p ~*°° r 1 JJ L ^sO 

Equation (9) defines a generalized Debye-Waller factor e~ M . The reciprocal lattice vec- 
tors g occurring in eq. (9) are defined in the average lattice that contains defects. Defects 
may cause a shift of Bragg peaks because the average lattice constant changes, but also 
a reduction in intensity due to the factor e _2W [eqs. (9) and (10)]. As long as 2 M remains 
finite, Bragg peaks will still be present, but there are cases where 2 M tends to infinity. 

If we subtract the Bragg intensity according to eq. (9) from the total scattering cross- 
section, eq. (6), we obtain the diffuse scattering cross-section 


do - drr ( d a \ 

Zf2j d ~ ~dh VdfiJu 


01 ) 
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The cross-section ( dcr/dfl ) d contains no 5-function but varies smoothly with k, even for 
an infinitely large crystal. Krivoglaz [1969] distinguishes two types of defects depend- 
ing on whether or not the Bragg intensities are reduced to zero [eq. (9) with 2M — » °°], 
The limiting behaviour of 2M [eq. (10)] may be discussed by considering a displacement 
u m that decreases rapidly towards zero with increasing distance between t and n. Then, 
only one of the displacements u and one of the A F s in eq. (10) are markedly different 
from zero, and: 


M = c£[l - cos(27tk*« w )| 


( AF ^ 
1 + ^ 
^sO J 


( 12 ) 


The convergence of the sum depends on the contributions from large distances between 
n and t. There, u m is small and the cosine can be expanded, so that 1 -cos(27r#cn/ w ) = 
(2-7r#cn/ m ) 2 /2, and if one writes u tn as a continuous function of / - r t -r n , the convergence 
of the integral }(«»K /n ) 2 dV, °° > / > r ' 0 (» lattice constant), will assure a finite value of 
M. We see that for a large class of defects where (r 7 ) -2 (see ch. 17), M remains finite. 
Others, e.g., straight dislocations, small-angle grain boundaries and stacking faults, cause 
e -2A/ to decrease to very small values for large crystals, and the total scattering is diffuse 
although it will be concentrated, but not in a 5-like fashion, near the original Bragg 
positions (line broadening). 

The distinction between Bragg intensity [eq. (9)] and diffuse intensity (line broaden- 
ing and scattering far away from Bragg peaks), discussed here for systems with a random 
distribution of equivalent defects, will not always be possible experimentally, as the 
width of the Bragg peak is also affected by the resolution function of the instrument. 
Correlations in the arrangement of defects may reduce long-range displacements and 
modify the expected effect, as will the presence of different types of defects. Figure 1 
shows schematically, how the scattering of an ideal crystal is modified by defects. 


2.2. X-rays and neutrons 

Apart from electrons (see ch. 11), X-rays (and y-rays) and thermal neutrons are 
frequently used for structure determination and the study of defects as they provide the 
appropriate range of wavelengths for such investigations. Table 1 gives some of the 
important properties of both types of radiation. For the wavelengths of interest in normal 
diffraction work (0.5 to 20 A), corresponding photon energies are in the range of about 
1-40 keV whereas neutron energies are between 0.85 and 400 meV. Excitations in 
condensed matter (phonons, magnons, etc.) are in the range of a few meV and above. 
The relative energy change of X-rays scattered inelastically (with energy loss or gain) by 
any sample is then very small (say < 10 -6 ) and cannot easily be resolved (except if Moss- 
bauer sources or synchrotron radiation combined with backscattering techniques are 
used). In contrast, neutrons can experience an appreciable relative change in energy, so 
that elastic (no energy change) and inelastic scattering can be distinguished (see below, 
§2.4). Another important difference arises from the magnetic moment of the neutron 
which interacts with the local magnetization density. This leads to magnetic scattering 
which has very important applications as a sensitive probe in the study of magnetic 
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Fig. 1 . Scattering from a crystal containing defects. Bragg positions of a perfect crystal are indicated by thin 
vertical lines. 


substances (see §2.3). Finally, absorption differs appreciably for the two types of 
radiation. (Measurements of absorption can also yield considerable insight in the structure 
of matter, e.g., in EXAFS = extended X-ray absorption fine structure, see § 5.4.). 

The linear absorption coefficient is defined by 

ln(/// 0 ) = -I t D s , (13) 

where / 0 and I are the intensities of incident and transmitted beam, respectively, and D % 
is the sample thickness. The absorption coefficient has the dimension of 1/length or 
area/ volume, and can be understood as a total macroscopic removal cross-section, as is 


Table 1 

Some properties of X-rays and neutrons. 


Property 


Value" 

For X-rays (photons) 

For neutrons 

mass m [kg] 

0 

1.675 x 10' 27 

momentum 

h/X 

h/X = mv 

energy 

hc/X 

h 2 / 2mA 2 

scattering length 

Zf g (for k — > 0) 

^(nuclear) 

absorption 

« Z*X 3 (strong) 

“= A (mostly weak) 

speed v[mA] b 

c 

437 E' n 

wavelength A [A] b 

\2A/E 

3956/v 

magnetic moment 

0 

— 1 .913 fi a 

frequency vls“'] b 

2.42 x 10 17 E 

2.42 x 10" E 


‘ h = Planck's constant, c = velocity of light, f 0 = classical electron radius, fj. n = nuclear magneton. 
b For photons, E in keV; for neutrons, E in meV. 
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common in neutron scattering. In the X-ray literature, a mass absorption coefficient is 
defined by S/p with p = density. Its dimensions are area/mass, and values are indepen- 
dent of the thermodynamic state of the sample material. 

All beam-attenuating processes (including coherent and incoherent scattering) are 
included in X,, but for X-rays the excitation of fluorescence radiation can be singled out 
as the most important true absorption mechanism. When the energy of the incident 
X-rays approaches a resonance energy in the electronic states, an absorption edge is 
observed, corresponding to the excitation of electrons in the K, L, etc., levels (see fig. 2 
where the mass absorption coefficient is shown as a function of wavelength). The 
electrons associated with the absorption move out of phase from the others and a 
destructive interference results that reduces the atomic scattering factor by the equivalent 
of twice the number of resonance electrons (e.g., at least a total of four at the K edge). 
The atomic scattering factor has to be corrected according to 

/a - /m + f' + if"' (14) 

where /' and f" are the real and the imaginary part of the Honl corrections. The wave- 
length dependence of/' and /"is shown schematically in fig. 3 (see International Tables 
[1952-74], vol. III. pp. 214-216 and vol. IV, p. 148, and Sasaki 11989, 1990] for 
tabulated values). As synchrotron radiation sources now provide strong X-ray beams with 
continuously tunable wavelength, diffraction work in the neighbourhood of absorption 
edges can take advantage of the variation of specific scattering factors. 



X 


Fig. 2, Mass absorplion coefficient X f /p for X-rays as a function of wavelength near the K and L edges of an 
element (schematic). 
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Fig. 3. Variation of real and imaginary components/' and /"of the Honl corrections as a function of X-ray 
wavelength near the K absorption edge (at A K ) of an element. (After Cowley [1981].) 


Table 2 gives a comparison of linear (and mass) absorption coefficients for a few 
metals. According to eq. (13), 2,£) s = 1 corresponds to a reduction in intensity by a factor 
1/e which gives an estimate for the typical thickness of a sample. Table 2 shows that 
even Cd, a material frequently used for shielding purposes in thermal neutron work, has 
a smaller absorption for neutrons than most common metals for X-rays. On the other 
hand, lead, a good shielding material for X-rays, is almost transparent to neutrons. (See 
Bacon [1975] and Schwartz and Cohen [1987] for more details on the different 
properties of X-rays and neutrons.) 

For high-energy X-rays (about 100 keV or more), the absorption by matter is very 


Table 2 

Linear absorption coefficients of some metals, for neutrons (true absorption, S/ and X-rays (/,, total 
removal) and corresponding mass absorption coefficients (values from Bacon [1975], and Kostorz and 

Lovesey [1979]). 


Metal 

Neutrons 

(cm' 1 ) 

1.8 A 

X-rays 

(cm ') 

1.54 A 

Neutrons 

S./p 

(m 2 /kg) 

1.8 A 

X-rays 

S/p 

(m 2 /kg) 

1.54 A 

Be 

0.001 

2.7 

0.00006 

0.150 

A1 

0.0014 

131 


4.86 

Cu 

0.326 

474 


5.29 

Cd 

119 

2000 

1.29 

23.1 

w 

1.22 

3320 

0.0061 

17.2 

Au 

5.72 


0.029 

20.8 

Pb 


2630 

0.00005 

23.2 
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weak, and the incident beam is mainly attenuated by Compton scattering. Diffraction 
experiments with the 412 keV -y-radiation from radioactive gold were initiated in 
Grenoble in the sixties (see Schneider [1981] for a detailed account) as a means to 
study the mosaic structure of large single crystals, of sizes comparable to those used for 
monochromators of thermal neutrons or in neutron scattering experiments. As the 
-y-wavelength is small (0.03 A in this case), the Bragg angles are also correspondingly 
small (~ 1°). Thus, lattice tilts may be monitored directly for large crystals, also, owing 
to the easy penetration of windows, etc., during crystal growth (see Breuf.r and 
Bleichert [1993] and references therein for further applications). Diffractometers in the 
100-150 keV range may now also be found at suitable synchrotron radiation facilities 
(see, e.g., Bouchard etal. [1993]). 

2.3. Magnetic scattering 

Although there is a measurable interaction between photons and electronic magnetic 
moments, it is too small (cross-sections about six to eight orders of magnitude smaller 
than for Bragg scattering, see De Bergevin and Brunel [1972]) to be used conveniently 
in the study of magnetic structures. 

The magnetic moment of the neutron has a value of yp n (see table 1) with y = 
-1.913. Examination of the dipole interaction between neutron and local magnetic 
induction (see Gurevich and Tarasov [1968], Lovesey [1984]) shows that of the 
Fourier transform of the local magnetization density, M(k), only the component M s (k), 
perpendicular to the scattering vector k, contributes to the scattering. Therefore, a 
magnetic interaction vector may be defined by 

«m =M 1 (k)/|M(k)|. (15) 

The magnetic scattering length , commonly denoted by p, is a vector. 

p = pq M . (16) 

with p proportional to IAf(ic)l. If we consider magnetic scattering due to unpaired spins 
(without orbital moments), p is given by 

P(x) = M^/mag (•<)■ (17) 

m 0 c 

Here, e 2 /m ( f 2 is the classical electron radius / 0 known from X-ray scattering [ f 0 = 
2.8 x 10 -15 m, see eq. (A. 37), ch. 11], 5 is the total number of unpaired spins, and / mag (/c) 
is the Fourier transform of the spin density, normalized to / mag (0) = 1 . As the number p 
of Bohr magnetons (p B = eh/m 0 c) equals p =25, we can write 

p( K )[l0 _l ‘ , m] = 0.27/i/ roag (K). (18) 

For not too large values of k, p is comparable to the values of the nuclear scattering 
amplitude b (for the pure elements Fe, Co, Ni one has 6 = 0.96, 0.28, 1.03 and p(0) = 0.6, 
0.47 and 0.16, respectively, all in units of 10~ 14 m). The magnetic form factor / mag (ic) 
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falls off more rapidly with sin 9/ A than the atomic form factor for X-rays (ch. 11), as 
only a few electrons in an outer shell contribute to / . 

The total scattering of a magnetic material will show a combination of nuclear and 
magnetic scattering and will also depend on the polarization of the incident beam. Figure 
4 shows the scattering geometry, assuming a fully polarized beam with the polarization 
vector parallel to the z-axis (up: +, down: -). If the scattered beam is analyzed along the 
same direction, four different scattering cross-sections can be defined (++, + -,-+ and 
— ). In an otherwise perfect crystal, the nuclear unit-cell structure factor, F s (ic) [see eq. 
(A. 32), ch. 11]), can be combined with the magnetic structure factor, 

f m(*) = X«M™^ ex P( _2wiK * r <»)' O 9 ) 

m 

where q Mm [eq. (15)] is defined according to the direction of the moment at site m, and p m 
at each site is given by eq. (18). This yields IF S +F^I 2 for (+ +), IF^P + IF^I 2 
for (+ -) and (-+), and l^-F^I 2 for ( — ), where the F ^ are the Cartesian compo- 
nents of F 

An unpolarized beam can be assumed to be composed of 50% positive and 50% 
negative polarization, and the scattering cross-section (per atom) without polarization 
analysis is dcr/dfi= IF^I 2 + IF M I 2 . For a simple ferromagnet, q ^ is the same for all sites, 
and with =q M F aag [see eq. (19)]: 



Fig. 4. Scattering geometry for magnetic scattering. The incident neutrons are polarized along the z-axis. 
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where a is the angle between k and M (see fig. 4). By varying a (via the external 
magnetic field or the scattering geometry), nuclear and magnetic contributions can be 
separated. Equation (20) also applies to antiferromagnets if we define a relative to the 
direction of the moment at site one and write / r mag = 2 m a )n p m exp(-27ri K m r m ) where cr m 
is either +1 or -1 depending on whether or not the moment at site m is parallel or 
antiparallel to q v Figure 5 shows the consequences of ferro- and antiferromagnetic order 
in the bcc lattice (a * 0). Additional peaks (“magnetic peaks”) occur for antiferromagnetic 
order whereas in ferromagnets the nuclear peaks are enhanced. 

If a polarized beam is used, the cross-section for scattering without polarization 
analysis is the sum of, e.g., (+ +) and (+ -) scattering and contains an interference term 
between nuclear and magnetic scattering with F s and F ^ appearing unsquared; 



where P 0 is a unit vector indicating the direction of polarization. “Flipping” P 0 from +1 
to -1, we have a very sensitive method to measure and consequently F mag (ic), p„,(k) 
or / mag (if). Equation (21), if generalized as indicated by eq. (8) for structural disorder, is 
also the basis for a separation of structural and magnetic disorder in alloys. 

Finally we mention the original result of Halpern and Johnson [1939] for paramag- 




Fig. 5. Schematic neutron diffraction patterns from a bcc poylcrystallinc (a) fcrromagnet and (b) antiferro- 
magnet. The shaded areas represent the magnetic scattering contributions which decrease with 0 because / 
decreases. (After Schwartz and Cohen [1987].) 
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netic scattering, i.e., for a randomly oriented ensemble of identical magnetic moments 
(spins 5). The paramagnetic scattering cross-section (per atom) reads 

(dS) = I S ( S + 0/oV/i g . (22) 

This scattering is similar to the Laue scattering term in diffuse scattering. Deviations 
from the simple monotonic k -dependence are of interest in both cases as they relate to 
correlations in the atomic or the spin arrangement. 

There are many more special features of magnetic scattering than this brief discussion 
can indicate (see Brown [1979], Hicks [1979] for more details) and many complex 
magnetic structures have been studied in recent years. 

2.4. Inelastic and quasi-elastic scattering 

As mentioned in § 2.2., the relative energy gain or loss of X-rays scattered from a 
sample with lattice vibrations (phonons) is very small. Near an absorption edge resonant 
Raman scattering (see Sparks [1974]) occurs just below the edge, and incoherent 
fluorescence radiation is emitted above the edge. Compton scattering is another inelastic 
scattering process of X-rays. From the conservation of energy and momentum for the 
scattering of a photon from an individual electron, the wavelength shift of Compton- 
modified radiation is (for electrons assumed at rest): 

zU [A] = 0.0243(1 - cos 20) , (23) 

independent of the incident wavelength. The scattering is incoherent as there is no fixed 
phase-relationship between the different inelastic scattering events. The relative contrib- 
ution of Compton scattering to the total scattering is given by 1 where f aj is 

the form factor (normalized to one) of each of the Z electrons of an atom. Compton 
scattering increases the background in X-ray diffraction experiments, but it can be 
eliminated experimentally (e.g., with a monochromator in the diffracted beam) or by 
calculation (see Schwartz and Cohen [1987]). Detailed study of the energy distribution 
of Compton intensities provides information on the momentum distribution of electrons 
in condensed matter (see, e.g.. Cooper [1977]), one of the few techniques also applicable 
to alloys. 

Much smaller (absolute) energy transfers can be detected by neutron scattering, either 
by analyzing the change of wavelength with a single crystal or the change of neutron 
momentum by time-of-flight methods. Here, we consider one-phonon scattering only (for 
a complete account see, e.g., Lovesey [1984] and Bacon [1975]). 

Figure 6 shows two possible scattering configurations in reciprocal space. The 
scattering vector k can be written as 

K = k i -k 0 =g + q (24) 

where q is the phonon wavevector, counted from the nearest reciprocal lattice point. If 
the wavevector after scattering is i sl , the neutron has lost energy (/c sl <k 0 ), and a phonon 
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Fig. 6. Two possible inelastic scattering events in a plane of reciprocal space involving the creation 
(k = | -k u ) and the annihilation (k = Jt s2 -k 0 ) of a phonon. 


has been created. For k s2 , a phonon has been annihilated. As the neutron momentum is 
h/X, eq. (24) states the conservation of momentum in the scattering process (X q = \/q is 
the phonon wavelength). Simultaneously, the conservation of energy, 

h 2 /2mX-h 2 /2m n X] =hv p , (25) 

defines the energy of the phonon participating in the scattering (A 0 and A s denote the 
wavelength of the incident and scattered neutrons, respectively). There are only discrete 
values of phonon frequency for a given q , and appropriate scans can be designed to 
obtain directly the phonon dispersion curves of a crystal. 

Apart from coherent inelastic scattering processes, analysis of the incoherent inelastic 
scattering of neutrons may often be interesting. True incoherent scattering processes are due 
to the interaction of the neutron spin with nuclear spins I* 0. The scattering length of the 
compound nucleus depends on its total spin which is /+ 1/2 or I- 1/2. The tabulated 
values of coherent scattering lengths and incoherent cross-sections for individual isotopes 
represent properly weighted averages (for unpolarized nuclei). Natural elements are 
frequently a mixture of different isotopes, each with its own nuclear spin. Coherent scattering 
lengths and spin-incoherent cross-sections are simply arithmetic averages, but owing to the 
random distribution of nuclei with different coherent scattering lengths over the sites of the 
sample, another ^-independent term, a i = 4-rrlb 2 - (5) 2 j , occurs in complete analogy to 
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the monotonic Laue scattering term in diffuse X-ray scattering (see §4). For neutrons, 
this part is included in the total value of incoherent scattering of an element. Incoherent 
neutron scattering cross-sections are quite large in several cases, and for coherent 
scattering experiments (see § 2.5), they may impose severe limitations. However, similar 
to the case of Compton scattering and electrons, energy analysis of the incoherently 
scattered neutrons will reveal details of the motion of nuclei. The incoherent scattering 
function can be calculated for different processes (see, e.g., Bacon [1975], Lovesey 
[1984]). If the energy transfer is centred around hi>=0, i.e., is small, the scattering is 
called quasi-elastic. In condensed matter, the motion of atoms is restricted by the 
environment, and quasi-elastic neutron scattering has become a widely used technique to 
study atomic and molecular motion, especially if hydrogen with its high incoherent cross- 
section is involved. Especially Ln crystalline solids, a study of the quasi-elastic line width 
(coherent or incoherent) may help to identify diffusion mechanisms (see § 6.2). 

2.5. Some experimental considerations 

Incoherent scattering as a source of background and absorption was already men- 
tioned in §2.4. Extensive discussions on general experimental problems in X-ray and 
neutron scattering can be found in the book by Schwartz and Cohen [1987]), and, 
more specifically oriented towards defects in crystals, in the articles by Schmatz [1973, 
1978], Some remarks will be found in subsequent chapters, relating to particular 
applications. Here, we only state a few quite general points. 

Laboratory X-ray sources with rotating anodes have been built, the maximum power 
of 100 kW (see Haubold [1975]) yielding 4x 10 16 quanta/s cm 2 sr (CuK a ). Higher 
luminosities are obtained (originally as a by-product, now from dedicated sources) from 
electron or positron synchrotrons and storage rings. Depending on the energy of the 
electrons or positrons, synchrotron radiation emerges in a wavelength range from infrared 
to -0.2 A. Synchrotron radiation is continuous and very intense ( 10 12 — 10 14 quanta/s 
mrad per 0.1% bandwidth, integrated over the vertical direction). The radiation is highly 
collimated perpendicular to the orbital plane. In the GeV region, the divergence is -lO^ 1 
rad. In the plane of the ideal orbit, synchrotron radiation is 100% polarized with the 
electrical vector parallel to this plane. There is a well-defined time structure of the beam 
with pulse durations as short as 10“ 10 s with repetition rates of 1 MHz or more. The use 
of synchrotron radiation for materials studies is currently experiencing a very rapid 
growth (see e.g. Baruchel et al. [1993]). 

Compared even with a classical sealed X-ray tube, neutron sources are not very 
powerful. Figure 7 shows a comparison of several X-ray and neutron sources according 
to Schmatz [1973]. As suggested by Maier-Leibnitz [1966], the momentum space 
density p(k 0 ) ( hk 0 , where h is Planck’s constant, is the linear momentum of a particle) is 
an adequate quantity for comparison, as the count rate Z at a detector is proportional to 
p(k 0 ) multiplied by the momentum space elements d 3 & 0 and d 3 A: s that can be optimized 
instrument parameters (within the bounds of k x , k y k z fixed by the properties of the 
source), Z p(k 0 )S(K , p)d i k 0 d i k i> where S(k, v) is the scattering law to be studied. We 
see that p(k 0 ) for neutrons is several orders of magnitude lower than the values of all 
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Fig. 7. Momentum space density p(2irk 0 ) for different X-ray and neutron sources (after Schmatz [1973]). 


X-ray sources. On the other hand, restrictions on the choice of parameters of the incident 
beam (Ak(/k Q or vertical collimation), which are common for X-rays, are less severe for 
neutrons where d 3 & 0 can be chosen more flexibly to match the resolution requirements 
for a given S(k, v). This may compensate in part for the lower values of p(k 0 ). 

Traditionally, thermal neutrons are produced by fission and subsequent moderations 
i.e., in a nuclear reactor. A steady-state thermal flux of about 10 15 n/cm 2 is virtually 
impossible to surpass as heat removal is a limiting factor. With hot and cold sources, the 
Maxwellian spectrum of neutrons may be shifted to smaller or larger wavelengths (see 
fig. 7). Pulsed neutron sources may provide a higher peak flux. If the pulsed structure of 
the neutron beam is maintained after moderation, time-of-flight experiments are advantageous 
at pulsed sources because the relevant flux at the sample will be higher than the average flux. 
One concept for the production of pulsed neutrons is based on the use of charged particles 
from accelerators (spallation). A variety of spallation sources are now operational. 
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Apart from film techniques (with a converter foil for neutrons), the detection of the 
scattered radiation is based on electronic counting circuits attached to gas or solid state 
detectors. For X-rays, gas-filled proportional “counters” and solid-state scintillation 
“counters” detect the incident quanta via the ionization of a gas or the production of 
photoelectrons by scintillation photons. The energy resolution is poor for both detector 
types, typically ~ 20% in the 10 keV range for gas detectors and -50% for scintillation 
detectors. Solid-state detectors based on electron-hole pair production in doped (Ge or 
Si with Li) or intrinsic (Ge) semiconductors have a theoretical resolution of about 1% at 
10 keV, about 200 eV in practice. 

Another important aspect for many experiments is the spatial resolution of large 
detectors. Linear position-sensitive detectors with a resolution of about 30 /xm employ a 
resistive wire, and peak heights or pulse shapes are analyzed as a function of position of 
the detected event. Two-dimensional gas-filled detectors for X-rays employ (sets of) 
mutually perpendicular wires and different electronic techniques to locate the detected 
events. The simultaneous measurement of scattering over a large solid angle is of 
particular importance for weak scattering signals but also for in-situ kinetic studies. 

Neutrons can only be detected after they have participated in a nuclear reaction with 
the emission of charged particles or y-rays. As beam dimensions are usually much larger 
in neutron scattering than in X-ray scattering, the use of large arrays of individual 
detectors is feasible without loss of resolution (and with time-of-flight resolution, i.e., 
energy resolution for each desired detector). Position-sensitive detectors have been 
developed for diffraction work, planar detectors for small-angle scattering and curved 
ones for powder patterns. 

3. Bragg peaks and vicinity 

Scattering from real crystals may occur anywhere in reciprocal space but Bragg 
peaks, though modified, will remain a predominant feature as long as an average lattice 
can be defined. A diffraction pattern, after all instrumental corrections, reflects the 
distribution of scattering matter within the (average) unit cell of a substance, and the 
atomic coordinates (location of the centres of scattering objects) as well as the scattering- 
length density distribution (electrons with X-rays, nuclei and magnetic moments with 
neutrons) can be determined from an analysis of Bragg peaks (see, e.g., Lipson and 
Cochran [19531, Warren [1969], Schwartz and Cohen [1987]). Particular Bragg 
peaks occur for different phases in a sample, and diffraction methods are thus essential 
in the study of phase diagrams and phase transitions. The orientation distribution of 
Bragg peaks for a polycrystalline sample reveals its texture (see, e.g., Barrett and 
Massalski [1980]). 

3.1. Peak shifts 

Changes in peak position may be caused by residual stresses, by faulting on certain 
crystallographic planes (Warren [1969], Wagner [1966], Cowley [1981]), or by lattice 
parameter changes as a function of alloying or of defect concentration. Whereas a lattice 
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parameter change affects all crystallographically equivalent Bragg peaks in the same way, 
the other two causes may lead to different shifts depending on the orientation of the 
reflecting planes relative to a preferred axis of the disturbance. Figure 8 illustrates this 
for a polycrystal with a surface under compression. As residual stresses and applied stresses 
combined determine load carrying capabilities, X-ray methods in this field have received 
considerable attention (e.g., Barrett and Massalski [1980], Culuty [1977]). The book 
by Noyan and Cohen [1987] gives a detailed account of procedures and applications. 

As the penetration depth of X-rays is limited and strongly depends on wavelength, 
only near-surface regions are accessible. This requires some caution in the interpretation, 
but may also be used to obtain some information on stress gradients perpendicular to the 
surface (see, e.g., Oettel [1989]). The preferred Bragg diffraction from near-surface 
regions leads to important applications in the study of epitaxial layers and hetero- 
structures (see, e.g., Chu etal. [1985], Zogg etal. [1994]), as accommodation stresses, 
their inhomogeneity and their relaxation may be determined. 

With the availability of powerful X-ray beams at synchrotron radiation sources, the 
surface sensitivity for Bragg scattering may be considerably improved (several nm 
instead of ju,m) by grazing incidence. For an incident beam which impinges on the 
sample at a glancing angle at or below the limiting angle of total reflection, diffraction 
peaks may arise from the evanescent wave travelling in the crystal parallel to the surface 
and decaying exponentially perpendicular to the surface (see Marra etal. [1979]). 



Fig. 8. Residual stress measurements with an X-ray diffractometer for a surface of a polycrystal under 
compression (schematic, not to scale), (a) Bragg’s law is satisfied for lattice planes parallel to the surface. 
These planes are further apart than in the stress-free state (Poisson’s effect). Their spacing is obtained from the 
position of the Bragg peak, (b) The specimen has been tilted, and other grains now present suitable lattice 
planes for Bragg scattering. As these planes are more nearly perpendicular to the compressive stress, they arc 
less separated than in (a), and the Bragg peak moves to higher angles, (c) The direction of the measured stress 
is given by the intersection of the circle of tilt and the surface. (After M.R. James and Cohen [1980].) 
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Evanescent scattering of X-rays and neutrons offers many new experimental possibilities 
in surface science (see Feidenhans'l [1989], Zabel and Robinson [1992], and Dosch 
[1992, 1993]). As examples for evanescent X-ray Bragg scattering we quote the studies 
on depth-resolved near-surface ordering of Cu 3 Au (Zhu et al. [1988], Dosch et al. 
[1991a] and on surface-melting of an Al single crystal (Dosch et al. [1991b]). 

Gradients of lattice constants may also occur in diffusion couples, e.g., when a plated 
sample is annealed. The (nondestructive) analysis of such diffusion profiles using X-ray 
diffraction starts from similar ideas (see Houska [1980]). 

Neutron diffraction averages over larger volumes, and several nondestructive 
measurements of internal stresses in “real” components have been reported (see e.g., 
Allen et al. [1985], Priesmeyer et al. [1994]). The method has a special value for 
“hidden” stress concentrations. 

3.2. Peak broadening and intensity changes 

In §2.1 we have already mentioned a broadening due to sample size that could be 
relevant for powders [eq. (1)]. However, for bulk samples we have tacitly ignored size 
broadening and written the Bragg cross-section as a delta-function. Thus, for a well 
collimated X-ray beam of characteristic radiation, the width of a Bragg peak should be 
controlled by the natural linewidth, since Bragg’s law yields ( 0 being a Bragg angle): 

A(2d) = 2 — tan 6. (26) 

A 

With A\ /A = 5 x 10 -4 , A(26) is about one minute of arc at 26 ~ 30° but increases dramati- 
cally for 6 — » 90°, and eq. (26) can be confirmed for large perfect crystals. However, 
quite frequently, the Bragg peaks are much broader (many minutes at moderate Bragg 
angles), and only crystals with such mosaicity will actually approach the predictions of 
the kinematical theory as may be judged from the integrated reflecting power (which 
should be proportional to IF S I 2 ). According to Darwin ([1922], see Zachariasen [1945] 
and R.W. James [1963]), the kinematical theory is valid for an ideal mosaic crystal 
consisting of small perfect domains which are tilted against each other by small angles 
(a few minutes). According to this concept, the effect of coherent multiple reflection of 
the same beam can be reduced to just the planes within one mosaic block and finally 
neglected (i.e., primary extinction plays no role). The dynamical theory of diffraction shows 
(see ch. 11) that the tolerable size of the blocks depends on wavelength. As the extinction 
length (for Bragg reflections) is proportional to (AIFJ^ 1 , a sample of given thickness or 
mosaicity will approach the case of the ideal mosaic crystal with decreasing wavelength. 

Mosaicity in a real crystal is a merely formal concept for peak broadening which may 
be caused by various defects, e.g., dislocations. Referring to crystals containing dis- 
locations in connection with Bragg peaks appears contradictory since eq. (12) and the 
subsequent discussion imply that dislocations cause Bragg peaks to disappear as their 
strain fields decrease proportionally to (/)“*. However, this would hold only for a 
random arrangement of dislocations in a large crystal. As Wilkens [1969] has shown, 
a random distribution is not a very suitable model for the calculation of X-ray line 
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broadening or of the elastic energy. A so-called restrictedly random distribution of 
dislocations was used to calculate the line broadening and the peak profile more 
realistically (Wilkens [1970, 1975]) from the kinematical theory. The linewidth Ak is 
proportional to g m p l d 2 In M e , where p d is the dislocation density and M e is a parameter 
influencing the lineshape. It is given by M e = p d 2 R e where R e is an effective outer cut- 
off radius, indicating the range over which a random dislocation arrangement can be 
admitted. Beyond this distance, long-range stresses should compensate each other (see 
Ungar et al. [1984] and Mughrabi et al. [1986] for an application to deformed Cu 
single crystals). For peak profiles with neutrons, see Swallowe and Osborn [1992], 
who studied in situ the effects of elastic and plastic deformation in Cu. 

Rocking-curve broading, on the other hand, is only very indirectly related to 
dislocation densities. It rather reflects the spread of lattice rotations around the rocking 
axis. An analysis of rocking curves taken at different angles with respect to the tensile 
axis of a deformed crystal may provide useful details on the slip geometry (for [001]-Cu, 
see Wilkens etal. [1987]). 

An analysis of the “static Debye-Waller” factor may be useful to determine displace- 
ments around defects (see Krivoglaz [1969], Dederichs [1973], Trinkaus [1975], 
Dietrich and Fenzl [1989a]). In a dilute alloy, (c«l), eq. (12) simplifies to 

M = c £[l - cos(2m<*w n )] , (27) 

n 

or even 2 M = (2 TTK*u n f if « 1. The u n are displacements caused by a defect 

taken at the " origin, and the summation extends over all cells. 

In martensite, where interstitially dissolved carbon causes a tetragonal distortion, A/ 002 
is expected to be different from M 200 or A/ 020 , and for polycrystals ln(2/ 002 // 200 020 ) = 
2^002 020- Chen et al. [1980] have evaluated such (X-ray) intensity ratios for 

martensite formed in quenched fine-grained Fe-18 wt% Ni-(0.9/1.15)wt% C. A similar 
study of cubic NbH x was performed by Metzger et al. [1982] who obtained 
k, = (0.100±0.007) A for H occupying tetrahedral sites, a value very close to theoretical 
calculations using lattice theory. In binary ordered crystals (e.g., Ni 3 Al, see Morinaga 
et al. [1988]) containing dilute solutes, the static atomic displacements in the two 
sublattices may be determined individually from the analysis of a large number of 
fundamental and superstructure peaks. For alloying elements substituting mainly for Al 
in Ni 3 Al, it was found that there is a reverse correlation between Al and Ni displacements 
and that the average lattice constant is primarily controlled by the Al displacements. 

In these applications, a separation of static and dynamic displacements relies on the 
thermal displacements known from the defect-free reference state. Neutron and X-ray 
results for pure Cu 3 Au just above or below T c , the critical temperature for appearance of 
long-range order, showed a very strong increase of the Debye-Waller exponent when T c 
was approached (Bardhan etal. [1977]). From the difference in the intensity ratios of 
fundamental and superstructure/ diffuse peaks, individual mean-square displacements for 
Au and Cu (based on an expansion of eq. (27) for M) were calculated, and large 
displacements of Au near T c were found. From a wealth of other information, including 
phonon dispersion curves (Chen etal. [1977]) near T c , the authors concluded that these 
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displacements are static and related to a high density of antiphase boundaries within large 
long-range ordered regions. Elastic neutron scattering (Lander and Brown [1985]), 
kinetic measurements of domain growth (Ludwig et al. [1988]) and new inelastic 
neutron scattering studies of phonons (Gaulin et al. [1990]) around T c have provided 
further details on this first-order phase transformation. 


3.3. Diffuse scattering near Bragg peaks 

We now turn to the scattering cross-section given formally by eq. (11), i.e., con- 
tinuous scattering for k &g. There is no general analytical solution, and approximations 
have to be introduced to obtain manageable expressions. 

As pointed out in § 2.4, thermal vibrations (phonons) are always present in a crystal, 
and their effect on the Bragg intensity is usually contained in the Debye-Waller factor. 
The diffuse intensity is called thermal-diffuse scattering (TDS) and has been treated 
extensively (e.g., R. W. James [1963], Warren [1969], Bacon [1975], Willis and 
Pryor [1975]). TDS occurs for all values of k but is particularly pronounced near 
reciprocal lattice points. This is also true for static displacements but there are some 
important differences regarding the symmetry of diffuse scattering in reciprocal space. 
The theory of diffuse scattering from point defects and clusters has been considered by 
Krivoglaz [1969], Dederichs [1969, 1973], Trinkaus [1972], and Dietrich and 
Fenzl [1989a, b]. For small defect concentrations c one obtains from eqs. (6) and (11) 
the diffuse scattering cross-section: 



with F(k) = 4/b + / a £ exp(-2rr iK*r„ a ) [exp(-2rr i k*h 0 „ ) - l] , 

n 


(29) 


where u 0n is the displacement at site n due to one defect at the origin, r n a denotes the 
position vector in the average lattice, and 4/ B ,/ A are the scattering lengths of defect and host 
atoms (one atom per unit cell assumed for simplicity), including thermal and static Debye- 
Waller factors. The value of Af B depends on the type of defect (e.g., Dederichs [1973]): 


4/b — 


fi exp(— 27r iff*/*]) 

/b ~ fk 
-/a 

/a [2 cos(27tk # /i ) — l] 


for interstitial impurities, 
for substitutional impurities, 
for vacancies 
for split (self-)interstitials, 


(30) 


where r x is the position of the interstitial (or of one of the split interstitials). For k close 
to a Bragg peak (k =g + s, and s«g), but also, with a different reasoning, for any k if 
I27 tk*b 0ii I « 1 , eq. (29) yields: 

F(k) = 4/b - 27ri/ A K*£u 0 „ exp(-27riK*r„ a ), 


(31) 
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and finally, for the Huang scattering region (s « g, Af s neglected): 


(32) 


with u{k) = £ u 0n exp(-27 r iK*r„ a ) = -M u 0 {r) exp(-27r iK*r)dV . (33) 

n V 

The displacements u are generally determined by lattice statics (see, e.g., Dederichs et 
al. [1978] and §4.1) relating displacements to forces exerted on atoms around the defect. 
For large distances from the defect, the asymptotic behaviour of the displacement is 
given by continuum theory, 

»o(' , ) k T~' ( 34 ) 

r r 

(For isotropic materials, the proportionality factor is A V/Att, where A V = volume change 
per defect in an infinite medium.) Thus, the long-range displacement field is always 
centrosymmetric, and the Fourier transform can be written as [«(#c) is periodic in 
reciprocal space, and only s = K-g is relevant]: 

u(k) = u(s) = -j- S(s)*[P«s] . (35) 


Here, S is a tensor containing combinations of elastic compliances depending on the 
orientation of S where £ is a unit vector in the direction of s. The tensor P is the dipole 
force tensor of the defect (see, e.g., Dederichs [1973], Seeger [1961]) with components 

Pjk = Pkj = X Pjn X kn • (36) 


where p Jn are the components of the forces p n and x kn the components of the position 
vectors r n . We note that the volume change caused by defects is proportional to the trace 
of P, and the lattice parameter change Aa/a can be obtained from 


AV _ 3 Aa _ c TrP 
V ~ * a ~ F c C n +2C n 


(37) 


for a cubic crystal with the elastic constants C n , C 12 . Equation (37) is valid for isotropic 
and anisotropic defects, provided the latter show no preferred orientational arrangement 
relative to one of the cubic axes (see Peisl etal. [1974]). 

Substituting eq. (35) into eq. (32), the Huang intensity will be: 


dcr'l 


= 7)t/a 7 2u#-{S(f>[P«#J 


(38) 


We see that Huang scattering has the same general k dependence as first-order TDS for 
k —>g, especially inversion symmetry around g, but the intensity distribution around a 
reciprocal lattice point is different, as the intensity depends on g m u(s). For example, an 
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isotropic defect with three equal, mutually perpendicular double forces P 0 in an isotropic 
cubic crystal has 


u{s) 


i PpS 

* 2 V c C u ’ 


(39) 


and it follows from eq. (32) that there is a zero-intensity plane for s perpendicular to 
K ~ g . Figure 9a (from Peisl [1976]) illustrates this case. Double-force tensor, defect 
configuration and lines of equal intensity in reciprocal space are indicated. In fig. 9b, c, 
d some other cases are shown where the defect has a preferred axis (“anisotropic” defect, 
in an isotropic matrix). Because the defect axis can be oriented along several crystal- 
lographically equivalent directions, the scattering cross-section [eq. (32) or (39)] has to 
be averaged over all these orientations. As the orientation of a zero-intensity plane (nodal 
surface) is determined by the axis of the defect, nodal planes will then be reduced to a 
nodal line or disappear entirely, as shown in fig. 10. The isointensity surfaces in 
anisotropic crystals look deformed and rotated but additional nodal planes can also 
appear (see Dederichs [1973]). 

As the Huang-scattering cross-section is proportional to the square of the dipole tensor 
[eq. (38)], interstitials will scatter considerably more than vacancies (e.g., for Cu, the ratio 
exceeds 20). A Huang-scattering study of a sample containing Frenkel pairs will therefore 
yield information predominantly related to interstitials. In a classical series of experiments 
on electron-irradiated A1 and Cu single crystals, Ehrhart et al. [1974] (see also 
Ehrhart and Schilling [1973], Ehrhart and Schlagheck [1974]) have analyzed the 
Huang scattering from as-irradiated and step-annealed samples in terms of single defects 
and clusters. Clustering of defects leads to an increase of Huang scattering [proportional 
to the number of clustering defects, z, if the dipole strengths superpose linearly, see eq. 
(38)]. Interstitial clusters (z~ 3) in Al irradiated with fast neutrons at low temperatures 
(4.6 K) have been reported by v. Guerard etal. [1980] who compared Huang scattering 



Fig. 9. Schematic contours of equal scattering intensity (right column) for various defect symmetries in a cubic 
lattice (central column), for anisotropic defects averaged over all crystallographically equivalent defect 
orientations. The column on the left indicates the form of the dipole force tensor. (After Peisl [1976].) 
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Fig. 10 . Types of surfaces of equal scattering intensity in reciprocal space: (a) double-drop type with one zero- 
intensity surface; (b) apple-shaped type with one zero-intensity line; (c) single-bubble type. (After Trinkaus 
[ 1972 ].) 


from these samples with single-defect scattering after electron irradiation. The analysis 
yields further details on the size (radius ~ 50 A) of displacement cascades, the number of 
defects per cascade (~ 200), overlap at larger irradiation doses and annealing behaviour. 

Strong distortions as they may be expected for clustered point defects reduce the 
range of validity of the Huang approximation. This can be seen in neutron-irradiated A1 
and is also found when clusters form upon recovery of electron-irradiated A1 and Cu 
(Ehrhart and Schilling [1973], Ehrhart and Schlagheck [1974]). Figure 1 1 shows 
results for Al. The s' 4 dependence at the larger s is expected from the theory for strongly 
distorting defects (Stokes-Wilson approximation, see, e.g., Dederichs [1973]). This type 
of scattering occurs if closely spaced scattering centres introduce large phase shifts [i.e., 
2 M becomes very large, see eqs. (10) and (12)]. The so-called asymptotic distortion 
scattering is given by (g ~ k): 


dc^j 


_ cN f 2 
~ y2 JA 
v c 


PUI 


4>(s). 


(40) 


and depends linearly on the modulus of the double-force tensor P and on g, in contrast 
to the square dependence for Huang scattering [eq. (38)]. 

The comparable magnitudes of TDS and Huang scattering by point defects suggest 
that for X-rays one should work at low temperatures. This is not always possible, as the 
defects may not be present in the state of interest. Metzger etal. [1976] and Metzger 
and Peisl [1978] have shown that Huang scattering from H in Nb and Ta can be studied 
at room temperature (where the solubility in the bcc a-phase is sufficiently high), and 
that after appropriate corrections, especially for the static Debye-Waller factor, the 
results can be interpreted in terms of single-defect scattering up to surprisingly high 
concentrations of H in Ta (11%, Metzger and Peisl [1978]). The long-range stress field 
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Fig. 11. Symmetrical part (average for +s and -s) of the scattering intensity of electron-irradiated A1 after 
annealing at 130 K, measured near the 400 reflection. Open circles: molar fraction of Frenkel pairs 5 x 10" 4 , 
k parallel to (100); squares: molar fraction 3 x KT 4 , k parallel to (100); solid circles: 5 x 10" 4 , k parallel to 
(110). (After Ehrhart and Schilling [1973].) 


shows cubic symmetry in both systems (with a possible anisotropy of less than 7%), i.e., 
the distortion caused by H atoms occupying tetrahedral sites (as known from channeling 
experiments, see Carstanjen and Sizmann [1972]) is isotropic for large distances. 
Although a Kanzaki-force model involving two forces can be adjusted to account for the 
cubic symmetry of the long-range distortions (see Bauer [1979]), more recent results of 
diffuse elastic neutron scattering between Bragg peaks have led to a modified model 
(Dosch etal. [1992], see §4). 

For otherwise perfect crystals, one remaining “defect” is the surface. Even if this is 
very smooth, it will cause diffuse intensity at each Bragg peak along a line perpendicular 
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to the surface (Andrews and Cowley [1985], Robinson [1986]). This so-called 
truncation-rod scattering is now more easily accessible with the use of synchrotron 
radiation. Surface roughness, misorientations and other surface properties will be 
reflected in changes of this scattering. For example, atomic displacements at the surface 
of (lll)-Si wafers have been revealed using this technique (Kashihara et al. [1987]). 
Truncation-rod scattering for neutrons has also been seen from the same type of surface 
(Al Usta et al. [1990]). 

4. Between Bragg peaks 

If we now extend the range of k and look for intensity at some distance from the 
Bragg peaks of a given structure, we will find diffuse intensity from TDS and diffuse 
scattering from static displacements if defects are present. This scattering (occasionally 
called structural diffuse scattering or also “Zwischenreflex” scattering) is particularly sen- 
sitive to the symmetry of displacements in the immediate vicinity of defects and can help 
to confirm (or eliminate) certain defect models. In addition to this “positional disorder”, 
alloys will also exhibit “compositional disorder”. Whereas the Bragg peaks are controlled 
simply by the average structure factor of the unit cell, diffuse scattering will reflect 
deviations from a random arrangement (short-range order or clustering). Basic interact- 
ions in alloys can thus be studied. Finally, new Bragg peaks may appear when (chemical 
or magnetic) order or (reconstructive or displacive) phase transformations occur. 

The experimental requirements for diffuse scattering between Bragg peaks are 
somewhat different from those in § 3. There, very good resolution near Bragg peaks was 
desirable. Now, as small intensities are to be monitored over a wide k range, resolution 
demands may often be relaxed in favour of higher intensity, and simultaneous observa- 
tions at various values of #c are advantageous. Multidetector arrangements, position-sensi- 
tive detectors or films (these last for a broad survey of diffuse scattering, and still as an 
important tool in the study of phase transformations) are therefore used wherever possible. 

4.1. Displacement scattering 

The diffuse scattering for #c not too close to a Bragg peak depends on the displace- 
ments in the immediate vicinity of the defect. If we remain in the dilute limit c« 1, eq. 
(28) is still valid with F(k) according to eq. (29). But eq. (31) may not be valid for all 
displacements, and we write 

F(k) = Af B + / A X exp(-27r iK*r 0 a )[exp(-27T iK*« 0 „) - 1 + 2 tt iK*n 0 „] 

« (41) 

-27ri/ A K*«(K) 

Thus, if there are any u 0n , too large to permit an expansion of the exponential, they are 
taken into account by the second term of eq. (41). If l/c*« 0n l« 1, eq. (41) reduces to eq. 
(31). The u 0n may be obtained from lattice statics (see, e.g., Dederichs etal. [1978]). A 
set of (virtual) forces (called Matsubara-Kanzaki forces) is assumed to be applied to the 
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neighbouring atoms of a defect in order to produce the displacements u 0n within the 
harmonic approximation, i.e., with the dynamical matrix of the ideal, defect-free crystal. 
In reciprocal space, we have: 

/(«) = #«)•«(«), (42) 

where /( k ) is the Fourier transform of the virtual forces / 0n , and 4>(k), the dynamical 
matrix (inverse of the Fourier transform of the lattice Green’s function), contains 
combinations of squares of phonon frequencies. In symmetry directions, <£(#c) can be 
expressed in terms of the three phonon branch frequencies and v^. The 
transition to continuum theory (v 2 °c q 2 ) has already been mentioned in §3.3. Although 
the actual displacements may sometimes be more interesting, the concept of virtual forces 
has the advantage of giving a more rapidly converging method to calculate the scattering 
of (model) defects as the relevant forces may be restricted to the immediate surroundings 
of the defect (see Schmatz [1973] and Bauer [1979] for further discussions). Figure 12 
illustrates the sensitivity of diffuse scattering between Bragg peaks to the details of the 
displacement field near the defect. Experiments with X-rays have first been reported for 
electron-irradiated A1 (Haubold [1975]) and subsequently for Cu (Haubold and 
Martinsen [1978]). These experiments clearly confirmed the dumbbell configuration in 
(100) direction (see ch. 18, §3.3 for a further discussion of the results). 

Early displacement scattering studies with neutrons have been reviewed by Bauer 
[1979]. Although the incident beam intensities are much smaller than with X-rays, 
experiments are feasible, and the energy analysis usually achieved with a chopper allows 
one to extend diffuse scattering measurements to higher temperatures. Several dilute 
alloys with low incoherent scattering cross-sections have been studied (Al, Nb, Pb are 
favourable solvents). Figure 13 shows diffuse elastic scattering contours for an Al-0.8 
at% Cu single crystal (see Bauer [1979]). The scattering at 300 K shows signs of 
precipitation, the results at 800 K are indicative of a random distribution of defects. 
Figure 14 shows the scattering cross-sections along the three main symmetry directions, 
compared with calculated values from a model with radial virtual forces acting on the 
nearest and next-nearest neighbours of the substitutional Cu solute. As the strength of the 
dipole force tensor is given by the lattice parameter change and c is known, no adjustable 
parameters remain in a nearest-neighbour model, and the forces f 0 must be -4.22 x 
1(T 10 N, directed towards the Cu atom (lattice contraction). For this and any other model 
with centrosymmetric forces (inversion symmetry), the distortion scattering must be zero 
halfway between the Bragg peaks. For a centrosymmetric defect, one thus obtains just a 
diffuse scattering contribution from Af B [see eq. (41)] at these particular positions. Figure 
14 shows that for Cu in Al, inversion symmetry is fully compatible with the results. The 
nearest-neighbour forces f 0 alone yield quite a satisfactory fit. Detailed analysis of the 
slope near k =g/2 led to the best fit indicated in the figure. Different signs for the radial 
forces /j and f 2 are not in conflict with the oscillatory behaviour of the effective interac- 
tion potential in Al (see ch. 2). 

Pseudopotential theory has been applied to Mg in Al to calculate diffuse elastic 
scattering cross-sections, and neutron scattering results obtained at 720 K (Al-3.2 at% 
Mg) along [110] compare favourably (Werner et al. [1978]). Simple virtual-force 
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Fig. 12. Calculated cross-sections for diffuse scattering by self-interstitials in Al. Solid line; <100) dumbbell, 
dashed-dotted line: tetrahedral, dotted line: octahedral configuration. (After Haubold [1975].) 

models fail in this case, as they do in Al-Zn and, even more so, in Al-Mn (see Bauer 
[1979]). Another complicated case is Pb-Bi. Alloy crystals with 2 and 4% Bi (and pure 
Pb) were studied with neutrons (see Bauer [1979]). An unexpected peak at k =£ 2 2</2 
was found, which cannot be attributed to short-range order because a similar peak would 
then be required at a distance g ul from g 2 2 (/ 2 , i.e., at g 2 ot/^- If displacements are 
responsible, the inversion symmetry one intuitively expects for a substitutional solute has 
to be perturbed. The suggested model assumes equal outward forces /j on all nearest 
neighbours and three extra forces f e pulling three neighbours in one plane closer to the 
Bi atom (an extension of the 8 -IV rule). 

In the high-temperature bcc phase of Zr-Co and Zr-Nb, strong diffuse scattering was 
found in the (llO) and (001) planes of reciprocal space (Heiming et al. [1991b]). A 
centrosymmetric model with Kanzaki forces on nearest neighbours (inward) and next 
nearest neighbours (outward) can be fitted to the results. Phonon modes (see § 6) indicate 
a tendency toward the formation of the co-phase which appears at lower temperatures by 
a periodic collapse of [111] planes along (111) directions (see, e.g., Currat and Pynn 
[1979]). Heiming etal. [1991b] conclude that the observed diffuse scattering is due to a 
superposition of all displacements for which low-energy phonons exist. 

In Nb single crystals containing small quantities of nitrogen (0.15 and 1.4 at%, see 
Rowe and Magerl [1980], Dosch and Peisl [1985]), elastic neutron scattering showed 
diffuse peaks reminiscent of the co-phase. Assuming octahedral sites for N in Nb, Dosch 
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(a) (b) 

Fig. 13. Lines of equal elastic diffuse scattering cross-section (in mbams/sr per Cu atom) for Al-0.8 at% Cu 
in the (llO) plane of the reciprocal lattice for (a) T = 300 K and (b) T = 800 K. Data in the cross-hatched 
regions have been omitted. Scale: 1 A" 1 = 2m. (After Bauer [1979].) 


and Peisl [1985] showed that their measurements could not be rationalized with a 
Kanzaki force model including only two radial force constants (nearest and next-nearest 
neighbours). As the individual elements of the dipole force tensor are known from 
independent diffuse X-ray measurements in the Huang regime (Schubert etal. [1984]), 
the values of the forces are again fixed. Using a set of three forces (for the three 
neighbouring shells) compatible with the dipole force tensor, the continuous curves 
shown in fig. 15 were obtained. The displacements of the nearest Nb neighbours sur- 
rounding N remain large (-0.5 A) in the model involving three forces and are compat- 
ible with other experimental evidence, the next-nearest neighbours are displaced toward 
the interstitial ( — 0.2 A). These displacements lead to a local approach of neighbouring 
[111] planes towards a small w-like region. Similar results have been obtained for Nb-0 
and interpreted quite similarly (Dosch etal. [1986]). Although a Kanzaki force model 
involving two forces roughly reproduces the measured diffuse scattering of Nb-D 
(deuterium is used to reduce the incoherent scattering which is much higher for protons) 
at room temperature and the forces can be adjusted to account for the (almost) cubic 
symmetry of the long-range stress field (see Bauer [1979]), more recent work by Dosch 
and Peisl [1986] and Dosch etal. [1987, 1992] shows that the fit is not perfect (see fig. 
16) and can be considerably improved by a new model which adds a finite probability 
for D to be in a mobile state, appearing delocalized over three tetrahedral sites. That the 
jump rate of the diffusing defect (D) is much larger than previously assumed, and 
increasingly so at higher temperatures, is revealed by quasielastic scattering (Dosch et 
al. [1992]). The mobile state is modelled by combining scattering contributions from a 



Ch. 12, §4 


X-ray and neutron scattering 


1143 



Fig. 14. Comparison of measured cross-sections for elastic diffuse scattering from Al-0.8 at% Cu at 800 K with 
the results of model calculations. The solid lines marked “Laue + Inc.” indicate the expected behaviour without 
static displacements (but including the Debye-Waller factor). Dashed lines: model with forces on nearest 
neighbours only; dashed dotted lines: “best fit” with forces /j on nearest and / 2 on next-nearest neighbours, 
f 2 /fi = - 0.2^j^i . The symbols represent experimental data. (After Bauer [1979].) 


modified environment around the three neighbouring tetrahedral sites involved, a 
nonvanishing occupation probability of the triangular site within this same environment, 
and an adjustment for the high mobility of the defect (see fig. 16 ). 
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Fig. 15. Diffuse elastic scattering for a Nb-1.4 at% N single crystal at room temperature for two different 
K-scans (along the lines in reciprocal space indicated by SF in the insets, crystal axis parallel [Oil]). 
Experimental results are compared with a model using two forces (dashed lines) and three forces (solid lines). 
(After Dosch and Peisl [1985].) 


4.2. Short-range order 

So far, we have considered dilute solutions of defects and assumed a random 
distribution. This approach yields simple scattering laws and is sufficient for many cases. 
However, in concentrated solid solutions, the notion of single defects is difficult to 
maintain, and additional scattering effects will occur if there is any deviation from a 
random occupation of lattice sites by the different scattering centres, i.e., short-range 
order (SRO) occurs. 

Much work has been devoted to binary substitutional alloys. If we neglect positional 
disorder, eq. (28) for diffuse scattering (i.e., K*g) can be replaced by (Kjrivoglaz 
[1969]) 

=4>)| 2 (/b-/a) 2 (43) 

V aJ/ VsRO 

for an A-B alloy of concentration c = c B . (We consider alloys with one sublattice only. 
More general expressions for several sublattices, important for SRO in intermetallic 
compounds, hydrogen in metals, non-stoichiometric compounds etc., are given, e.g., by 
Hayakawa and Cohen [1975] and Bauer [1979]. For some further discussions 
including ternary alloys see Cohen [1970] and De Fontaine [1971, 1979].) In eq. (43), 
Debye-Waller factors are again included in the scattering lengths / A ,/ B . The term lc(#c)l 2 
contains the Fourier transform of the compositional fluctuations (of component B), 

c M = ( c Bn - c B ) exp (-277 iKV„ a ) , (44) 

and c Bn is defined as in eq. (4) to be one if a B atom is present at site n and zero 
otherwise. In the limit of small concentrations, Mc(#c)l 2 -» Nc as in eq. (28). For a 
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Fig. 16. Coherent diffuse (quasi-)elastic neutron scattering I(2ttk) (energy resolution 3.8 meV) from Nb-1.7 at% 
D at room temperature (Dosch and Peisl [1986]) along a line S-F in reciprocal space as shown in the inset 
(crystal axis along [Oil]). The dashed line is the expected scattering from a static cubic defect with the correct 
long-range stress field. The solid line is the best fit for a delocalized defect (Dosch et al. [1992]). 

random distribution of B atoms of arbitrary concentration, Mc(k)I 2 ->(Vc(1-c), and eq. 
(43) then yields the well-known monotonic Laue scattering «c(l-c)(f B -/ A ) 2 . For the 
theory of alloys, deviations from a random distribution are of course more interesting. If 
displacements were negligible, Ic(k)I 2 could be obtained directly from diffuse scattering 
measurements, but quite frequently, displacements have to be taken into account. 

Figure 17 shows a classical experimental result (Mozer etal. [1968]) on a polycrys- 
talline Cu-47.5 at% Ni sample studied with neutrons. A unique possibility offered by 
neutrons has been exploited here. Some isotopes (see § 2.2) have a negative scattering 
length, and for one (or more, if one changes the isotopic composition of the alloying 
partners) chemical composition of a binary alloy, it is thus possible to obtain a “null- 
matrix alloy” with an average scattering length /=0. This eliminates Bragg peaks that 
normally, in the case of polycrystals, severely limit the analysis of diffuse scattering. The 
alloy used by Mozer etal. [1968] contained natural Cu and Ni enriched to 99% in 62 Ni. 
In fig. 17, only very small Bragg intensities are visible, but a very pronounced modulat- 
ion of the diffuse scattering indicates a non-random distribution of the alloying elements 
on the fee lattice sites. 

It is common to describe SRO in terms of the Warren-Cowley parameters a 0n 
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Fig. 17. Diffuse neutron scattering from polycrystalline Cu-47.5 al% 62 Ni, furnace-cooled from 1021°C. The 
data indicated by crosses were taken with a neutron wavelength A = 1.951 A, those indicated by solid circles 
with A = 1,024 A. The Bragg positions are marked. The dashed curve is a fit with one SRO parameter only, 
the solid curve is the “best fit” described in the text. (After Mozer etal. [1968].) 


(Cowley [1950], see e.g., Schwartz and Cohen [1987], and Welberry and Butler 
[1994] for a survey of other descriptions). They are related to lc(#c)l 2 by: 

|c(k)| 2 =c(1-c)«(k), (45) 


where «(k) = £ a 0 „ exp(-2ir iK*r 0 „ ) , (46) 

n 

• * BB 

and the a 0n are defined using “conditional probabilities” P 0n (indicating the probability 
that a B atom is at position n if an A atom is at the origin), etc. 


pBB 

r 0;> 


C-Pc 


AB 

On 


«0n = 


1 - C 


C 


(47) 
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Obviously, «oo = independent of the state of SRO, and for a random distribution a 0n = 0 
(n 5* 0). In cubic structures, the a 0n are usually labeled per coordination shell i or with 
indices Imn referring to all combinations of coordinates for positions within one shell. 

For a polycrystalline sample, the diffuse SRO scattering from eqs. (43)-(46), 
averaged over all orientations (e.g., Warren [1969]), is 



i 


sin Ittk^ 

iTTKf 


(48) 


where Z, is the number of neighbours in the ith shell with radius r ( - (the sum includes 
i — 0). The dashed curve in fig. 17 is a fit according to eq. (48) with only one a ; adjusted 
(a j = 0.131). This fit already looks quite reasonable and indicates that short-range 
clustering is present in this alloy. According to eq. (47), a positive SRO parameter means 
more BB pairs than expected from the average concentration. This “negative SRO” leads 
to a reduction of diffuse scattering (relative to the Laue scattering) between Bragg 
positions, and an enhancement near Bragg positions (including 000, i.e., k= 0). The 
opposite is true for normal (“positive”) SRO where £^<0. 

The deviations from the simple a,- fit visible in fig. 17 cannot be explained by 
contributions of more a t terms alone (except for k near zero). As discussed in §3, 
following eq. (38), displacements from ideal lattice sites introduce Huang scattering 
(symmetric around Bragg peaks and «= s~ 2 ) and “size effect” scattering (antisymmetric 
around reciprocal lattice points). Both contributions have to be considered in concentrated 
alloys, and the treatment, even for the approximation of small displacements, becomes 
rather involved (see, e.g., Warren [1969] and Schwartz and Cohen [1987]). In the 
approximation exp[-27r i/c*(w n -w n .)] = 1-2tt iK*(u n -u n ), neglecting the Huang term 
[(32), jc is no longer restricted to the vicinity of a Bragg peak], the diffuse scattering can 
be written (see Warren [1969]): 


% =Nc{l- c)(/ B - A f X ("; - 27r i ft K * r i i ) exp(-27r i kv, ) , (49) 

V all Jd i,j 

where i indicates the shell and j the position in the ith shell, and the “size effect 
coefficients” are given by 

A = fA f — + «; Vaa — ( — + 4b . (50) 

/a-/bI c ') AA A-/bU-c ') BB 

where the are defined as the average relative deviations of the distance between a /j. 
atom at the origin and a v atom in the /th shell, 

w = , i a ( l+ 44 ( 51 ) 

Equation (49), averaged for polycrystals, was used to fit the data in fig. 17 (Mozer et al. 
[1968]). The “best fit”, shown by the continuous curve, was obtained with nine adjust- 
able a, and size effect coefficients for five shells. In this case, a, =0.121 and 
a 2 = -0.008, indicating a preference for unlike second-nearest neighbours. 
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In a more extensive study, Vrijen and Radelaar [1978] investigated Cu-Ni alloys 
containing 20-80% Ni and were able to quench from thermal equilibrium at various 
temperatures prior to the neutron scattering measurements at room temperature. 

Linear size effect corrections, frequently limited to nearest neighbours, were also used 
to separate SRO and size effect scattering from single crystals, exploiting the different 
symmetry of the a,- and /3,-terms in eq. (49) (see Sparks and Borie [1966], Moss and 
Clapp [1968] and Schmatz [1973] for surveys of early experimental results), until 
Borie and Sparks [1971] and Gragg and Cohen [1971] extended the evaluation to 
include size-effect contributions up to second order, i.e., starting from 

exp[-27r iK*(«„ - «„,)] - 1 - 2rr iK*(«„ - u n .) - 2t7 2 [k‘(m„ - u n ,)f . (52) 

The “Borie-Sparks approach” is fully described in textbooks (e.g., Schwartz and 
Cohen [1987] and Cowley [1981]). For cubic crystals, the total diffuse intensity, 
divided by Nc(l -c)(/g-/ A ) 2 (the result is then given in “Laue units”), can be written in 
this approximation as (see Schwartz and Cohen [1987], a corresponding treatment for 
hexagonal crystals is given by Khanna [1989]) 

Jd = h + h \Q x + hiQy + hQz + h l R x +h%Ry+ + Wh^xy + !hh S yz ^ 3 ) 

+ , 

where I a is the short-range order term, 

/« = X a inm exp[-27T i (/*,/ + f^m + V)] (54) 

I,m y n 

(with l,m,n from to +°°), ft,,/ i 2 ,ft 3 are the components of the scattering vector in 
units of the reciprocal lattice vectors ft, [see eq. (A.17), ch. 1 1] and the l, m, n coordinates 
are measured in units of the lattice vectors a,. The functions Q are the size effect terms 
containing corresponding components of the displacements , 8 Imn [see eqs. (50) and (51)]. 
Similarly, the R and S functions contain pair probabilities and squares and products of 
displacements. All functions Q, R and S also contain ratios of atomic form factors, 
/a/(/b-/aW (/b~/a)> an d only if these are constant, Q, R and S are periodic functions 
in reciprocal space. Figure 18 shows schematically the three contributions along [ft, 00]. 
Using the symmetry properties of Q, R and S, it is now possible to construct a minimum 
volume in reciprocal space where the diffuse intensity must be measured in order to form 
appropriate sums and differences for certain sets of reciprocal lattice points that will 
yield the various terms of eq. (53) separately (see Schwartz and Cohen [1987]). 
Figure 19 shows the minimum volume in reciprocal space for fee crystals. It corresponds 
to one eighth of a unit cell in reciprocal space and has been placed in a region accessible 
in reflection geometry. The method was first applied to diffuse X-ray scattering from a 
clustering system (Al-5 at% Ag) by Gragg and Cohen [1971], 

Apart from the fact that the linear combination of the different e^ v cannot be 
separated, the assumption of constant ratios of the atomic form factors is the main 
limitation of the Borie-Sparks approach if X-rays are used (see Tiballs [1975] and 
Georgopoulos and Cohen [1977a, 1979]), whereas in this respect, it is ideally suited 
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Fig. 18. Schematic of the diffuse intensity components according to eq. (53) (after Schwartz and Cohen 
[1987]). The intensity scale is not the same for the three components. Continuous line: I a , dashed line: h { Q x , 
dotted line: h*R x . 

for neutron scattering. To exploit the substantial variation of the atomic form factors 
across the experimental tc range for the X-ray case, Tiballs [1975] has proposed to write 
the functions Q, R and S of the Borie-Sparks approach as products of the K-dependent 
ratios of form factors and new functions Q\ R' and S' that are strictly periodic in 
reciprocal space. There are now 25 Fourier series instead of 10 in the Borie-Sparks 
method. Georgopoulos and Cohen [1977a, 1979] have developed a separation method 
based on this approach. The procedure is considerably more laborious than for the 
Borie-Sparks method. From a detailed error analysis the authors conclude that the extra 
effort may be worthwile, especially for clustering systems. Apart from more reliable 
values for a ( , the method also yields individual (AA, BB, AB) values for the displace- 
ment terms. With the availability of continuously tunable X-ray wavelengths at synchro- 
tron radiation sources, the anomalous scattering variation of the atomic form factors 
offers additional advantages since with a proper choice of / A , / B , the thermal diffuse 
background may be removed and a reliable separation of short-range order and individual 
displacement terms is achieved (Ramesh and Rameseshan [1971], Ice et al. [1992], 
Reinhard et al. [1992], SchOnfeld et al. [1994]). 

From a given set of SRO parameters, the local atomic arrangement in SRO structures 
can be simulated on a computer by rearranging many thousands of atoms on a given 
lattice until sufficient agreement is obtained with the SRO parameters (Gehlen and 
Cohen [1965], Gragg et al. [1971]). The simulated structure can then be analyzed in 
terms of specific atomic arrangements. It is not yet possible to completely simulate local 
atomic arrangements and displacements, though first attempts have been reported (Kyobu 
et al. [1994]). If the set of SRO parameters belongs to an equilibrium state, they may be 
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(a) (b) 

Fig. 19. (a) Typical volumes in reciprocal space required to separate the diffuse intensity from a fee crystal into 
components due to local order and due to atomic displacements (to second order in the displacements), (b) As 
(a), after symmetry of intensity in reciprocal space across the planes A, = hj has been employed. (After 
Hayakawa el al. [1975].) 


used to calculate basic alloy parameters (i.e., effective pair interaction potentials V lmn ). 
Apart from approximate analytical methods (Cowley [1950], Krivoglaz [1969] and 
Clapp and Moss [1966, 1968], Masanskii et al. [1991], see Reinhard and Moss [1994] 
for an assessment), exact numerical calculations are now feasible. The inverse Monte 
Carlo method proposed by Gerold and Kern [1987] starts from an equilibrium model 
crystal (obtained using the Gehlen-Cohen method) and makes use of the fact that for any 
equilibrium quantity of the system, the average over many fluctuations (weighted 
according to conditions of detailed balance) of such quantity must be zero. From many 
(virtual) pair exchanges in the model crystal, one thus obtains a set of effective pair 
interaction potentials V lmn (defined by the difference of V^ n + and 2vf^ n ). These 
may in turn be used in Monte Carlo simulations for other situations (see fig. 20 for a 
schematic diagram). An inverse cluster variation algorithm has also been introduced 
(Solal etal. [1988]). 

A brief survey of experimental results must start with the Cu-Au system, as this is 
certainly the most extensively studied binary system, providing several SRO (and long- 
range order) effects that have been used to test theoretical ideas at various stages (see 
e.g., Moss and Clapp [1968], Krivoglaz [1969], Moss [1969], Cowley and Wilkins 
[1972], Moss and Walker [1975], De Fontaine [1979]). Bardhan and Cohen [1976] 
measured the diffuse scattering of Cu 3 Au in a volume of reciprocal space at six temper- 
atures ranging from 2°C above T c for long-range order (~ 394°C) to 930°C, and used the 
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Fig. 20. Schematic of the steps involved in the inverse Monte Carlo method (IMC) and the Monte Carlo 
method (MC). GeC = simulation of a model crystal according to Gehlen and Cohen [1965]. (After Gerold 
and Kern [1987].) 


Borie-Sparks approach to separate the corrected, absolute intensities (including detailed 
error analysis, also for higher-order displacement terms, see Gragg etal. [1973]). More 
limited measurements with smaller temperature intervals indicated discontinuous changes 
in slope of the diffuse intensity versus temperature near specific-heat anomalies reported 
in the literature. The SRO scattering separated from complete measurements at selected 
temperatures revealed several new features, especially a change of symmetry of the 
separated short-range order scattering from the expected diffuse peaks at 100 and 110 
(Ll 2 is the long-range ordered structure below T c ) to a concentration of SRO scattering 
around 555 near 930°C, see fig. 21a. This type of scattering would result e.g. from 
alternating Cu- and Au-rich {111} planes. More recently, Butler and Cohen [1989] 
have compared all previous diffuse X-ray results for Cu-25 at% Au for temperatures just 
above T c and evaluated new measurements at 430°C using the Georgopoulos-Cohen 
approach. The SRO scattering component is shown in fig. 21b. The SRO parameters 
agree best with the early results of Cowley [1950]. The simulated short-range ordered 
structure contains small clusters ( 1-10 unit cells in diameter) resembling Ll 2 ordered 
domains. The individual first-order displacement terms are all negative (up to Imn = 530) 
for Au-Au and positive for Au-Cu (except for Imn = 200), although the lattice parameter 
of ordered Cu 3 Au is slightly smaller than in the disordered state. 

Apart from the Cu-Au system (see also Bessiere etal. [1982] for other compositions) 
and the Al-Ag crystal mentioned above, the diffuse X-ray scattering of several other 
alloys has been analysed with the Borie-Sparks approach; Ni-20 at% Mo (Chakravarti 
et al. [1974]), Cu-(9.1-14.8) at% A1 (Epperson et al. [1978a]), Cu-29.8 at% Pd 
(Ohshima etal. [1976], Au-18 at% Mn (FOrnrohr etal. [1980]) and Au-20.3 at% Mn 
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(a) (b) 

Fig. 21. Short-range order diffuse X-ray scattering in Laue units from Cu-25 at% Au at two different 
temperatures: (a) 930°C, (hk 0.5) section of reciprocal space, scale: 1 = 1.9 x KT 2 (Borie-Sparks analysis, after 
Bardhan and Cohen [1976]), (b) 430°C, (MO) section (Georgopoulos-Cohen evaluation, from Butler and 
Cohen [1989]). 


(Suzuki etal. [1982]), Ni-12.7 at% A1 (Epperson and FOrnrohr [1983]) and Ni-7.3/ 
10.5 at% A1 (Chassagne et al. [1989]), and Ag-15 at% Mg (Ohshima and Harada 
[1986]). The X-ray study on Cu-Al (Epperson et al. [1978a], Epperson [1979]) was 
performed at room temperature, with samples quenched from different temperatures 
ranging from 150-900°C. The apparent degree of SRO was found to pass through a 
minimum near 350°C for Cu-14.8 at% Al. The values obtained for samples quenched 
from higher temperatures do not reflect thermal equilibrium. For some of the equilibrium 
states (1580 h annealing at 150°C), a computer simulation was performed using the 
experimental a, and requiring agreement for the first six of them. The resulting structures 
were searched for specific nearest-neighbour configurations (Clapp [1971]), and fig. 22 
shows those found in all conditions investigated. In these configurations, none of the Al 
atoms are nearest neighbours. The configurations C9, C16 and C17 are particularly 
frequent in the equilibrium states. That the Borie-Sparks evaluation might suffice in 
some cases (Chassagne et al. [1989]), has been shown explicitely by Klaiber et al. 
[1987] for a non-equilibrium quenched state of Ni-9.5/9.8 at% Al by a detailed compari- 
son of the two methods mentioned and a third one based on a least-squares fitting of the 
various Fourier coefficients as originally proposed by R. Williams [1972]. A similar 
comparison for Au-25,48,75 at% Ag in thermodynamic equilibrium (SchOnfeld et al. 
[1992a]), another system where, as in Ni-Al, linear displacement terms are sufficient, 
shows that differences between the Borie-Sparks and the Georgopoulos-Cohen 
approaches are noticeable but quite subtle as the degree of SRO is very low in the 
equilibrium states accessible for investigation. The diffuse SRO peaks are at 100 
positions for all three alloys. Figure 23 shows the typical localized Ll 2 features expected, 
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Fig. 22. The nearest-neighbour configurations of A1 atoms (solid circles) around Cu atoms (crosses) found in 
a Cu- 14.76 at% AI single crystal annealed for 1 h at 650°C and quenched to room temperature. 1\vo adjacent 
unit cells are outlined. Not specially marked lattice sites are occupied by Cu atoms. The number of 
crystallographically equivalent variants of a configuration is indicated by m. (After Epperson el al. [1978a].) 


but there are no indications of larger domains at this low degree of SRO (a; « -0.08). 
The Clapp configurations C16 and C17 are four to seven times more frequent in all three 
crystals than in a random solid solution. Table 3 shows the first three effective pair 
interaction potentials Vl,nn obtained for all three crystals and used for a comparison of 
internal energies for the Ll 2 , D0 22 , D0 23 structures for 3:1 stoichiometry and the Ll 0 and 
1|0 (see Clapp and Moss [1968]) structures for AuAg. The most stable states (with one 
exception where the differences are within 0.1 meV for the structures considered) are Ll 2 
and Ll 0 . The V lmn were also used in Monte-Carlo simulations to determine critical 
temperatures for disordering, starting from completely ordered stoichiometric crystals. 
The results ( T c between 100 and 200 K) compare well with ab-initio calculations (see 
SchOnfeld etal. [1992a] for more details). In these alloys, the effective interactions are 
weak enough to employ the approximation of Clapp and Moss [1966]. The resulting 
V lmn are only slightly smaller than the “exact” values. A similar X-ray study of Ni-11.2 
at% Mo (Schwander et al. [1992]) equilibrated at 500°C, using the Georgopoulos-Cohen 
method, shows that only the inverse Monte-Carlo method gives reliable values for V lmn (V 110 
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Fig. 23. A (001) plane of Au-25% Ag modelled (a) from the measured SRO parameters corresponding to equi- 
librium at215°C and (b) with a random distribution of Au (O) and Ag (•) atoms. (From Schonfeld etal. [1992a].) 
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Table 3 

Effective pair interaction potentials (in meV) for the first three shells as determined by the inverse 
Monte Carlo method with the short-range-order parameters a tmn from a Georgopoulos-Cohen analysis (GC) 
or a Bone-Sparks analysis (BS) according to SchOnfeld etal. [1992], The errors are the standard devi- 
ations resulting from the use of five independently modelled crystals in each case. 


Shell 

Au-25.0 at% Ag 

Au- 47.7 at% Ag 

Au- 74.7 al% Ag 

index Imn 

GC 

BS 

GC 

BS 

GC 

BS 

110 

15.6(2) 

10.4(1) 

10.7(1) 

10.4(1) 

16.1(2) 

12.2(2) 

200 

2.3(2) 

-1.0(1) 

-0.1(1) 

0.7(1) 

2.2(2) 

1.0(1) 

211 

2.0(1) 

0.2(1) 

0.7(1) 

0.4(1) 

1.0(1) 

0.2(1) 

Ground 

Ll 2 

Ll 2 

Ll 2 

Ll 2 

Ll 2 

DO 22 

states 

Ll„ 

Ll 0 

Ll 0 

Ll 0 

Ll 0 

no 


is about 100 meV, i.e., larger than kT). The work of Schwander et al. [1992]) also shows 
how the change of configurational energy per unit area, upon successive passages of 
unit dislocations may be calculated for arbitrary numbers of SRO parameters and effective 
pair potentials. This quantity intervenes in the transition from wavy glide to planar glide 
in concentrated alloys (see Schwander etal. [1992] and Wolf etal. [1994] for details). 

The first applications of the Georgopoulos-Cohen method were in fact to systems 
with strong displacements, namely non-stoichiometric yS'-NiAl (Georgopoulos and 
Cohen [1977b, 1981]) and aged Al-1.7 at% Cu containing Guinier-Preston zones 
(Auvray etal. [1981]). The resulting a lmn for Al-Cu were used to synthesize a model 
crystal which was subsequently searched for Cu-rich zones. A mixture of single- and 
multi layer zones with {100} faces was found, consisting essentially of pure Cu whereas 
practically no Cu was found in the matrix. The original model for Guinier-Preston-I 
zones (Gerold [1954, 1958]) is a Cu monolayer disc about 5 nm in diameter with the 
adjacent Al planes collapsing toward the zone. The displacements found by Auvray et 
al. [1981] were also different (substantially larger but decreasing more rapidly). 

The results of Auvray etal. [1981] were essentially confirmed by Matsubara and 
Cohen [1985] in a study employing X-rays from a synchrotron radiation source (see also 
Cohen [1986]), but questioned by Bubeck and Gerold [1986] (see the viewpoint set 
edited by Gerold [1988], and Cohen [1992] for further discussions) who demonstrated 
with a simple one-dimensional model that diffuse intensities from within the first 
Brillouin zone, i.e., where the displacement terms are minimal, are required to obtain a 
reliable separation. Smaller scattering vectors than for X-rays (reflection geometry) are 
easily accessible with neutrons as sizeable samples may also be studied in transmission 
on the same instrument. For Al-1.75 at% Cu (two Cu isotopes were used to enhance and 
vary the different scattering contributions), aged for 60 min at 80°C to obtain Guinier- 
Preston-I zones, a linear scan of diffuse intensities (inelastic scattering was eliminated 
experimentally by measuring the elastic scattering on a three-axes spectrometer with an 
energy resolution ± 0.7 or ± 4 meV) covering scattering vectors [/i00] from 0.175 to 6.9 
reciprocal lattice units was analyzed (Muller et al. [1988, 1989]), and it became 
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apparent that results for I a (see eq. (53) depend on the range of h values used and that 
I a retains no structure (i.e., becomes independent of h) if the lowest h ranges are used for 
the separation (see fig. 24). Thus, there remains only a small (< 10%) probability for 
multi-layer zones. These investigations have been extended to obtain complete three- 
dimensional scattering data (Malik et al. [1996]). 

The diffuse X-ray scattering of another system forming platelet-shaped zones, Cu-Be, 
has been studied in as-quenched (Koo et al. [1988]) and aged (Koo and Cohen [1989]) 
Cu-10.9 at% Be crystals. Clusters after quenching and single and multilayer [100] zones 
after aging are identified and modelled, and (110) diffuse streaks in X-ray scattering are 
found to be due to thermal diffuse scattering (as confirmed by elastic neutron scattering 
where no streaks are found, Koo et al. [1988]). 

Neutron scattering studies of SRO have become more numerous in recent years, 
taking advantage of the low absorption cross-sections (i.e., lower scattering angles are 
accessible), the ease of energy resolution to eliminate thermal diffuse scattering, and the 
possibility of good scattering contrast also for neighbouring elements (e.g., Ni-Fe, Cu-Zn, 
etc.). Neutrons are, of course, particularly useful if magnetic SRO is to be investigated. 

The SRO study of Ni-25 at% Fe (quenched from -535°C) (Lefebvre et al. [1980, 



Fig. 24. Short-range order intensity l a for Al-Cu. x: calculated from the au, m of Auvray el al. [1981], +: taken 
from fig. 2 of Matsubara et al. [1985], incomparison with values obtained by MOllf.r et al. [1989] within 
different separation ranges, O, •: h = 3 and A, ▲: h = 2 to 5, using the quadratic approximation for A]- 63 Cu (open 
symbols) and A!- 65 Cu (solid symbols). 
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1981]) demonstrated for the first time the ideal suitability of the Borie-Sparks method 
for the analysis of diffuse neutron scattering. Furthermore, the least-squares fitting 
method developed by R. Williams [1972, 1976] which also includes second-order size- 
effect corrections, was compared with the Borie-Sparks approach. All displacements 
were found to be very small, in agreement with the small size effect in Ni-Fe. The a 0 
values were always close to one (within 7%, with an uncertainty of 6% attributed to the 
calibration alone). Effective pair potentials were calculated in the Clapp and Moss 
[1966, 1968] approximation. They show an oscillatory behaviour and a rapid decrease, 
similar to CuAu (Metcalfe and Leake [1975]). The ratios V^ot/ ^no an ^ ^ 211 /^no pl 0ce 
Ni 3 Fe well within the Ll 2 stability region, in contrast to Cu 3 Au. The use of three 
different wavelengths at a synchrotron radiation source offers another method to remove 
the thermal diffuse background, as recently demonstrated for Ni-22.5 at% Fe by Ice et 
al. [1992]. The results for a lmn of the neutron study are essentially reproduced, but in 
addition, individual displacement parameters are obtained which may be useful in the 
theoretical modelling of size-effects in these (magnetic) alloys. 

In Ni-Cr, displacements are also small, and it is sufficient to consider only linear 
displacement scattering terms. Three diffuse neutron scattering studies, one on Ni-20 at% 
Cr (SchOnfeld et al. [1986, 1988]), the other on Ni-10 at% Cr (Schweika and 
Haubold [1988]) - where the 58 Ni isotope was used to enhance the diffuse elastic cross- 
section and simultaneously reduce the incoherent scattering - and on Ni-25/33 at% Cr 
(Caudron et al. [1992]) have been published. Figure 25 shows that diffuse peaks are 
found at 1 2 0 positions and that the displacement scattering introduces only a small 
asymmetry. Using IMC procedures, very similar results for the effective pair potentials 
v lmn were obtained for all these Ni-Cr alloys, indicating that any concentration depen- 
dence of v lml , must be very small. There is also no systematic variation of V hm with 
equilibration temperature (SchOnfeld etal. [1988]). For example V 110 is 26 meV for Ni- 
11 at% Cr quenched from 833 K (Schweika and Haubold [1988]), and for Ni-20 at% 
Cr, values of 26.1(4), 29.3(2) and 28.0(2) are found for crystals quenched from 741, 828 
and 973 K, respectively. (The subsequent effective pair potentials decrease rapidly in 
magnitude, with varying signs.) The Clapp-Moss approximation also yields results 
comparable to the IMC values, with some deviation for F 1[0 , which is smaller than in 
Ni-Mo, but larger than in Ni-Al. Figure 26 summarizes (SchOnfeld [1994]) V /m;| results 
for Ni-Al (8.9 at% Al, 23.5 h at 500°C, neutron scattering), Ni-Cr and Ni-Mo. The y/y 
phase boundary in Ni-Al can only be reproduced if a concentration dependence of the 
V lmn is admitted (SchOnfeld [1994, 1995]). 

In similar neutron scattering studies for Cu-31 at% Zn (Reinhard et al. [1990, 
1993]) and Cu-22 at% Zn (Reinhard et al. [1993]) (using 65 Cu to enhance the contrast) 
equilibrium was established at 473 K or 493 K. Although several other properties had 
been related to SRO in the a-phase of the Cu-Zn system, no direct evidence was 
available. Diffuse elastic neutron scattering showed four distinct maxima in the (001) 
plane (see fig. 27), distorted by displacement scattering. The peak positions may be 
related to flat portions of the Fermi surface across the (110) directions, as indicated in 
fig. 27b. A minimum in the Fourier transform of the v, mn will lead to a diffuse scattering 
maximum. The static dielectric function b{k) has a singularity at the Fermi surface. 
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As measured 



Fig. 25. Diffuse elastic scattering in 0.1 Laue units for a Ni-20 at% Cr single crystal aged for 480 h at 741 K, 
in a (100) reciprocal plane. Experimental (as-measured) results except for shaded areas (near Bragg peaks or 
incident beam) are compared with recalculated lines of equal intensity using 21 a lmt and 26 y w (After 
SchOnfeld etal. [1988].) 


leading to a reduction of the pure ion-ion interaction. After separation of / SR0 and linear 
displacement scattering, effective pair potentials for Cu-Zn were again obtained from the 
IMC method (they show no marked temperature dependence and, again, compare 
favourably with values from the Clapp-Moss approximation). Using these effective pair 
potentials, the ground-state energies for different (hypothetical) ordered structures of 
Cu 3 Zn were calculated. The D0 23 structure was found to be more stable than other 
candidates if more than nine V lmn were used. (The D0 23 superstructure would introduce 
order peaks at 100 and 1^0 positions. Diffuse maxima near these positions were also 
found by Turchi etal. [1991a] based on first-principles concentration- functional theory.) 
Monte Carlo simulations were performed (Reinhard etal. [1990]) to estimate the critical 
temperature T c for the long-range order transition. Values of T c near 330 K were 
obtained, too low to be easily established experimentally. 

An extensive combined X-ray and neutron scattering study to elucidate the interplay 
of atomic (chemical) and magnetic short-range order has been performed by Koga et al. 
[1993a, b] on a series of three Ag-Mn single crystals. Ag-Mn shows spin-glass behaviour 
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Fig. 26. Effective pair interaction potentials for 
Ni-Al, -Cr, -Mo from neutron scattering (A) 
and X-ray scattering (x: Georgopoulos-Cohen 
evaluation, O: three-lambda evaluation, see 
SchOnfeld et al. [1994], (From Sciionfeld 
[1994]). 


for 10 to 25 at% Mn (see Kouvel [1961]), and Bouchiat et al. [1981] had reported on 
diffuse X-ray peaks near 1 j-0 and proposed several complex short-range order models, as 
D0 22 can be ruled out as the basic unit. The X-ray measurements of Koga et al. 
[1993a, b] show diffuse peaks with maxima at 1 j±S 0. After a Borie-Sparks analysis, 
there is a preference of next-nearest neighbour Mn-Mn pairs for quenched crystals 
containing 13.3 and 20.8 at% Mn, whereas the crystal containing 28.1 at% Mn shows a 
dramatic decrease of such correlations. The magnetic diffuse (neutron) scattering 
confirms that there is antiferromagnetic coupling for nearest-neighbour Mn pairs and 
ferromagnetic coupling for next-nearest neighbours. A second, longer-ranged ferromag- 
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Fig. 27. Diffuse clastic neutron scattering in 0.1 Laue units for a Cu-31 at% Zn single crystal aged for 160 h 
at 473 K. (a) Lines of equal intensity as measured in a (001) reciprocal plane (the cross-hatched areas near the 
Bragg peaks are excluded as they may be affected by the weak wings of the mosaic spread), (b) schematic 
indication of 2jfcp'°-positions (ip 10 = Fermi vector in (110) direction). (From Rhinhard el al. [1990].) 

netic coupling component has been identified, too. It is believed that this component, 
attributed to Rudermann-Kittel-Kasuga-Yoshida (RKKY) interactions (see Harder and 
Wells [1983]), is connected with the spin-glass freezing temperature which varies 
linearly with Mn concentration, whereas the locally coupled spin structure has a strong, 
“confusing” influence on the details of spin-freezing (Koga et al. [1993a,b]). The 
homologous system Cu-Mn has been the subject of numerous magnetic scattering studies 
(see Hicks [1979] and Tsunoda and Cable [1992] for a complete literature list). Here, 
the atomic SRO peak seems to be centred at ljO whereas the magnetic SRO peaks 
develop as satellites at 1 |±S 0 (Cu-25 at% Mn, polarized neutrons, Cable etal. [1984]). 
These satellites are strongly inelastic, but if atomic SRO is well-developed, an elastic 
part remains at room temperature in Cu-34.8 at% Mn (Tsunoda and Cable [1992]). It 
is suggested that with the development of atomic SRO, there is a tetragonal distortion 
and spin-density-wave clusters couple locally to (210) directions in the basal plane (as in 
Ag-Mn, D0 22 is not a compatible long-range ordered structure, and an A 2 B 2 structure 
seems to be approached). A more extensive diffuse neutron scattering study of Cu-17 
at% Mn, separating SRO and linear static displacement scattering, has recently provided 
first results on the effective pair interaction potentials (equilibrium at 483 K where all 
magnetic scattering is inelastic) (Roelofs et al. [1995]). The results resemble those 
obtained for Ni-Cr (see above), with both components just one atomic number below 
Cu-Mn, but ^222 and all V lt0 up to / = 5 are positive. 

Almost all the diffuse scattering work quoted so far concerned fee alloys. Transition 
metal carbides and nitrides of NaCl structure can also be treated as fee. There are frequently 
vacancies in the metalloid sublattice which distribute non-randomly. The long-range 
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superstructures and SRO as well as displacement effects have been reviewed by De Novion 
and Landesmann [1987], The diffuse scattering is evaluated following the same principles 
as described above. A number of elastic diffuse neutron scattering studies have been reported 
more recently, e.g., on TiC 0 64 , TiCo.76' TiN 0 82> NbC (h73 and NbC 0 gj single crystals (Priem 
et al. [1989a]), TiN 082 (Priem et al. [1989b]) and on NbC 073 (Beuneu el al. [1990]). 

No complete SRO separation is known for any other structures, except for the recent 
X-ray work on bcc Fe-47 at% Cr exploiting the anomalous scattering when tuning the 
wavelength close to the Fe and Cr K edges (Reinhard et al. [1992]) and a third 
wavelength minimizing SRO and linear displacement scattering. The alloy shows 
clustering (e.g., a m =0.16), and the effective pair interaction potentials v, mn determined 
from IMC calculations and from the Clapp-Moss approximation agree well with the 
calculations of Turchi et al. [1991b] based on the generalized perturbation method of 
Ducastelle and Gautier [1976] combined with the Korringa-Kohn-Rostoker coherent- 
potential (KKR-CPA) approximation, see, e.g., Turchi et al. [1988]. 

Diffuse neutron scattering in (magnetic) bcc Fe-13.5 at% V was measured along a 
few reciprocal lattice lines and fitted with SRO parameters and effective pair potentials 
in the Clapp-Moss approximation (Cable et al. [1989]). This system shows a tendency 
towards ordering (/H'-CuZn type), and the fitted V lmn agree well with results of a self- 
consistent field KKR-CPA treatment of the electronic structure including magnetic order 
within the local spin-density approximation (Staunton et al. [1990]). 

Diffuse neutron scattering of a quenched /3-Cu-40 at% Zn crystal was measured in a 
volume of reciprocal space and evaluated according to Borie and Sparks including 
second-order displacement scattering (Zolliker et al. [1992]). The /?' superstructure 
peaks were already well developed and were excluded from the analysis. This non- 
equilibrium state shows an almost uncorrelated distribution of surplus Cu atoms on the 
Zn sublattice of the B2 structure. 

Some high-temperature elastic neutron scattering work on Fe-30 at% Al showed 
broad diffuse peaks that were evaluated as long-range ordered domains (Hilfrich el al. 
[1991]) but might deserve a more elaborate investigation in terms of SRO. Finally, 
liquid-quenched polycrystalline dilute Fe-Mn and Fe-Ti were studied by neutron diffuse 
elastic scattering (polarized and unpolarized neutrons) to characterize atomic SRO and 
magnetic ordering (Parette [1991]), indicating a clustering tendency for Mn and 
displacement scattering only around Ti solutes. 

An extension of the Borie-Sparks and Georgopoulos-Cohen methods to multiple 
sublattices has been presented by Hayakawa and Cohen [1975]. A recent diffuse X-ray 
scattering study (synchrotron radiation) on Hg 0 8 Cd 0 2 Te (zincblende structure) with two 
wavelengths, one near the HgL m edge, revealed small ordered regions with local 3:1 
Hg:Cd stoichiometry and second-order displacement effects from the mixed-cation 
sublattice (Quintana and Cohen [1993]). 

5. Near the incident beam 

This section will be mostly concerned with the region around the origin of the 
reciprocal lattice, i.e., k «g m where g m is the modulus of the smallest reciprocal lattice 
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vector. This field is commonly called low- or small-angle scattering which means diffuse 
scattering for small k, not just Bragg diffraction at small angles as in the electron 
microscope or with y-rays. After a summary of the principles of small-angle scattering, 
some applications to the study of inhomogeneities in alloys and of defect clusters will be 
presented. Finally, some other effects and methods for k ~ 0 will be discussed. 


5.1. Small-angle scattering 

Small-angle scattering (SAS) is caused by the variation of scattering length density 
over distances exceeding the normal interatomic distances in condensed matter. If K max is 
the maximum value of the scattering vector accessible in a SAS experiment, details on a 
scale smaller than about 1/K max will not be resolved, and the discrete arrangement of 
scattering centres (atoms or nuclei) can be replaced by a continuous distribution of 
scattering length over volumes of about 1/ . Sums may then be replaced by integrals 

(cf. eq. (AlO.b), ch. 11), and the theory of SAS specializes on the evaluation of these 
integrals for a variety of cases. For a full account of the theoretical principles, developed 
for X-rays but equally valid for neutrons, see Guinier and Fournet [1955], Beeman et 
al. [1957], Guinier [1963], Gerold [1967] and Porod [1982], for neutrons see 
Schmatz [1978] and Kostorz [1979], The basic expression for the SAS cross-section 
is 



|2 


J Pf( r ) exp(-27r \ier)d 3 r\ 
v. 


(55) 


where the integration extends over the sample volume, V s , and p f (r) is the scattering 
length density, locally averaged as indicated above. It is also useful to write 

p c {r) = Ap ( (r) + p f , (56) 

where p f is averaged over distances much larger than 1/K min , and K min is the smallest k 
value accessible in the experiment. For the k range considered, only Ap^ r ) will contrib- 
ute to the scattering, i.e.: 

2 

J [Pf ('')] exp(-27j- i#<*/-)d 3 #- . (57) 


(a \ 

do - 

k d/2. 


Equations (55) or (57) may be used as a starting point for analytical or numerical 
calculations of scattering for model distributions. With the help of the equations of § 2.3, 
magnetic SAS may also be included for the case of neutrons. 

The simplest and very widely used approach to SAS is based on the two-phase 
model, assuming small particles with a homogeneous scattering-length density p fp 
embedded in a homogeneous matrix of scattering-length density p fm (p fm = 0 covers the 
case of small particles in vacuo). If there are N p identical particles without spatial cor- 
relations (dilute limit), but - if they are anisometric - all aligned in the same direction: 
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ydfl; 




(58) 


where V is the volume of one particle and S(k), the single-particle scattering function , 
is given by: 


S(k) 


1 


J exp(-2ir i«*r)d 3 


(59) 


S(k) can be calculated for a variety of particle shapes (see Porod [1982]) and depends 
on the direction of k relative to any given axis, except for spheres. For all other particle 
shapes, S(k) has to be averaged for a given orientational distribution. Only if the 
orientational distribution is isotropic, k can be replaced by k for anisometric particles. 

In many real scattering systems, the single-particle approach is not valid as particle 
sizes and possibly particle shape as a function of size may vary and spatial as well as 
orientational correlations must be considered. Generalized cases can only be treated by 
finite-element methods, but several approximations have been suggested, e.g., evaluation 
of size distributions of spheres or other uncorrelated particles of known shape, or linear 
distance distributions of polydisperse spheres (Guinif.r [1963]). These problems are 
extensively discussed e.g., by Porod [1982], Glatter [1982] and Pedersen [1993]. If 
eq. (58) can be used (possibly after correction for interparticle interference), the 
evaluation of SAS yields important parameters of the scattering systems. We summarize 
some general properties. 


a. Extrapolation to k = 0. If the extrapolation can be performed reliably (experimental 
problems at very small angles may interfere), eq. (58) yields [because S(0)= 1]: 


'do r 

v d n 


\ 

(*-* 

) 


0) = N p ( Ptp - P(m fv p 2 . 


(60) 


Three parameters determine d<r/d G(k —> 0): number of particles (i.e., particle density 
n p = Ny/ V s ), scattering contrast and particle size. Combined with information from other 
parts of the scattering curve, they may be obtained separately. 

b. Guinier approximation. For any particle shape, the scattering function at small values 
of *a (where a is the relevant size of the particle) can be approximated by an exponential 
function (Guinier [1939, 1963]), 

S(k) = exp(-4ir 2 K 2 /^) , (61) 


where R k is the average inertial radius (Guinier and Fournet [1955]) of the particle 
defined by 


Rl=^j v J 2 A r M r *’ 


(62) 
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where q(r K ) is the geometrical cross section of the particle perpendicular to a distance 
vector r K parallel to the direction of k, with its origin inside the particle (defined by 
J r K q(r K )dr K = 0). The scattering function reflects the anisometry of a particle as its 
decrease with increasing k is the steeper the larger R K . For randomly oriented particles: 

S{k) = S(k) = exp(-4ir 2 k 2 R%/3), (63) 

with the radius of gyration R G defined by 



Confusion often arises if the term “Guinier radius” is used. It may signify the radius of 
a sphere, R s , calculated from R G , but sometimes R G itself. For a sphere, R G = 3/? s 2 /5, 
for an ellipsoid with axes 2a, 2b, 2c, R G =(a 1 + b 2 +c 1 )/5. For other particle shapes, see 
e.g„ Glatter [1982]. 

The Guinier approximation results from an expansion of the exponential function in 
eq. (59) and a reinterpretation of the resulting series for S(k) as an exponential [eq. (61)]. 
The term proportional to k 2 in the Guinier approximation is exact for any particle shape. 
For spheres, the term with k 4 from eq. (63) is correct to better than 10%, and higher- 
order terms are in fair agreement. The Guinier approximation is therefore acceptable for 
2wR g k< 1.2. For not too strongly anisometric particles, a wider range may even be 
possible, whereas for extremely anisometric particles (e.g., rods or platelets) the higher- 
order terms will contribute at smaller values of k (see, e.g., POROD [1982]). 

c. Integrated intensity. From eq. (57): 

2 = i(^(K )d3 K = jirf, (65) 

where the integration extends over the entire reciprocal space (0 <,k and the bar 
denotes averaging over the sample. As Q represents the mean-square fluctuation of the 
scattering length density of the system which is insensitive to detailed structural features, 
it is sometimes (see Porod [1982]) called ‘invariant’. In the two-phase model: 

Q = (Pfp - Pf)(Pf - Pfm) ( 66a ) 

or G = C p (l-C p Xftp-p hB ) 2 , (66b) 

where C p = iV p V' p /F s is the volume fraction of particles. The “invariant” is, as eq. (65) 
indicates, the Fourier transform of the scattering cross-section for r = 0, i.e., a special 
value y(0) of a correlation function y(r) (see Porod [1982]) which, however, has a 
simple meaning only for dilute isotropic systems. 

d. Characteristic length. An average correlation length L c can be defined by (the bar 
denotes an average of all directions of k if the system is anisotropic): 
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with Q as defined in eq. (65), for any scattering system. In the two-phase model, L c can 
be interpreted as the mean length of all lines passing through all points in all directions. 
The average of all the chords is L p , the characteristic length, given by 

Lp = 2rrV p J kS(k) d* . (68) 

e. Porod approximation. For scattering systems with well-defined interfaces between the 
two phases (see Porod [1951, 1982]), the final slope of the scattering function is 
proportional to k~ 4 . For particles of any shape (k must be larger than the inverse of the 
shortest dimension of the particle): 

%) = (27rr 3 ^V, (69) 

where A p is the surface area of the particle. Inserting eq. (69) into eq. (58), we get: 

^ = (27T)- 3 VpfPfr - Pftn) 2 * -4 ; (70) 

the cross-section becomes proportional to the total surface area of the scattering particles. 
The Porod approximation is not valid e.g., for particles with sharp edges or comers 
(Porod [1982]), diffuse boundaries (Ruland [1971]) or rough interfaces (see Wong and 
Bray [1988], Sinha etal. [1988]). If the scattering objects or the interfaces are self- 
similar fractals (see Wong and Bray [1988], Bale and Schmidt [1984], Martin and 
Hurd [1987], Tf.ixeira [1988], Schmidt [1991]), power laws reflecting the fractal 
dimensions are found, but power law scattering deviating from a a - 4 dependence may 
also be caused by specific size distributions (see Schmidt [1982]). Interparticle 
interference effects diminish at large k. 

The advantage of the Guinier approximation is that the radius of gyration, i.e., a size 
parameter, can be determined from uncalibrated SAS data [cf. eqs. (61) and (63), 
inserted into eq. (58)]. The combination of several of the above relationships [(60) and 
(61) or (63), (66), (68) and (70)] allows one to evaluate shape, size, number and 
composition of uniform particles from precise measurements over a sufficiently large 
range of k. For example, eqs. (66b) and (70) yield the surface-to- volume ratio of 
particles, again without calibration of the measured intensities. 

Although X-rays are more widely available than neutrons, the use of neutrons in SAS 
studies of metallic systems is basically more generally applicable. As SAS is measured 
in transmission geometry, the most severe restriction for X-rays stems from the ab- 
sorption properties of materials (see § 2.2, table 2). For Cu K a radiation, the optimum 
thickness [see eq. (13), 2^D S = 1] of a pure A1 sample is D s = 76 iiim, but less than 10 /im 
for many other metals (atomic numbers 23-27 and above 41, except 55). With MoK a , 
a thickness of 713 /im is ideal for Al, but for most heavier metals, 20-100 /z m should 
not be exceeded. Although with more powerful sources or yet shorter wavelengths some- 
what thicker samples may be used, the sample volume may still not be representative of 
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the bulk material. Undesired scattering from surface irregularities (Parker [1972], Roth 
[1977], Kostorz [1976], Simon etal. [1992a]) and sample environment (windows, heat 
shields etc.) will be more perturbing with X-rays than with neutrons. Another difficulty 
arises from double Bragg scattering which can obscure the SAS effects and sometimes 
exceed them. The obvious remedy, increasing the incident wavelength above the Bragg 
cut-off, works well for neutrons but not for X-rays as absorption becomes prohibitively 
large. Double Bragg scattering may then only be avoided using properly oriented single 
crystals. Finally, inelastic scattering may be appreciable at high sample temperatures, and 
if an energy-resolved separation is required, neutrons must be used. 

The applications of SAS of X-rays and neutrons to metallurgical problems are 
numerous and have frequently been reviewed (Gerold and Kostorz [1978] and 
Kostorz [1979, 1982, 1983, 1988a, b,c, 1991, 1992, 1993], Wagner and Kampmann 
[1992], Williams etal. [1993]). From a simple verification of sample homogeneity to 
the determination of sizes, size distributions and interparticle interference effects, the 
degree of complexity of evaluation procedures varies from simple analytical methods to 
very involved computer routines. Here, only a few examples of SAS research related to 
alloys and defects can be given. 

5.2. Alloys 

Guinier [1938, 1939] was the first to report SAS results from various Al-rich alloys, 
e.g. Al-Cu and Al-Ag. and found evidence for the existence of very small coherent 
precipitates, now known as Guinier-Preston zones (GP zones). An important problem 
which still has not been solved completely is the question how phase separation is 
initiated and progresses during the early stages when large parts of the sample are still 
in a supersaturated state (see ch. 15, § 3.1). The SAS technique is sensitive to small (in scale 
and in amplitude) compositional variations, bridging and overlapping with the domains 
accessible by field-ion microscopy and transmission-electron microscopy. Many SAS 
experiments have therefore been performed to study phase separation, but also coarsening 
reactions, precipitate parameters in relation to other properties, and dissolution of 
precipitates (see the reviews quoted above). For X-rays, Al-rich alloys are most suitable, 
and very few other systems have been investigated until very recently when synchrotron 
radiation sources became available. Initial experiments with neutrons concentrated on A1 
alloys too, but many other systems have been studied during the last ten years. 

As an example, fig. 28 (Messoloras [1974]) shows a set of SAS curves obtained for 
a polycrystalline Al-6.8 at% Zn alloy (neutron measurements in a cryostat at 4.2 K). As 
the scattering did not depend on the direction of k, the SAS cross-sections (calibration 
with the incoherent scattering of vanadium) are given as a function of the modulus of k. 
A peak of the SAS intensity at a value ic m * 0 is quite common for decomposing or 
decomposed alloys containing - in their fully decomposed metastable or stable state - a 
few percent of precipitates (C p ^7% in the present example). An early appearance of a 
peak has sometimes been taken as evidence for the mechanism of spinodal decompo- 
sition (see ch. 15) but this feature of SAS alone is insufficient to distinguish between 
concentration fluctuations and well-defined homogeneous particles. Quantitative kinetic 
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Fig. 28. Neutron SAS cross-sections as a function of 2ttk for an Al-6.8 at% Zn alloy quenched from 310°C 
into ice water and aged at room temperature as indicated (from MossoloraS (19741). 


measurements with a variation of important parameters (initial concentration, quenching 
conditions, aging conditions) are necessary and must be compared with specific 
predictions of the different theoretical models. A SAS peak will of course also occur in 
a two-phase system as the first maximum of the interparticle interference function (see 
Guinier and Fournet [1955], Laslaz et al. [1977] and Glatter [1982]), and many 
other cases can be constructed, e.g., a “three-phase” system representing particle, 
surroundings depleted of the alloying element, and supersaturated matrix (Walker and 
Guinier [1953]). Figure 29 shows measurements obtained at a synchrotron (LURE-DCI, 
Orsay) with A = 1 .54 A, revealing a SAS peak at very small k for Al-6.8 at% Zn aged at 
135°C, a temperature which is definitely above any spinodal at this composition. 

Considerable progress has been made in recent years in the theoretical description of 
the evolution of thermodynamically unstable (spinodal decomposition) or metastable 
(nucleation-and-growth) systems, and a large variety of analytical and numerical methods 
have been employed (see ch. 15 and Gunton et al. L 1983], Furukawa [1985], Binder 
[1992], and Wagner and Kampmann [1992] for reviews), providing scattering functions 
for different situations and stages of phase separation, primarily for statistically isotropic 
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(a) (b) (c) 


Fig. 29. Multichannel analyzer display of SAS patterns obtained with a linear position-sensitive detector. The 
centre was covered by a beam stop, (a) Pure Al; (b) Al-6.8 al% Zn, aged for 7 h at 135°C; (c) difference 
between alloy and pure Al (long white lines in the centre are artefacts). Measuring time 100 s, intensity range 
(full scale) 51200 counts. (From Naudon el al. [1979].) 


systems. Anisotropy may be due to an anisotropic interfacial energy, leading to a 
variation of the single-particle scattering function S(k), see eq. (59), or it may be related 
to a correlated arrangement of the scattering entities (in the two-phase model, this could 
be taken into account by an interparticle interference function in eq. 58) owing to elastic 
interactions (see, e.g., Johnson et al. [1990] and Wang et al. [1993]). 

Anisotropies of a decomposing alloy will, of course, not be visible for a sufficiently 
untextured polycrystal. Polycrystalline Al-Zn alloys (on the Al-rich side) are ideal 
candidates for an experimental verification of many theoretical predictions as the matrix 
is almost elastically isotropic and the Zn-rich GP zones are fully coherent with the matrix 
and initially spherical because of a small size effect. Under these circumstances, changes 
in scattering length density can be exclusively attributed to compositional changes 
(displacement effects, small according to eq. (32), can be neglected). Some years ago, an 
oversimplified interpretation of the original paper on spinodal decomposition (Cahn 
[1961]) led to a search for special features at the spinodal line and for the “linear 
spinodal regime” where the SAS curve should show a time invariant peak at a specific 
value of k, K m , and an exponential growth/decay of the SAS intensity for all k < \Jl K m . 
The overwhelming evidence of SAS experiments (in accordance with other experimental 
and more elaborate, more realistic theoretical results, see Binder [1992]) indicates that 
for metallic systems, a linear regime is found only exceptionally (and may then be 
explained by partial phase separation during quenching), and that the search for 
singularities at the spinodal line may be pointless in a system with short-range inter- 
actions. In these cases, the spinodal singularity disappears and there is a gradual 
transition from the nucleation regime to spinodal decomposition in a region of the phase 
diagram where the free-energy barrier of a nucleus becomes comparable to the thermal 
energy, kT. A sequence of scattering curves as shown in fig. 28 is thus more typical. 
Nonlinear extensions of the spinodal theories, but many other models as well, may yield 
a similar sequence of scattering curves (see Simon et al. [1984]). 

A general analysis of the scattering curves, which is capable of distinguishing 
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different regimes of the decomposition process is a test of self-similarity (see the 
literature cited above, and Fratzl and Lebowitz [1989]). If a system evolves controlled 
by a single characteristic length scale L(t), a scaling test of the SAS function 7 (k, /) (we 
use 7 for simplicity for the cross-section) according to 

I(K,t)/Q(t) = L 3 (t)s(x,t) (71) 

where t is time, Q is the integrated intensity as defined in eq. (65), x = 2tt L(t)K, yields 
a scaled scattering (or “structure”) function six) if s(x,t) does not change with time. The 
characteristic length L(t) may be obtained from the radius of gyration of a Guinier plot 
(see eq. 63), or from the peak position, L m =(27r/c m ) _1 , or as the inverse of the first 
moment of the scattering curve. If scaling holds, these different parameters should not 
give different results - but this test is rarely made. Figure 30 shows two tests for scaling 
where k m * was used to obtain L(t). Power laws for K m and the peak intensity, 7 m , are also 
frequently tested, as they hold approximately in several model calculations (though this 
may be accidental, see e.g., Milchev et al. [1988]), 

x m (t) = Ar a , I m (t) = Bt b (72) 

with constants A, B , and constant exponents a, b. If Q is constant, b = 3 a, cf. eq. (66b). 
Scaling is usually observed for advanced stages of decomposition. The scaled structure 
function depends on the volume fraction and on the morphology of the decomposition 
product (see Kostorz [1991] and Langmayr et al. [1992] for a comparison of various 
results). There is agreement that in the dilute “droplet” region, i.e., for isolated precipi- 
tates, the exponents a, b coincide with those already predicted by Lifshitz and Slyosov 
[1961] and Wagner [1961], i.e., a= 1/3, b= 1 for coarsening under the sole influence 
of an isotropic interfacial energy. In real systems, however, coherent precipitates 
introduce strain fields, and the resulting elastic interactions may introduce different 
growth laws. Early stages are expected to follow different time laws as shown, for 
example, in fig. 31. This figure also illustrates one of the assets of synchrotron radiation 
(see Simon etal. [1985] and Kostorz [1986] for a comparison of X-rays and neutrons). 
Owing to the high intensity of incident radiation decomposition processes may be 
followed with a time resolution of about 10 s. With neutrons, a few minutes are usually 
necessary to accumulate statistically relevant data (see, e.g., Williams et al. [1993] for 
an assessment of experimental features). Further examples for time-resolved SAS using 
synchrotron radiation are the studies on decomposition (Osamura et al. [1993]) and 
reversion (Okuda et al. [1993]) in Al-Li alloys. 

The earliest stages of decomposition, in favourable cases well accessible with the 
SAS technique, are still not well understood. There is a difficulty in determining a 
(small) critical radius as even the most elaborate methods to calculate particle size 
distributions from scattering curves (see, e.g., Pedersen [1993]) will not yield reliable 
values for the lower end as the scattering curve is insensitive to the smallest particles in 
the distribution. The role of quenched-in vacancies deserves further experimental 
attention and theoretical consideration, too. For example, the more concentrated Al-Zn 
alloys studied by AcuSa [1980] with X-ray SAS show a reduced rate of decomposition 
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< a > K/K m (b) K/K m 

Fig. 30. Scaled structure function s(K/K m , t ) according to eq. (71) for (a) Al-6.8 at% Zn aged at 40°C and (b) 
Al-10 at% Zn aged at 18°C for the times (total) indicated. The continuous line is a scaling function proposed 
by Furukawa [1979], (Neutron SAS results of Komura etal. [1984, 1985].) 



Fig. 31. Maximum intensity, l m (lower curves), and position of the SAS peak, « m , as a function of aging time 
/, for an Al-12 at% Zn alloy aged in situ in a SAS experiment using synchrotron radiation at ■: 139°C, 
•: 163°C and o; 195°C (after Simon and Lyon [1989]). 
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probably because of a smaller amount of quenched-in vacancies. Computer simulations 
with vacancies have recently been initiated (see Yaldram and Binder [1991], Fratzl 
and Penrose [1994]). Besides quenching, unconventional methods of reaching non- 
equilibrium states in alloys may be helpful, e.g., neutron transmutation or ion implan- 
tation (see Delafond etal. [1981] for Zn +(+) implantation in A1 and GP zones studied 
by X-ray SAS). 

The second asset of synchrotron radiation is the possibility of tuning the incident 
wavelength and thereby exploiting the anomalous scattering terms (see § 2.2). This may 
be useful in binary alloys to distinguish between surface and volume effects (as the 
scattering varies proportional to the square of the average scattering length in the former 
case and proportional to the square of the difference in scattering lengths in the latter, see 
e.g., Simon et al. [1992b], who showed that a neutron SAS signal from a thin-foil 
isotope-enriched Fe-Ni Invar alloy (Wiedenmann et al. [1989]) was most likely not 
related to phase separation). Fratzl etal. [1992] used contrast enhancement in the bulk 
for a study of dilute Cu-Fe alloys where a nucleation regime was identified. The 
anomalous dispersion also allows decomposition in ternary alloys to be studied (see 
Lyon etal. [19155] and Simon and Lyon [1994] for details). In a disordered alloy with 
three components, there are six independent partial scattering functions, for substitutional 
solid solutions, this number is reduced to three. The SAS cross-section may then be 
written proportional to 

l(x) = |/a - /c|\a + Re{(/ A - / B )(/ B - / C )*}S AB + |/ B - / c | 2 S B b (73) 

(A, B, C are the three components, density variations are neglected). A variation of the 
scattering lengths may also be realized by using different (stable) isotopes in neutron 
scattering, but this requires that different samples be prepared under exactly the same 
conditions (see Salva-Ghilarducci etal. [1983] for an example). With X-rays, the/ A , 
/ B ,/ c variations according to eq. (14) may be used within the same sample. Experiments 
on Al-Zn-Ag, Cu-Ni-Fe, Cu-Ni-Sn, Fe-Co-Cr, Fe-Ni-W and quaternary Fe-Ni- 
Co-Mo have been reviewed by Simon and Lyon [1993] and Lyon and Simon [1994], 
As an example, fig. 32 shows results for Cu-42.5 at% Ni-15 at% Fe, obtained from data 
near the K edges of Fe and Ni. The three independent functions S Kiiii , S FeFe and S FeNi 
were obtained from an overdetermined set of equations of the type given in eq. (73), 
while S CaCa was calculated. If a two-phase description is adopted, S FeNi ~ ^NiNi^FeFe 
should hold, which is obviously not fulfilled. The best interpretation (Simon and Lyon 
[1994]) is based on interfacial segregation of Fe between coherent Cu- and (Ni, Fe)-rich 
domains. The decomposition kinetics are in good agreement with earlier neutron 
scattering results (Wagner etal. [1984]). 

These decomposing alloys in fact show a strongly anisotropic microstructure which 
is immediately obvious if a two-dimensional SAS pattern is taken from a single- 
crystalline sample (see Lyon and Simon [1992]). An example from a neutron SAS study 
of Al-6.8 at% Zn (Bubeck et al. [1985], Kostorz [1988a]) is shown in fig. 33. Two 
types of “anisotropy” may be observed. In fig. 33a, there is radial symmetry for the 
larger values of /c, i.e., the precipitate zones are spherical (single-particle scattering 
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Fig. 33. Small-angle scattering pattern (lines of equal intensity) for an Al-6.8 at% Zn single crystal with (110) 
faces perpendicular to the incident beam. Two other crystallographic orientations are indicated. The crystal was 
aged in silu at room temperature for (a) 2 h and (b) 7.5 h. The azimuthal dependence of the peak intensity is 
visible. P indicates the area of the primary beam which has been shaded off by a beamstop. Scale: 
0.1 A -1 = 27 tk. (From Kostorz [1988a]). 
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dominates at larger k). The peak intensity at a value K m ~ 0.085 A, however, varies with 
the azimuthal angle. This is due to a preferred alignment of zones along the elastically 
soft (100) directions. The alignment may be approximately described by a one-dimen- 
sional interparticle correlation function (Bubeck etal. [1985], Guinier [1956]). Analysis 
of the kinetics shows (Bubeck et al. [1985]) that the zones remain spherical until they 
reach a radius of about 25 A, transforming to oblate ellipsoids with the short axes along 
(111). A large-angle diffuse X-ray scattering study of Al-4 at% Zn (using synchrotron 
radiation) aged at 80°C (Haeffner and Cohen [1989]) shows a shape transition of zones 
modelled from the experimental SRO parameters at a radius of about 10-15 A. These are 
individually identified particles in a population, and a smaller value than obtained from 
a scattering pattern involving a large number of particles seems acceptable. The work of 
Haeffner and Cohen [1989] also shows that the zones are not faceted but assume rather 
irregular interfacial structures and high zinc concentrations in the interior. 

SAS from single crystals has been observed for many other decomposing alloy crys- 
tals, e.g. Al-Cu (Guinier [1938], Sato etal. [1988] - time resolved, strong anisotropy), 
Al-Ag (Dubey et al. [1991], combined with diffuse scattering, see SchOnfeld et al. 
[1992b], Kostorz [1994]), Cu-Co (Steiner etal. [1983]), Au-Pt (Glas etal. [1992]), 
Ti-Mo (Fratzl etal. [1993], Langmayr etal. [1994]), Ni-Al and Ni-Al-Mo (Beddoe 
etal. [1984], Calderon and Kostorz [1990], Sequeira etal. [1995]), Ni-Si (Polat et 
al. [1986]) and Ni-Ti (Cerri etal. [1990], Vyskocil [1994], see Kostorz [1994]). 

Most of these results have recently been reviewed (Kostorz [1991, 1993]). With the 
exception of Cu-Co and Ni-Si. all SAS patterns show some anisotropy caused by misfit 
strains or anisotropic interfacial energy and often preferred correlations along the 
elastically soft directions. An example where the scattering anisotropy is entirely related 
to the platelike character of the scattering objects is shown in fig. 34. In this sample, 
Guinier-Preston I zones are formed which are very flat Cu-rich platelets in the {100} 
planes. Despite the dramatic difference between the two directions, these results alone do 
not allow mono- and multilayers to be distinguished (Haeffner et al. [1988]). Thin 
platelets have also been identified in internally oxidized Cu-Fe single crystals (Fratzl 
etal. [1993], Paris etal. [1994]). If very sharp streaks or peaks are to be measured, the 
curvature of the Ewald sphere must be taken into account, as misleading results might 
otherwise be obtained for a fixed scattering arrangement. 

An example for the strong influence of elastic interactions between precipitates is 
shown in fig. 35. The mismatch between y matrix and y' (Ni 3 Al) precipitates in the 
binary Ni-Al alloy is about 0.31% whereas it is close to zero for the ternary alloy. While 
there is still an interference ring indicating a sharp distance distribution, no azimuthal 
intensity variation is seen in this case. The precipitates are spherical as confirmed by 
electron microscopy, and there is no preferred alignment. In binary Ni-Al, the cuboidal 
particles show pronounced correlations along (100). The growth rate decreases dramati- 
cally in the alloy containing Mo. 

Some more applied SAS studies with neutrons on technical alloys have been 
summarized (Kostorz [1979, 1988b]). The advantage of low absorption of neutrons may 
be used to study large components non-destructively or to apply mechanical stresses on 
sufficiently large samples, prior to or during the scattering experiment. For example, 
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Fig. 34. Small-angle scattering intensity of an Al-1.7 at% Cu single crystal in the region of larger k values, 
taken in situ at 25°C with a linear position-sensitive detector along [100] or [110], The incident beam is parallel 
to [001], and the intensity is monitored at k = 0.07 A -1 as function of aging time. (From Sato etal. [1988].) 


neutron SAS of fatigued Cu-Co single crystals (Steiner et al. [ 1 983]) and a Ni-base 
superalloy (Nimonic PE16, Sundararaman etal. [1992]) has confirmed that precipitates 
dissolve in the zones of extensive dislocation activity. In polycrystalline Cu-2 at% Co, 
uniaxial compressive deformation was found to introduce SAS anisotropy due to sheared 
coherent or semicoherent precipitates (Ancrenaz and Servant [1992]). A special 
nondestructive neutron SAS study of a very complex material has been reported by 
Gotoda et al. [1989]). From multifilamentary composite wires less than 1 mm in 




Fig. 35. Lines of equal intensity (counts in 20 min) of neutron SAS data for single crystals of (a) Ni-12 at% 
Al aged for 35 min at 560°C and (b) Ni-6.1 at% Al-9.4 at% Mo aged for 3.9 h at 610°C. The incident beam 
is parallel to [110]. Some other crystallographic directions are indicated. Scale: 0.1 A" 1 = 2mc. (From 
Calderon and Kostorz [1990].) 
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diameter, containing bundles of very thin (16 to 39 /rm) filaments of superconducting 
Nb-Ti, embedded in a normal copper matrix, short pieces were arranged with the axes 
parallel to each other across the incident neutron beam. The global pinning force density 
controlling the critical current in the superconducting state depends mainly on the volume 
fraction of a precipitates in the (3 matrix of Nb-Ti. It was possible by neutron SAS to 
determine changes of the volume fraction of a-phase particles from 1.3 to 13.4%, 
depending on the thermomechanical treatment, and to interpret the differences in critical 
current on this basis, using supporting results from transmission electron microscopy. 

Decomposition processes in amorphous alloys (with and without crystallization) have 
received particular attention. SAS (mostly of X-rays) has been reported for a variety of 
substances, see e.g., Osamura et al. [1982, 1984] for Fe-Zr, Yavari etal. [1988] for 
Pd-Ni-P and Cu-Zr, Lamparter etal. [1988] for Fe-B and Fe-B-Si-C, also using 
neutrons, Rodmacq etal. [1984] for Pd-Si using neutrons, Janot and George [1985] 
for Fe-Cr-P-C (these authors showed that surface effects are predominant for thin 
ribbons), Goudeau et al. [1987] (X-rays, anomalous scattering) and Mangin et al. 
[1985] (neutrons) for hydrogenated Cu-Ti, Schaal etal. [1986] for Mg-Zn, Maret et 
al. [1992a] for Fe-Mn-Y, Maret etal. [1992b] for Tb-Cu and Gd-Cu, and Lorentz 
etal. [1994] for Fe-Ge. Figure 36 shows an example of a rolled and subsequently heat- 
treated sample of Fe 90 Zr 10 . The drastic change of the SAS intensity is attributed to fine- 
scale separation (~60 A) into a-Fe and Fe 3 Zr/Fe 23 Zr 6 (Osamura et al. [1984]). No 
Bragg peaks are found with X-rays or electrons. 

Compositional fluctuations and clustering above the critical temperature for phase 
separation may also be studied by SAS techniques. For large-scale fluctuations, i.e., those 
relevant for the SAS signal, the continuum theory of fluctuation waves (see Krivoglaz 
[1969]) is appropriate, and one can write [starting from eq. (43)] for a binary alloy (v a 
is the atomic volume): 


do- 
d/2 " 

— (/b-/a)V + 4 ^ 2 )" 1 . 

(74) 

as 

|c(/c)| 2 = —{g” + 4i7 2 B/c 2 ) 

(75) 


a 


with g" = d 2 g/dc 2 and g = Gibbs free energy density, and B = gradient energy term. The 
free energy density g may contain, apart from the chemical term g 0 , a strain energy term 
g s taking into account the elastic strain energy around a cluster. Defining a correlation 


length £ by 


£ 2 = B 

So + Ss 

(76) 

da ^ NkT{f B -f A ) 2 
and using (0) = , ,a 

d Si v a (g 0 +gs) 

(77) 

one can write 
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Fig. 36. X-ray SAS for amorphous Fe 90 Zr 10 , •: after rolling, and after subsequent annealing at 350°C for +: 10 
min, and ▲: 50 min (after Osamura etal. [1982]). 


d n K ’ AH W 1 + 4 ir 2 f V 


(78) 


According to eqs. (76) and (77), do-/di3(0) and the correlation length diverge if 
So ~ * 8s ~ i- e - when a coherent critical point is approached. This has been confirmed 
by Schwahn and Schmatz [1978] for Al-Zn of the critical composition c = 39.5 at% 
Zn. The stable phase diagram shows a critical temperature of 351.5°C, but neutron SAS 
measurements (at temperature) indicated no special behaviour at this temperature. Upon 
further cooling, large changes occurred between 324 and 322°C, corresponding to the 
coherent critical point at 323°C. A depression of the coherent spinodal by 28 K is well 
understood on the basis of Cahn’s [1961] formulation of the elastic energy term. 

Neutron SAS on frozen-in homophase fluctuations was measured by Poerschke et 
al. [1986] in Cu-Ni-Fe alloys where the decomposition kinetics are very slow. Samples 
were quenched from several temperatures above the coherent miscibility gap. Scattering 
from large precipitates at grain boundaries and some decomposition products picked up 
during the quench have to be separated, but the remaining SAS intensity yields equi- 
librium parameters £~ 2 (T) from an Ornstein-Zemike-Debye plot [reciprocal intensity vs. 
k 2 , see eq. (78)]. A plot of £~ 2 (T) vs. T can be used to determine T’coh- the temperature 
of the coherent miscibility gap (see fig. 37). 
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Fig. 37. Temperature dependence of the square of the reciprocal correlation length for homophase fluctuations, 
f~ 2 , for a: Cu- 48 at% Ni-2 at% Fe, o; Cu-46 at% Ni-4 at% Fe and •: Cu-48 at% Ni-8 at% Fe derived from an 
Omstein-Zernike analysis of neutron SAS data. Extrapolation to f 2 = 0 yields T coh- (After Poerschke et al. 
[1986].) 

Similar measurements have been performed by Furusaka et al. [1986] for Fe-Cr, 
another system with very slow decomposition kinetics. Omstein-Zernike behaviour is 
also found for short aging times within the miscibility gap and large k. From a recent 
neutron SAS study, Bley [1992] concluded that two characteristic lengths are necessary 
to describe the aging behaviour in this system. 

Magnetic clusters and precipitates can be studied with neutron SAS. The magnetic 
structure factor, eq. (19), can, in analogy to nuclear scattering, be reinterpreted by 
introducing an integration over fluctuations of the local magnetization density rather than 
discrete magnetic moments: 

M{k) = — J M{r) exp(-27 r iK-r)d 3 r . (79) 

v s 

A homogeneously magnetized sample will therefore not show any magnetic SAS, but 
fluctuations in the orientation and/or the magnitude of M(r ) will be revealed. 

A number of useful applications of eq. (20) for unpolarized neutrons are possible. In 
the simple case of a magnetic phase embedded in a non-magnetic matrix or vice versa, 
the ratio /V / 11 is independent of the magnitude of k for complete saturation. Any 
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deviation is an indication of incomplete saturation (then spin flip scattering will occur 
and an analysis is worthwhile) or of some other scattering contribution. Using measure- 
ments of /V/ 11 , Beaven et al. [1986] were able to distinguish scattering contributions 
from small voids in the presence of somewhat larger Cu precipitates in Fe (the ratio 
/"V / 11 is 1.4 for voids in ferromagnetic Fe, but 11 for Cu precipitates). 

In inhomogenous alloys, M(k) may vary considerably over a wide range of temperat- 
ures, as the Curie temperature depends on the chemical composition. Without polari- 
zation analysis, magnetic scattering as a function of temperature will give a first 
indication of the extent and magnitude of magnetized regions, fig. 38 shows the SANS 
curves I L and 7 11 for a partially decomposed Ni-14 at% Al alloy containing small (-5 nm 
diameter) nonmagnetic Ni 3 Al particles in an inhomogeneous matrix, measured at a 
temperature where the matrix is completely ferromagnetic (194 K). The Al concentration 
of the matrix is low near the precipitates (~11 at%) and still high at some distance. The 
variation of /"V / 11 is related to the gradient of saturation magnetization density around the 
precipitates. As the temperature is increased, those parts containing higher Al con- 
centrations will become paramagnetic first and some additional scattering in the I L part 
is expected which will initially increase with increasing temperature until, at a higher 
temperature, the magnetized regions are reduced to shells around the precipitates that 
finally disappear once T exceeds the highest T c (corresponding to regions with the lowest 
Al concentration). Preliminary tests using unpolarized neutrons have shown the expected 
results. As for those parts of the sample where the concentration implies a T c near the 
temperature of measurement, the magnetization will fluctuate, and measurements with 
polarized neutrons will be even more instructive. 



2tt:k , 0.1 A ’ 1 

Fig. 38. Macroscopic neutron SAS cross sections at 194 K, o; parallel and •: perpendicular to the magnetic field 
for inhomogeneous ferromagnetic Ni-Al containing nonmagnetic Ni 3 Al precipitates of an average radius of 
about 25 A (Kostorz [1991]). 
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Measurements of magnetic neutron SAS in zero field for bulk ferromagnets are not 
advisable if domain walls are present as these will introduce multiple refraction. 
However, the scattering due to fluctuations of magnetic moments over length scales 
accessible by neutron SAS may be easily followed for dynamical fluctuations (giant 
clusters, superparamagnetism, spinglass behaviour, etc.) as a function of temperature. 
Thus, the formation of finite spin clusters has been revealed by SAS of neutrons, e.g., in 
certain Au-Fe alloys in the early experiments of Murani (1976). No sizeable Fe clusters 
for 15 or 25 at% Fe were observed in recent X-ray SAS experiments (Yoshida et al. 
[1989], Fratzl et al. [1991]) but the same authors found a short-range-order tendency 
by Mossbauer spectroscopy and attributed the strong dependence of the magnetic 
properties on the thermal history in these alloys to temperature-dependent (higher-order) 
correlations, especially third-neighbour Fe-Fe pairs. Other neutron SAS experiments have 
shown magnetic clustering of finite range in Ni-Mn (Moze etal. [1985]), Pt-10 at% Fe, 
(Radhakrishna el al. [1989]) and in amorphous Fe 91 Zr 9 (Rhyne and Fish [1985]). For 
a discussion of SANS from disordered magnets in a magnetic field of variable strength, 
see Cullen and Callen [1986], 

A rare example for the use of polarized neutrons in SAS is the study of com- 
positional and magnetic inhomogeneities in the Fe-65 at% Ni Invar alloy (see Takeda 
etal. [1987]). These authors used two different samples, one with natural Ni (scattering 
length 1.03 xl(T 14 m), the other with the isotope 62 Ni (-0.87 x 10 -14 m), to introduce a 
change of sign between chemical (nuclear) and magnetic scattering-length densities, p(r) 
and PoM(r) (compared in saturation). 

By taking the difference of / 1++ and I 1 , one obtains the interference scattering 
function between nuclear and magnetic scattering which is controlled by correlations 
given by 

J (p(r) - p}{M(r + r') - A/}dV . (80) 

A change of sign of [p(r)~p] will also change the sign of this term. The experiments 
(after correction for depolarization, calibration, etc.) showed a small positive result for 
{/ 1++ -/ 1-- } for the sample prepared with natural Ni, whereas for Fe- 62 Ni the difference 
was negative. A model with paramagnetic Fe-rich regions embedded in a ferromagnetic 
Ni-rich matrix is compatible with the results. However, dynamic spin fluctuations also 
contribute significantly to SAS in this temperature range. 

Neutron depolarization of the transmitted beam is due to magnetic inhomogeneities 
larger than about 100 A (up to a few mm) and is complementary to SAS (see, e.g., 
Rosman and Rekveldt [1991] for antiferromagnetic particles and Mitsuda etal. [1991, 
1992] for spin glass behaviour in Fe-Al and Au-Fe). 

5.3. Defects 

Here, we discuss the SAS from dislocations and clusters of point defects as further 
examples of inhomogeneities revealed by this technique, and some other defect scattering 
results. 

In an otherwise homogeneous material, dislocations yield a SAS signal according to 
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eq. (57) because of local density changes which can be described by elastic continuum 
theory. This scattering is very small, and as the presence of dislocations simultaneously 
increases the probability of double Bragg scattering, early attempts to reveal SAS from 
dislocations with X-rays failed. Atkinson [1959] and Atkinson and Lowde [1957] 
used neutrons of long wavelength and finally found true SAS of the correct intensity 
from deformed metals. Theoretical work by Atkinson and Hirsch [1958a, b] and 
Seeger and coworkers (Seeger and KrOner [1959], Seeger [1959], KronmOller etal. 
[1963], Seeger and Ruhle [1963]) for isotropic cubic crystals and by Seeger and 
KrOner [1959] and Schmatz etal. [1974], who considered elastic anisotropy, has been 
reviewed by Schmatz [1975] along with the (scarce) experimental results. The SAS 
cross-sections are so small that even for high dislocation densities, not many specific 
dislocation arrangements have been studied, and surface irregularities or a few large 
inclusions or pores may easily obscure the scattering pattern. 

In magnetic crystals, magnetoelastic coupling leads to an additional SAS term (for 
unpolarized neutrons), and deformed single crystals of Ni (Anders etal. [1984]) and Fe 
(GOltz etal. [1986]) have been studied making use of the additional intensity and its 
symmetry properties near magnetic saturation. Despite the magnetic scattering, ex- 
periments of this type are very tedious and require a high incident neutron flux. 

The agglomeration of point defects should lead to more easily measurable SAS 
effects as the scattering contrast is much higher. The structure and annealing behaviour 
of voids in A1 single crystals, produced by irradiation with fast neutrons, have been 
extensively studied (see Hendricks etal. [1977]), Lindberg etal. [1977] and references 
therein). X-rays, neutrons, transmission electron microscopy and positron annihilation 
were combined to obtain a rather complete picture of the microstructure of these 
samples. From the SAS point-of-view, the two-phase model can be employed, and no 
interparticle interference effects are visible at volume fractions of about 1% and void 
sizes of several hundred A. Whereas the voids in irradiated A1 showed faceting attributed 
to truncated octahedra with [111] faces, voids formed in quenched and annealed /T-NiAl 
single crystals of strictly stoichiometric composition showed faceting on [110] planes, 
forming a rhombic dodecahedron (Epperson et al. [1978b]). Figure 39 shows an 
example. The crystallographic symmetry of the faceted voids is reflected in the scattering 
pattern. SAS at smaller angles than shown in the figure is “isotropic”, and a radius of 
gyration R a ~ 240 A can be determined for this particular case. 

The void concentrations in the studies mentioned above, about 0.1-1%, were easily 
measurable in otherwise perfect single crystals. Among the factors influencing mechan- 
ical properties of metals and alloys, void formation during fatigue or high-temperature 
creep presents a major problem, and it is desirable to recognize void formation as early 
as possible. SAS, especially with neutrons because bulk samples can be examined 
without further preparation, is a useful method if other scattering contributions are not 
prohibitively large (see Walther and Pizzi [1980]). Grain boundary cavitation in 
fatigued Cu was studied by Page et al. [1982]) who found cavities in the range of 
100-900 A. Yoo etal. [1982] measured neutron SAS from fatigued polycrystalline Ni of 
commercial purity. A strong influence of temperature on void formation was found. 
Small voids, about 20 A in diameter, have been seen by neutron SAS after fatigue 
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Fig. 39. Lines of equal intensity of neutron SAS measured for a /3'-NiAl single crystal containing faceted voids. 
The crystal was annealed at 1600°C, quenched and aged for 30 min at 650°C. The [110] direction was parallel 
to the incident beam. The two-dimensional position-sensitive detector was placed 2.5 m away from the sample, 
the neutron wavelength was 6.64 A. Scale: 0.1 A -1 = 2 ttk. (After Epperson etal. [1978b].) 


deformation of (lll)-oriented Cu single crystals (Kettunen etal. [1981], Lepisto etal. 
[1991]). Figure 40 shows the SAS increment between a late and early state of fatigue 
life. A major scattering contribution at low k must be attributed to dislocations, but at 
higher k, voids (or small dislocation loops) are present (volume fraction ~2x 10 -5 ). The 
existence of small vacancy clusters in the persistent slip bands of fatigued Cu has been 
confirmed by positron annihilation (LepistO etal. [1981]). 

Finally, helium bubbles forming in implanted Fe-Ni-Cr have been studied by neutron 
SAS (Carsughi [1994]). Their size-dependent gas density requires a size-dependent 
contrast factor. 

5.4. Special topics 

A special case of small-K scattering is Bragg diffraction from structures with large 
lattice constants or large-scale periodicity, including magnetic structures (see, e.g., 
Lebech etal. [1989] for a long-range spiral magnetic structure of a period of -700 A in 
a cubic FeGe single crystal). The flux line lattice of type II superconductors with a lattice 
constant typically around 1000 A and first revealed by neutron scattering by Cribier et 
al. [1964], has subsequently been studied in great detail for many different materials and 
conditions (see Gerold and Kostorz [1978], Hicks [1979]). Recently, flux line lattices 
have also been found in Y-Ba-Cu-O superconductors (Forgan etal. [1990], Yethiraj 
etal. [1993]) by neutron diffraction. 
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Fig. 40. Neutron SAS increment between a (111) Cu single crystal fatigued to about 80% of its expected 
fatigue life and another (111) crystal fatigued to only 20% (peak hardened). The circles represent experimental 
values, the solid curve is the sum of curve 1 (void scattering) and curve 2 (dislocation scattering). (After 
Kettunen etal. [1981].) 

Periodic and near-periodic multilayers can also be studied in specular reflection near 
the critical angle of total reflection (see e.g., Penfold [1992]). Such investigations are 
essential in the development of X-ray and neutron mirrors. With the improved col- 
limation at a synchrotron radiation source, SAS measurements under grazing incidence 
are providing a new surface-sensitive tool (Levine et al. [1989]) which has been used to 
study agglomerates and pores at surfaces (see also Naudon etal. [1994]). 

At small angles, multiple refraction in two-phase systems may obscure scattering, 
especially for neutrons of large wavelength and for thick samples (see Bacon [1975], 
Schmatz [1978], Kostorz [1979]). A separation of refraction and scattering is in principle 
possible by varying the incident wavelength. Strong multiple neutron refraction from 
magnetic domains has been evaluated by Galotto etal. [1976], Berk and Hardman- 
Rhyne [1985, 1988] have considered multiple scattering and refraction and developed a 
multiple-scattering treatment of neutron SAS data from dense powders and porous 
systems. Multiple scattering was found to be the appropriate description of the (wave- 
length dependent) scattering from alumina powders (-30% dense, average particle 
diameter 340 nm, samples 2 to 10 mm thick, Hardman-Rhyne and Berk [1985]). The 
theory was also successfully applied to SAS measurements on microporous silica and 
alumina for which the densification was studied (Long et al. [1991]). 

A very simple method of monitoring changes in the microstructure of a sample is to 
measure its transmission. As indicated by eq. (13), the transmission is a function of the 
total removal cross-section, i.e., scattering into any direction outside the range covered 
by the transmission monitor appears like additional absorption. The method has been 
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proposed for nondestructive testing of materials containing large inhomogeneities, e.g., 
cavities (Palacios etal. [1981]). With a very highly collimated beam, and if SAS is the 
only source of additional/variable scattering, a change in the transmission corresponds to 
the integrated SAS intensity change as given by eq. (65), or, for isotropic scattering 



In reality, the lower limit of the integral will be finite, x min . It is also possible to work 
with a fixed acceptance angle 4 0 a for transmission measurements and vary the wave- 
length of the incident radiation, thus testing the value of the integral of eq. (81) as a 
function of its lower limit k^-26/X (see Engelmann etal. [1979] for an experiment 
with ultracold neutrons). As many details of the scattering function will thus be lost, 
angular-resolved measurements are preferred if the intensity is high enough. 

The extended X-ray absorption fine structure (EXAFS), i.e., the energy dependence 
of absorption up to about 1000 eV above the absorption edge (e.g., the K edge, see 
fig. 2) can now be measured with sufficient accuracy. The basic physical process leading 
to oscillations of the absorption coefficient is the modification of the final state of the 
photoelectron of the excited atom by neigbouring atoms (see, e.g., Stern [1974]). The 
effect of different chemical species on phase and amplitude of the oscillations is rather 
complex, but in principle specific for each type of atom. Nearest neighbours of the 
excited atom have the most pronounced influence, and the EXAFS technique is thus a 
useful tool (see, e.g., Fontaine [1993]), complementary to scattering studies in all areas 
discussed and mentioned in this chapter. 

6. Energy transfers 

As mentioned in §2.4, inelastic neutron scattering (coherent and incoherent) is a 
versatile method to study phonons [eqs. (24) and (25)] whereas incoherent (as well 
coherent) energy-resolved quasi-elastic scattering of neutrons reveals diffusive motion. A 
few examples will now be given. 

6.1. Phonons in real crystals 

The dynamic properties of crystals are very sensitive to lattice defects and change 
dramatically near phase transitions. Phonons and defects have been discussed by 
Nicklow [1979] and phonons and phase transformations e.g., by Currat and Pynn 
[1979], and by Schober and Petry [1993]. As an example, fig. 41 (Nicklow etal. 
[1979]) shows the frequency distribution for several phonon wavevectors in a Cu single 
crystal irradiated with thermal and fast neutrons to produce Frenkel pairs (concentration 
~1.3 x 10" 4 ). The measured phonon peaks are shifted relative to those measured for defect 
free Cu, and an additional component on the high-frequency side occurs. Subsequent 
annealing results in an elimination of the peak shifts (72 K), but the additional high- 
frequency structure is only removed at 800 K. Part of the results can be explained by a 
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Fig. 41. The frequency distributions of neutrons scattered from irradiated Cu at 4 K for q = (0, 0, f)/a. Open 
and solid circles are experimental data, the lines are fits including resonant coupling of librational modes of the 
split interstitial. (After Nicklow el al. [1979].) 


resonant coupling of phonons to the librational modes of the split interstitial, but other 
defects, possibly small vacancy clusters, also contribute (see also Dederichs and Zeller 
[ 1980] for details). 

Larger interstitial defect concentrations are present in metal-hydrogen systems (for 
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coherent neutron scattering, D is used instead of H), and local inodes are more easily 
observed (see Blaschko etal. [1988] for D in rare-earth metals). 

The phonon spectra of many crystals show phonon anomalies at temperatures well 
above the transformation temperature. A very clear example is shown in fig. 42 where 
the phonon dispersion surface for acoustic phonons travelling along the chain direction 
of a one-dimensional conductor is indicated. The anomaly for the longitudinal modes is 
related to the special shape of the Fermi surface which is restricted, in the one-dimen- 
sional case, to a pair of points. Because of the electron-phonon coupling, phonon 
anomalies are expected (Kohn [1959]) for scattering processes involving opposite sides 
of the Fermi surface connected by the wavevector 2k p (k F is the Fermi wavevector), i.e., 
when the energy of the electrons is not changed. For a one-dimensional metallic system 
the electronic susceptibility diverges for 2k p . The resulting “giant” Kohn anomaly is 
responsible for the displacive transformation at lower temperature which may be interpreted 
as a “condensation” of these special phonons into a pattern of static displacements, leading 
to a modulation of period \/2k p (see Comes and Shirane [1979] for details). 

In two and three dimensions, Kohn anomalies, i.e., singularities in the phonon 
dispersion curves due to the abrupt change of the screening response of conduction 
electrons when wave vectors at the Fermi surface are involved, depend on the topology 
of the Fermi surface. Sections of low curvature are particularly effective. A structural 
phase transformation may in many cases be induced by the condensation (“softening”) 
of phonons of the appropriate wavevectors to create the new structure. An instructive 
case is /S'-NiAl which shows a broad shallow dip in the transverse acoustic phonon 
branch with [110] propagation and [110] polarization (see Mostoller etal, [1989]) for 



Fig. 42. Room-temperature dispersion surface of KCP [K 2 Pt(CN) 4 Br 03 - jH 2 0] for acoustic phonons 
propagating along the chain direction. A giant 2 k F anomaly is clearly visible for the longitudinal mode. (After 
Comes and Shirane [1979].) 
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the stoichiometric composition. Upon cooling, these phonon modes do not condense, and 
there is no structural transition. For the off-stoichiometric composition Ni-37.5 at% Al, 
a martensitic phase transformation occurs at about 80 K (the martensitic start temperature 
Af s varies from 0 K to 600 K, approximately linearly with off-stoichiometry between 
about 60 at% and 68 at% Ni, see Ochiai and Ueno [1988]). The transverse phonon 
branch shows a deeper dip when the transformation temperature is approached (Tanner 
etal. [1990]), see fig. 43. The transformation sets in before the condensation is complete 
(the corresponding elastic constant C' is also reduced but not to zero), indicating that the 
transformation must be triggered at strain defects, e.g., dislocations (Tanner et al. 
[1990]). Application of an external stress leads to a shift of the dip in the dispersion 
curve (Shapiro etal. [1993]). Zhao and Harmon [1992]) have calculated the band 
structure of Ni x Alj_ x and related the martensitic transformation to Fermi nesting and 
strong electron-phonon interactions and suggest that local variations in concentrations and 
thus in order parameter could trigger the transformation. In contrast to NiAl, isostructural 
NiTi shows a static instability when Af s is approached, and the same phonon branch 
collapses (see Tietze etal. [1984]). 

Dynamic and/or static instabilities are also expected for pure bcc metals undergoing 
a structural transformation. For the group-IV metals, the transition temperatures lie far 
above room temperature, and if a single crystal is to be studied, it must always be kept 
above the transition. In a specially designed furnace (see Flottmann et al. [1987]), 



Fig. 43. Neutron scattering results (energy tranfer E = hi>) for the transverse acoustic phonon branch in [110] 
direction ([llO] polarization) of Ni-37.5 at% Al (B2 structure) at o: 290, a: 150, •: 85 K (after Tanner etal 
[1990].) 
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crystals of bcc Ti (Petry et al. [1991a]), Zr (Heiming et al. [1991a]), and Hf (Tram- 
penau etal. [1991]) have been grown and investigated at high temperatures by neutron 
inelastic scattering. The phonon branches show dynamical (but no static) precursor 
effects for the displacements necessary to reach the hdp a-phase and the w-phase (found 
under pressure or by alloying these metals). 

In the vicinity of the order-disorder transition in Cu 3 Au, selected phonon scans show 
marked changes at temperatures about 25 K below T e (Gaulin etal [1990]) which 
require further experimental and theoretical study. 

6.2. Diffusive motion 

The use of incoherent quasi-elastic neutron scattering (see § 2.4) to study the motion 
of hydrogen in metals has been reviewed, e.g., by Springer [1977] and by Skold et al. 
[1979]. The incoherent scattering is somewhat simpler to interpret, but coherent quasi- 
elastic scattering may also be evaluated (see Springer [1977], Lovesey [1984], and 
Petry etal. [1991b] for the glass transition). The availability of backscattering spectro- 
meters (Birr etal. [1971] and spin-echo instruments (Mezei [1980]) allows changes of 
neutron energies to be detected between 10“ 5 and 1(T 9 eV, corresponding to atomic 
motion on a time scale of 10 -11 and 10 -7 s, or diffusion coefficients in the range of 10 -7 
to 10 -u cm 2 / s. Self-diffusion in solid metals can thus also be studied at temperatures not 
too far below the melting point if the absorption is low. Ait-Salem et al. [1979] and 
GOltz et al. [1980] studied self-diffusion in Na(bcc), using incoherent quasi-elastic 
neutron scattering, at temperatures between 323 and 370.2 K (melting point 370.9 K). 
The k dependence of the energy width gives very detailed information on the jump 
mechanism. The monovacancy mechanism with nearest-neighbour jumps is predominant, 
with a divancy contribution of about 20%. 

A more recent example concerns the unusually fast self diffusion in bcc /3-Ti (Petry 
etal. [1988], Vogl etal. [1989], measured on a backscattering spectrometer directly 
after growth in situ (to avoid the transition to the a-phase at 882°C). Figure 44 shows the 
results obtained at 1460 and 1530°C along with a visualization of the two jump vectors 
used for the fitted curves. Such fits can be attempted for all reasonable combinations of 
jump mechanisms, and the best fits are shown; in fig. 44(a) the admixture of next-nearest 
neighbours tends to improve the fit for some orientations, but in fig. 44(b), the nearest- 
neighbour jumps are better. The authors conclude that, if any, there might be 10-15% of 
direct next-nearest-neighbour jumps. The diffusion coefficients are in the range of 1CT 7 
to 10 -8 cm 2 /s. Spin-echo measurements have been performed down to 1100°C (poly- 
crystals) where the diffusion coefficient is only ~1CT 9 cm 2 /s. The phonon softening 
observed in the inelastic scattering measurements (see above) provides a means to 
explain the low migration enthalpy for nearest-neighbour jumps, as explained in detail by 
Petry et al. [1989]. Diffusion of Fe, Co, Ni and other metallic solutes in group-IV 
metals is at least two orders of magnitude faster than self-diffusion. In a similar study of 
dilute Zr-Co, Co diffusion via a vacancy mechanism was identified as the predominant 
mechanism (Petry etal. [1987]), but only an enhancement of diffusivity by a factor of ten 
was found via this path, calling for another mechanism, e.g., via interstitial sites. While 
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Fig. 44. Quasi-elastic line broadening F (full width at half maximum) for two single crystals of Ti, at (a) 
1460°C and (b) 1530°C, as a function of scattering vector k for different angles a with respect to the lattice 
axes (two different scattering planes). The solid curve is calculated for nearest-neighbour (NN) jumps, the 
dotted curve for 75% NN jumps and 25% direct next-nearest-neighbour jumps, (c) Schematic of the two types 
of jump used in the fits of (a) and (b). (From Vogl etal. [1989].) 


a fraction of 28% of interstitials was still compatible with these measurements, lattice parameter 
changes (Heiming et al. [1992]) and diffuse scattering (Heiming et al. [1991b]) show that 
only a small fraction, i.e., a few percent of the solutes present, may be on interstitial sites. 
This means that the interstitial diffusion path is considerably faster than anticipated. 
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Further reading 

General 

The books referred to in the text and contained in the list of references are recommended for general reading, 
in particular the books by Schwartz and Cohen and by Cowley and, more extensively covering real crystals, 
the book by Krivoglaz. An up-to-date account on theory, methods and applications of neutrons and X-rays 
in condensed matter research is given in the three volumes Neutron and Synchrotron Radiation for Condensed 
Matter Studies, edited by J. Baruchel, J. L. Hodeau, M. L. Lehmann, J. R. Reynard and C. Schlenker (Les 
Editions de Physique, Les Ulis, France, 1993). Volume I is entitled Theory, Instruments and Methods, Volume 
II Applications to Solid State Physics and Chemistry, Volume in Applications to Soft Condensed Matter and 
Biology. 

Neutron Scattering 

In addition to the books referred to, G. L. Squires, Introduction to the Theory of Thermal Neutron Scattering 
(Cambridge Univ. Press, 1978) may be consulted. Several chapters in G. Kostorz, ed., Neutron Scattering, vol. 
15, Treatise in Materials Science and Technology (H. Herman, series ed., Academic, New York, 1979) contain 
further details about metallurgical applications. More recent materials studies with neutrons are described in the 
Enrico Fermi Course proceedings edited by M. Fontana, F. Rustichelli, R. Coppola, Industrial and 
Technological Applications of Neutrons (Soc. Italians di Fisica, Bologna, 1992). 

Diffuse scattering 

Beyond the general coverage of the subject in the book by Schwartz and Cohen [1987], Bauer’s [1979] 
chapter (see references) gives more details on the use of neutrons. 
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Small-angle scattering 

The classical monograph by Guinier and Fournet [1955] is complemented by “Small-Angle X-ray 
Scattering”, eds. O. Glatter and O. Kratky (Academic, London, 1982) which gives a solid theoretical 
background and describes many methods and applications primarily for statistically isotopic systems. C. 
Williams et al. [1993] (see references) report on a broad range of recent X-ray and neutron studies, and 
neutrons are more extensively covered by Kostorz [1979] (see references). Reviews and research papers of 
the two most recent International Small-Angle Scattering Meetings, 1990 in Leuven and 1993 in Saclay, can 
be found in J. Appl. Crystallogr. 24 (1991) and in Journal de Physique D (1994/1995), respectively. 
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Ch. 13, § 1 


1. Introduction — The chemistry of interfaces and physical metallurgy 

Writing more than a hundred years ago, one of the progenitors of modern physical 
metallurgy, Roberts- Austen [1888] stated: “One thousandth part of antimony converts 
first rate best selected copper into the worst conceivable”. Although it was well known 
at that time that small amounts of certain elements could provoke remarkable changes in 
the physical and mechanical properties of metals — for example, the improvement in the 
strength of iron by carbon and the reduction in the electrical conductivity of copper by 
impurities — the above refers to the dramatic embrittlement of an otherwise ductile 
metal, associated with intergranular fractures. This is a cogent illustration of the subject 
matter in the present chapter, in which we shall be concerned with how the chemical 
constitution of the various interfaces in polycrystals influences the physical metallurgical 
properties. Observations on the transformation of ductile metals to intergranularly brittle 
matter clearly implicate the chemical constitution of the grain boundaries and point to the 
existence of basic, underlying mechanisms which lead to interfacial regions having highly 
localized and altered chemistry. In this chapter, the generic term interfacial microchem- 
istry denotes the science and practice of those phenomena in polycrystalline materials 
which give rise to zones of chemical heterogeneity. This may occur from either the 
equilibrium or the non-equilibrium partitioning of those solutes from solid solutions 
which are present as alloying elements or as residual impurities. 

In metallurgical practice, the term segregation is used in its original sense to denote 
the build-up of concentrations of elements during crystallization from casting. In more 
recent times, metallurgists studied, by indirect procedures, various effects which implied 
very local chemical-concentration changes at grain boundaries. The latter concentrations 
were thought to be the origin of metallurgical problems such as hot cracking or temper- 
embrittlement in low alloy steels, as demonstrated by increases in the ductile-to-brittle 
transition temperature during impact testing. Such indirect observations associated with 
the presence of certain residual impurities at grain boundaries were commonly referred 
to as “segregation” in metallurgical circles (McLean [1957]) and although common 
usage dictates that we shall continue to use this term in the context of interfacial 
microchemistry, we clearly distinguish between the phenomena which give rise to these 
types of equilibrium segregation and the phenomena which give rise to the original 
solidification segregation. Indeed, as will be demonstrated, this redistribution of solutes 
by equilibrium segregation is truly reversible as analyzed by thermodynamic procedures, 
and it is quite analogous to the chemisorption or adsorption phenomena long studied by 
surface chemists for gas atoms interacting with free solid surfaces. Although the terms 
“segregation” and “adsorption” have acquired historically these specific usages in their 
respective disciplines, in order to emphasize the fact that the basic phenomena involved 
are identical, we shall apply the terms “segregation” and “adsorption” interchangeably to 
both the external interface, such as the free surface and the internal interface, such as the 
grain boundary. 

In addition to grain boundaries which are the principal interfaces with metallurgical 
consequences, interfacial microchemistry embraces a variety of interfaces in polycrystal- 
line materials, among them free surfaces, the interfaces between various constituent 
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phases in the solid, stacking faults, and precipitate/matrix interfaces. From a macroscopic 
point of view of this theme, we underline that all these interfaces behave essentially in a 
similar way in spite of widely differing, often very complex structures at the atomic level 
— the same thermodynamic analyses are applicable to all these types of interface, which 
may be described by the classical concepts of surface adsorption chemistry; indeed similar 
adsorption isotherms hold, the conditions for the existence of any particular form and the 
magnitude of the adsorption parameters depending on the nature of the interface alone. 

Interfacial microchemical phenomena in polycrystals can be visualized by aid of the 
schema in fig. 1. The multicomponent solid mass is held at equilibrium in an isothermal 
enclosure and at a temperature which permits rapid thermal transport. The chemical 
potentials of all species, assumed here to be in solid solution, are constant throughout the 
system. Instead of a random distribution of solute species, as in an ideal single crystal, 
it is found that at equilibrium there is a heterogeneous partitioning which results in the 
enrichment of interfaces by certain surface-active species, as shown diagrammatically in 
fig. 1. The levels of the enrichment are defined only by the system parameters at 
equilibrium and not by the history of the material. The chemical enrichment at any 
interface can be reproduced simply by re-establishing the identical physicochemical 
conditions. It is in this sense that the segregation is truly reversible. 

An essential condition of equilibrium segregation, whether at a free surface, a grain 
boundary or precipitate/matrix interface, is that the width of matter over which it builds 
up is influenced by the structural width of the interface. The field of structural pertur- 
bation of a grain boundary should not extend beyond a few atom distances, and it has 
now been experimentally confirmed that the space occupied by the segregation is 
constrained to within the structurally perturbed region of the interface (§ 3.2). Hence a 
distinguishing feature of equilibrium segregation is that it is localized to within one or 
two atom distances of the plane interface. Thus we speak of a fractional monatomic layer 
level of segregation at grain boundaries and other interfaces, by which we mean, by 
analogy with the concepts of monomolecular adsorbed layers in surface chemistry, that 
a fraction of a single equivalent atomic layer consisting of a sheet of close-packed atoms 
lies along the interface. Of course this is an idealization, and in practice monatomic 
layers of this nature probably do not form. However, it is a convenient way of visual- 
izing the extent of segregation and of quantifying it, whether this is expressed as a 
fraction of a monatomic layer or as several such layers. 

In contrast to this equilibrium segregation, various phenomena exist in which the 
apparent levels of segregation build-up may extend to distances of as much as several 
micrometers across grain boundaries. This effect is sensitive to the rate of cooling from 
a high temperature. The earliest effect of this type was announced by Westbrook and his 
collaborators (Westbrook [1964]) in a series of experiments in which they claimed to 
detect such an effect through changes in indentation microhardness profiles across grain 
boundaries in certain dilute alloys. Although falling within the scope of a general 
treatment of interfacial microchemistry, this phenomenon must be distinguished firom that 
involving equilibrium segregation, which is probably the most ubiquitous form of 
interfacial enrichment in solid systems. The origin of the former segregation is also 
believed to be entirely different from that involving reversible segregation — namely, 
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during a temperature fall, vacancies flow to the grain-boundary sinks in order to preserve 
their thermal equilibrium value. A solute concentration is thus built up about the 
boundary because of the dragging of vacancy — solute pairs. Although highly system- 
specific, the phenomenon may be of consequence in certain metallurgical operations and 
this, together with other related manifestations, will be considered later in this chapter. 
To distinguish this phenomenon from reversible segregation, we refer to it as non- 
equilibrium segregation. 

In order to understand the impact of interfacial microchemistry on physical metallur- 
gical processes, our aim in this chapter is to provide descriptions of the equilibrium and 
non-equilibrium segregation phenomena in terms of basic physical, chemical and kinetic 
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parameters, and then to relate such segregation to changes in mechanical properties, 
embrittlement in particular. Indeed, the manner by which a narrow sub-monatomic layer 
of a foreign species along a grain-boundary path may so affect bulk mechanical 
properties as to convert a tough metal into a fragile one, has been the point of departure 
of many intriguing basic studies. However, it is beyond the scope of this chapter to 
enlarge in detail on all the mechanical effects that have been scrutinized in recent years 
in terms of the role of interfacial microchemistry. 

2 . Thermodynamic features of interfacial adsorption 

The thermodynamic property possessed by all interfaces in common is a free energy, 
which is a measure of the unsatisfied atomic bonding in the relatively disordered 
structure of the interface. In liquids, where it is referred to as surface tension, this free 
energy is easily demonstrated to be able to do work, as testified by many conventional 
capillarity phenomena, such as droplet formation or the shapes of soap bubbles. Foreign 
species may accumulate at these interfaces in order to satisfy better the atomic bonding, 
and hence to reduce the free energy: this is in simple terms the basic motivation for the 
segregation of foreign atoms at interfaces. It is in this way that soap added to water 
reduces its surface tension. This free energy is the central thermodynamic function which 
characterizes all interfaces whether in solids or liquids, and because its magnitude 
depends on temperature, crystalline orientation and the presence of foreign species, its 
determining presence is felt in many physical phenomena in nature and in turn in the 
mechanical and chemical properties of materials. 

The fundamental connection between surface free energy and the chemical state of 
the surface was first demonstrated in the last century by Gibbs [reprinted 1948] in an 
elegant and generalized treatment. This treatment and the mathematical derivations are 
shown in detail in a number of textbooks on surface chemistry to which the reader is 
referred (e.g., Defay etal. [1966]). In his classical derivation of the fundamental theorem 
relating the “surface of tension” of a fluid to bulk composition and the excess of species 
at the surface, Gibbs introduced for convenience the device of a mathematical “dividing 
surface” which was placed arbitrarily in order to make the surface concentration of the 
solvent species vanish; however, for internal interfaces such as grain boundaries which 
were not considered by Gibbs, there are conceptual difficulties in locating this mathemat- 
ical dividing surface in a manner that corresponds to a physical surface of separation, 
because the physicochemical properties vary continuously from one homogeneous phase 
to the adjoining phase. Therefore, in the treatment of adsorption processes at internal 
interfaces it is now customary to adopt the convenient approach of Guggenheim [1950], 
in which the interface is considered as a separate phase of finite thickness, and where the 
thermodynamic relations so derived are independent of the thickness of the phase 
separating the two adjoining phases which may be two crystals or a crystal in contact 
with a vapour. For a multicomponent solid, following the approach of §9 of ch. 5, 
leading to eq. (69) there, but replacing the surface tension, <j, by the scalar interfacial 
free energy y, the Gibbs adsorption isotherm may be derived. In the dilute approx- 
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imation, which is that of general interest, for a bulk solute molar concentration X c « 1 , 
in which Henry’s law holds, this becomes: 


dy 


dl nX 


= -rtt 2 . 


(i) 


where T 2 is the interfacial excess of the solute species expressed in mol/m 2 . This is the 
most useful form of the Gibbs Adsorption Theorem for a dilute binary system and in this 
basic form it has been used experimentally to relate changes in composition of the bulk 
to the changes in both the free energy and the surface excess composition for all 
interfaces in solids and in liquids. (See also ch. 5, §9.1.). 

To illustrate the use of the Gibbs Adsorption Theorem in the dilute form in order to 
measure interfacial excesses, fig. 2 shows for three interfaces in pure iron — the liquid 



Fig. 2. Isotherms for interfacial free energies in the dilute binary system Fe-Sn: data for the liquid surface from 
Kozakevitch and Urbain [1961]. 
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surface, the solid surface and the average grain boundary — the variation in the 
interfacial free energy as a function of the bulk tin content (Seah and Hondros [1973]). 
The three interfaces behave similarly in the general form of the dependence of interfacial 
energy on bulk composition, in that there is an early rapid fall in the isotherm, followed 
by a gradual weakening of the effect as the composition increases. This is typical of the 
behaviour that has been encountered in many such binary systems studied experimentally. 
The interfacial tensions for the pure metal are also typical in that the free energy of the 
solid surface is significantly greater than that of the liquid surface, while the grain- 
boundary free energy is approximately one third of that of the surface free energy. The 
data shown in this figure are analyzed by application of eq. (1) to the results pertaining 
to the specific temperature of the experiment. The amount of interfacial excess expressed 
by r 2 is in terms of moles of the solute species per unit area. In fig. 3 these quantities 
have been transformed into equivalent monatomic layers, and the surface composition is 
shown as a function of the bulk tin content for the three interfaces in question. The 
behaviour of the liquid surface is more ideal, in that it shows the approach towards a 



Fig. 3. Adsorption isotherms for tin in iron interfaces: the low-temperature isotherms (dashed) are measured 
directly by AES: the other are derived from the interfacial energy data shown in fig. 2. 
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saturation adsorption of tin at the liquid iron surface, at a level of about a single 
monatomic layer. With increasing bulk content, the interfacial excesses shown for the 
grain boundary and for the solid surface increase, but in this particular system and in the 
conditions of this experiment, there is no apparent sign of saturation adsorption. In fact, 
for the system iron-tin the behaviour in the solid state is not typical in that there occurs 
multiple layer segregation (Seah and Hondros [1973]). 

In the case of iron containing dilute amounts of phosphorus, which typifies the 
behaviour of iron in the presence of a number of species, including sulphur and oxygen, 
the dependence of the grain-boundary free energy on the bulk phosphorus content is 
shown in fig. 4 (Hondros [1965]). From the slopes of the surface tension isotherm at 
1723 K, the grain-boundary excesses are calculated and the resulting adsorption isotherm 
is superimposed on the same figure. At the saturation part of the isotherm, the calculated 
grain-boundary excess of phosphorus is 0.65 x 10 19 atoms/ m 2 which corresponds to 
approximately a third of a monatomic layer of phosphorus at grain boundaries, assuming 
the phosphorus atoms to be hard spheres spread out evenly and compactly across the 
interface. This compares with 1.4 x 10 19 atoms of phosphorus per m 2 for the saturation 
coverage of the free surface which, in turn, corresponds to about 0.75 atoms of phos- 
phorus per outermost atom of iron. 

As will be shown later, there has been an enormous stimulus to the study of 
interfacial microchemistry in both binary systems as well as multicomponent engineering 
alloys as a result of the advent of the modem surface spectroscopy techniques which 
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Fig. 4. Grain-boundary energy and segregation isotherms for phosphorus in iron (after Hondros [1965]). 
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permit the direct measurement of segregation. In order to build up a body of reliable 
data, it is useful that the quantities determined by the different approaches be made 
compatible. Thus we have extended to all interfaces the notion of an Interfacial Enrich- 
ment Factor, (3 S , defined as the ratio between the interfacial concentration, X s , (in mole 
fraction of a monatomic layer) and the bulk solute concentration, X c , (expressed as mole 
fraction). Thus: 


ft 



( 2 ) 


where T s is the Gibbs excess defined earlier as T 2 and expressed as mol/m 2 and r° is 
the quantity of solute in moles constituting a close packed monatomic layer of unit area. 
This can be shown to be related to the slope of the surface tension isotherm (eq. (1)) as: 


ft =- 


RTF, 


dy s 

dX r 


(3) 


In this way comparisons can be made between enrichment levels obtained by the Gibbs 
technique and those obtained directly using surface and interface analysis techniques. 


3. Methods of measuring the microchemistry of interfaces 

There now exists a copious technical literature on the subject of the detection and 
measurement on interfacial segregation, and one will remark on the wide range of 
imaginative approaches which were deployed during this century in attempts, mostly 
qualitative, to detect the suspected contaminants at grain boundaries which gave rise to 
various metallurgical effects. These mostly indirect techniques include: electrode potential 
measurements on grain-boundary fracture surfaces; a number of subtle metallographic 
features observed at grain boundaries; variations in X-ray lattice parameter measurements 
with different grain sizes; internal friction measurements; micro-indentation hardness 
measurements; the spectrographic analysis of material carefully extracted from the grain- 
boundary region; auto-radiography; and deductions made from anomalous grain growth 
behaviour. From the point of view of furthering this subject matter and offering a deeper 
insight, these past methods are not recommended to the reader as vital to the pursuit of 
the theme. The interested reader will find them fully covered in the review by Hondros 
[1976], but for most purposes these early techniques can be relegated to studies of the 
historical evolution of the subject: in one respect it is interesting to read these accounts 
as a testament to the highly ingenious approaches that the tenacious research worker is 
forced to adopt in those frequently encountered situations where the available tools are 
inadequate to satisfy his curiosity and sense of scientific exploration. These methods have 
been effectively superseded with the development of a range of sophisticated direct 
surface analysis techniques, based upon a variety of electron and ion emission processes; 
these and other significant techniques are considered below. 
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3.1. The interfacial energy or Gibbsian approach 

Not only is there a requirement for data on interfacial energies in alloys as quantities 
in the modelling of various metallurgical processes, but as we saw above, a knowledge 
of the dependence of the interfacial free energy on bulk composition yields a quantitative 
measure of the amount of the species concentrated at that interface. This is the classical 
Gibbsian approach to the problem, and its most serious disadvantage is that since it is an 
indirect procedure, one requires an a priori knowledge of the relevant species procuring 
the change in the interfacial energy. Thus it finds its greatest applicability in binary 
systems where one knows in advance the elements involved. This inability to detect 
chemical species is a serious disadvantage in the study of multicomponent alloys where 
one must presuppose a knowledge of the surface active species from prior general 
metallurgical observations. 

The demand for relevant interfacial free energy data for the solid state arising, for 
example, from studies of grain growth or nucleation phenomena, has stimulated a variety 
of measurement approaches such as heat of solution calorimetry or controlled cleavage 
of the crystal. Most of these have been found inadequate or intrinsically not convenient 
for studying adsorption effects. For the measurement of the surface free energy of a 
metal, the most successful technique, and indeed the only one to have supplied data on 
Gibbsian segregation, is that described as the Zero Creep Technique in which the surface 
energy is deduced from a knowledge of the critical externally applied force necessary to 
counter-balance the capillarity forces which tend to shrink, at high temperature, samples 
with a high surface-to-volume ratio. The idea that at high temperature a solid will 
measurably shrink in order to reduce its surface area has been used in various forms by 
investigators, but we note that its main limitation is that because the shape change 
depends on solid-state diffusion, the technique is limited in practice to temperatures 
generally above 0.7 T m . The zero creep technique, as well as other measurement 
techniques, are described in detail by Hondros [1970], 

As in the case of free surfaces, a wide range of techniques has been explored in order 
to measure grain-boundary free energies. These include isothermal calorimetry but in 
practice, most techniques are restricted to the determination of the grain-boundary 
energies for the pure metal. The only systematic data on the compositional dependence 
of grain-boundary energies have come through the application of polyphase equilibria 
techniques, which yield a relative grain-boundary/surface free energy value. The simple 
procedure is based on the observation that at high temperatures, the kinetics of mass 
transport allow intersecting interfaces to assume equilibrium configurations consistent 
with a minimum of energy, in which the local surface tensions can be analyzed as if in 
vectorial balance (Smith [1948]). This idea has been used for a variety of situations and, 
in the case of grain boundaries, the microtopography of a grain boundary intersecting a 
free surface (fig. 5) is a distinct groove along the intersection of the boundary with the 
surface. This configuration indicates a balance in the tensions between the two crystalline 
surfaces and that of the grain boundary and in these conditions, assuming the surfaces to 
be isotropic, the equilibrium is given by: 
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Vapour.liquid or solid 



Fig. 5. Equilibrium configuration for a grain boundary intersecting the free surface, showing the dihedral angle. 


y b = 2y s cos(0 2), (4) 

where 6 is the dihedral angle formed by the groove. In practice, the polycrystalline 
sample is equilibrated in physicochemical conditions which duplicate those in which y s , 
the reference value, is measured by the Zero Creep Technique. After a prolonged period 
of heating at temperature, which allows the equilibrium geometry to be attained, dihedral 
angles are measured on about 50 intersecting boundaries using optical interferometry or 
some other suitable technique. In this way, errors arising from the anisotropy of 
interfacial energies can be neglected, since it has been shown that eq. (4) is adequate if 
the grain-boundary/surface energy ratios are averaged statistically over a large number of 
observations. The method of thermal grooving applied to crystallographically fully 
characterized bicrystalline samples can be used for determination of the energy of 
individual grain boundaries. This can be documented, for example, by measurements 
recently performed by Wolf etal. [1992] who constructed the orientation dependence of 
grain-boundary energy for asymmetrical twin boundaries in copper. These measurements 
revealed an energetical minimum corresponding not to the symmetrical {112} interface 
as would be expected, but at an asymmetrical boundary deviated 8° from the {112} one. 

3.2. Modern surface analysis techniques 

The stimulus for much of the present-day work on surface and interface microchem- 
istry arises from the ease of use and interpretation of measurements made with the 
surface analysis techniques developed in the last decades. To give some idea of the 
problems involved in this analysis, suppose a typical metal is fractured in air to provide 
a clean surface. This will be covered by a monolayer of oxygen atoms in a time of the 
order of one nanosecond. Thus, to study the surface or interface chemistry and reactions 
over a reasonable timescale, vacua of the order of 10“ 8 Pa (10“ 10 torr) are required to 
keep the environmental contamination under control. This technical capability became 
generally available only with the development of ultra-high vacuum (UHV) technology, 
in the mid-1960s. Thus, it was not until the 1970s that the surface analysis techniques 
became firmly established as tool for materials studies. 
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There are four main techniques, Auger electron spectroscopy (AES), X-ray photoelec- 
tron spectroscopy (XPS), ion scattering spectroscopy (ISS) and secondary ion mass 
spectroscopy (SIMS), which have been used to study segregation and each of which 
provides slightly different information. In general, the techniques are operated in UHV 
and are used to study the free surface of solid. The solid may be cleaned by argon ion 
sputtering, and the free surface segregation studied as a function of time, temperature, 
ambient environment or a combination of these. For the study of grain boundaries, these 
are exposed by fracture in the UHV of the instrument, using a suitable fracture stage. If 
the segregant embrittles the grain boundary, this fracture may be achieved fairly readily 
either at room temperature or by cooling the sample to liquid nitrogen temperatures. In 
the absence of intrinsic embrittlement, the grain boundary may be parted in some 
materials by impact after cathodically charging the solid with hydrogen to promote 
hydrogen embrittlement. Of course, for grain-boundary studies, unlike those on free 
surfaces, a fresh sample must be prepared for each heat-treatment time and temperature. 
The relative ease of studying free surface explains, to some extent, the high level of 
effort devoted to free surfaces compared with that for grain boundaries. 

The analysis of the prepared surface in UHV is usually undertaken with the most 
powerful and popular of the techniques, AES (Briggs and Seah [1990]). In this, a 
focused electron beam of 1-30 keV energy is used to excite atoms in the surface layer 
of an appropriate area of the target, creating holes in their inner core energy levels. The 
atoms subsequently decay by emission of either a characteristic X-ray (which is the basis 
of bulk chemical analysis) or by the emission of a characteristic Auger electron, which 
is the basis for surface analysis. We show in fig. 6(i) the inner core level diagram for an 
element heavier than neon as an example. The initial core hole may be created in level 
K which is then filled by a higher energy electron from, say, L 3 with the energy balance 
taken by, say, a second L 3 electron which is ejected from the atom with an energy E A 
and is known as an Auger electron. E A is characteristic of the particular atom and in this 
case is given approximately by: 

E A =E K -E Li -E Li . (5) 

Fortunately, of the very many transitions that may occur, only a few are strong for each 
element and these enable the surface atoms to be clearly identified from the emitted 
electron energy spectrum. Atoms from depths greater than a few monolayers (1 nm) also 
eject Auger electrons but these do not escape easily and so do not contribute to the 
emitted line spectrum. Thus AES is characteristic of the outermost atomic layers of a 
solid and, by focusing the electron beam, we may now localize the composition of 
heterogeneous surfaces to a resolution across the surface that currently approaches 20 
nm. Elements, except for H and He, are generally detected with sensitivities in the range 
one part in 100 to 1000. 

A classical example which illustrates the use of AES is the analysis of the segregat- 
ion giving rise to the temper brittleness in the 3CrjMo steel rotor which initiated a well 
known failure at a power station (Kalderon [1972]). The fracture of this material is 
fully intergranular as shown by the scanning electron micrograph of fig. 7. AES analysis 
of a sample taken from the vicinity of the original crack, and fractured in the UHV AES 
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(i) (ii) 

Fig. 6. Schematic transition level diagram for electrons in (i) AES and (ii) XPS. 

system, shows 45% of a monatomic layer of phosphorus at the grain boundary. The 
spectrum shown in fig. 8 also identifies the alloying elements present and the carbides at 
the grain boundary. 

That the segregant is located in the grain-boundary zone may be demonstrated by 
slowly peeling away the surface atoms, using in situ argon ion sputtering and monitoring 
the surface composition by AES. In this way a composition-depth profile may be 
obtained for all elements present. Measurements for many segregants at grain boundaries 
are shown in fig. 9 where it is seen that the results agree with the solid curve predicted 
on the hypothesis that the segregant atoms are on the outermost atom plane of the 
fracture surface and hence associated with the precise atom plane of fracture. 

A particular attribute of AES is the ability to achieve a fine localization on the free 
or fracture surface with the focused electron beam. This allows anisotropy effects 
between different crystals and the segregation at inclusion-matrix interfaces to be 
analyzed with comparative ease. The present state of the method also enables to detect 
fine variations in segregants contents at different parts of both matching fracture surfaces 
of a bicrystal, connected with an asymmetry of the fracture process on the atomic scale 
(Menyhard et al. [1991]). 

The second of the techniques, XPS (Briggs and Seah [1990]) has some advantages 
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Fig. 7. Scanning electron micrograph of the service failure of a temper-brittle 3Cr]Mo rotor steel, showing the 
fully intergranular fracture surface (field of view approximately 200 gm) (from Seah [1975]). 


over AES although it lacks spatial resolution. In XPS, as shown in fig. 6(ii), electrons are 
ejected from the surface atom core levels by the characteristic X-rays of energy hv from 
a metal target, usually Al or Mg. The emitted electron energy, E x is given in the example 
of fig. 6(ii) by: 

E x =hv-E u . (6) 

Thus, each of the lines in the emitted photo-electron spectrum directly reflects the core 
level positions, identifying the emitting atoms which, for the same reasons as given for 
AES, are those in the outermost 1 nm of the surface. The precise position and shape of 
the photo-electron line depends on the chemical state of the surface atoms, and so XPS 
can be used to probe the chemical state of the segregant atoms and to separate out those 
elements which are in micro-precipitates on the surface. For instance, in fig. 10 details 
are shown of the 2p (or L 23 ) line for phosphorus on the free (100) surface of an iron 
single crystal. The various curves show: (1) a true segregant; (2) phosphorus segregated 
plus Fe 3 P precipitates where the phosphorus level is above the solubility limit during the 
heat treatment; and (3) the dotted curve from solid Fe 3 P. 

During the past years, a veritable plethora of alternative surface analysis techniques 
has emerged. Of these, the only one that has been used for segregation studies is ion- 
scattering spectroscopy (ISS). In ISS (Niehus et al. [1993]) the mass of atoms in the 
surface atom layer is determined by recording the energy loss of ions bounced off those 
atoms as if in a billiard-ball collision. Using a detector at an angle 6 to the incident beam 
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Electron energy, eV 

Fig. 8. Auger electron spectrum from the fracture surface of fig. 7, exhibiting 0.45 monolayers of phosphorus 
segregated at the grain boundary (after Seah [1975]). 


of ions of mass A/, and energy £„ it is easy to show that surface atoms of mass M will 
reflect those ions into the detector with an energy E M where: 


M = Mi 


E-, + 2- x /£^£~ cos 9 + E m 
Ei-Es, 


( 7 ) 


This allows the analysis of the surface atoms. However, it will be seen from eq. (7) that high 
masses are all found bunched in a small energy range near £, and thus, because of energy 
blurring terms which give rise to peak broadening, it is difficult to distinguish adjacent masses 
heavier than copper. ISS has, however, two main advantages over AES and XPS which are 
very pertinent to studies of surface segregation, especially in applications related to adsorption 
and catalysis. These are that by using neon ions of 1 keV energy, the analysis is limited to 
the outermost atom layer only and by suitably directing the beam at a single crystal surface, 
atom site positions can be determined very accurately by shadowing effects. 

SIMS, as a surface analysis technique can well supply useful information. As with 
ISS, the surface is interrogated by an ion beam with an energy in the range 1-10 keV, 
with a flux of some nA/cm 2 (Benninghoven etal. [1982], Briggs and Seah [1992]). 
Secondary ions that are then emitted may be mass analyzed to determine the atom 
groupings on the surface since these groupings are reflected in the total mass of the 
clusters detected. Complex atom groupings of twenty or more atoms can be observed 
which represent bonding arrangements at surfaces (Briggs [1982]) and so, in principle. 
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Fig. 9. Localization of segregant atoms at the grain boundary in many systems by AES with argon ion etching 
(after Seah [1980a]). 

the interactions of co-segregants can be studied. SIMS has other advantages of detecting 
hydrogen (Fukushima and Birnbaum [1984]) and of being much more sensitive than 
AES or XPS; however, it is very difficult to quantify the spectra and hence it has not yet 
become popular as a technique to study segregation. This may change with the develop- 
ment of the detection of “sputtered neutrals mass spectroscopy” (SNMS), using post- 
ionization of the sputtered matter to effectively reduce matrix effects. Time-of-flight 
SIMS (TOF-SIMS) is particularly useful for submonolayer detection and is often 
combined with SNMS (Briggs and Seah [1992]). 

3.3. Micrographic techniques 

Three micrographic techniques have been used in the study of grain-boundary 
segregation, and although each has its own unique advantages, as noted below, none can 
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Fig. 10. X-ray photoelectron 2p core line of phosphorus: (1) segregated on Fe (100); (2) segregated with Fe 3 P 
precipitates; (3) from solid Fe 3 P (after Egert and Panzner [1982]). 


be used for the parallel study of surface segregation. These three techniques have the 
crucial advantage over the techniques discussed above that, in addition to being analytic- 
al, they can provide measurements for boundaries that will not undergo fracture. They 
thus enable a new range of systems to be studied. 

The most popular of the micrographic techniques is ion microprobe analysis or, as it 
is sometimes known, secondary ion microscopy. In this technique, a mass spectrometer 
is used to construct emitted-ion images of the surface bombarded by a primary ion beam. 
The grain boundary is arranged normal to the surface being imaged, so it intersects the 
surface in a line that does not move as the surface is eroded by the primary ion beam. 
In this way a trace of the boundary can be imaged using the segregant atoms, with a 
resolution as low as 20 nm (Rudenauer [1994]). Isotopes can be separated, and the 
elements hydrogen and helium, not observed in AES or XPS, may be clearly imaged. 

Another micrographic technique which has unique advantages is scanning trans- 
mission electron microscopy (STEM) with X-ray analysis (Doig et al. [1981]) or with 
electron-energy loss spectroscopy (EELS; Bruley et al. [1994]). A thin section of 
material is prepared with the grain boundary again perpendicular to the surface and the 
sample is then imaged in the electron microscope. The segregant is measured using a 
nondispersive X-ray detector with the electron beam on the boundary or by determination 
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of the characteristic energy loss features in the transmitted primary beam by an electro- 
static analyzer. These techniques have the advantage that small precipitates and inclu- 
sions on the boundary can also be structurally analyzed by electron diffraction, but have 
the disadvantage of requiring laborious and sophisticated specimen preparation (see also 
ch. 11, §2.10). In the last decade, the X-ray method has been improved to identify 
quantitatively the crystallographic sites, distribution and types of impurities in many 
crystals. This method, which takes advantages of the dependence of localized atomic 
characteristic X-ray emission on the orientation of the incident electron beam, is known 
as ALCHEMI (Atom Location CHannelling Enhanced Microanalysis (Spence and Taft0 
[1983])). When the dimensions of segregant and matrix atoms substantially differ from 
each other (e.g. Bi and Cu), the atomic positions of a large segregant at the grain 
boundary can be identified by high-resolution electron microscopy combined with a 
theoretical study of a segregated boundary. In this synergistic way, Luzzi et al. [1991] 
were able to decipher with great accuracy the structure of segregated grain boundaries 
containing multiple atomic sites. 

The third micrographic technique is atom-probe field ion microscopy (AP FIM; 
Miller [1987]; Tsong [1990]). Here the atoms on the end of a sharp tip of the material 
may be imaged with atomic spatial resolution. Selected atoms may then be removed from 
the tip, using (electric) field evaporation, and mass-analyzed. Thus, in principle, 
individual atoms around a grain boundary or a precipitate/matrix interface may be 
analyzed and counted. In a alternative variant of the instrument, atoms of selected masses 
may be field-desorbed and imaged. In this way images of the trace of a grain boundary 
may be constructed with near atomic resolution. For example, a boron-segregated grain 
boundary in Ni 3 Al is displayed by means of AP FIM in fig. 11. 

4. Theory of segregation processes 

4.1. Introduction: equilibrium and non-equilibrium segregation 

In this section we are concerned with the theory of microsegregation in two forms: 
equilibrium and non-equilibrium. Equilibrium segregation occurs, as noted earlier, as a 
result of inhomogeneities in the solid giving rise to sites for which solute atoms have a 
lower free energy. These sites occur at interfaces such as the free surface, grain 
boundaries and phase interfaces as well as at defect sites, dislocations and stacking faults. 
All of these regions have concentrations of solute atoms which differ from each other 
and from that of the bulk materials as shown schematically in fig. 1. At any given 
temperature, there is a unique solute concentration for each of these sites that is 
asymptotically approached as time goes to infinity and at a rate governed by diffusion. 

On the other hand, non-equilibrium segregation depends on rate processes and kinetic 
events and, in general, disappears as time approaches infinity if diffusion processes are 
allowed to reach full equilibrium. There are a number of discrete routes for producing 
this form of segregation, which include moderate rate quenching of samples from a high 
temperature, the growth of precipitates, the effect of stress at temperature, etc., which are 
dealt with in more detail in § 4.7. Here we concentrate on equilibrium segregation which 
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Fig. 11. Field-ion micrograph of boron-decorated boundary in Ni 3 Al. Bright spots on boundary were identified 
as individual boron atoms using single atom identification technique of the atom probe (from Miller [1987]). 


not only is the more common form, but also has been studied more comprehensively. 

The free energies of the sites for equilibrium segregation may be represented as shown in 
fig. 12, where each site has both enthalpy and entropy contributions. In the dilute limit of 
element A segregating in a matrix of element B, one may consider the relative concentrations 
C,(A) and C 2 (A), at different sites, 1 and 2, to obey the Maxwell-Boltzmann relation: 


CM) 

CM) 



( 8 ) 


where AG 12 is the free energy difference between sites 1 and 2. Although eq. (8) 
provides a feel for what may generally occur in dilute systems, it greatly oversimplifies 
the problem. In the sections that follow we consider the various adsorption theories for 
non-dilute systems, which have been established as direct analogues or developments of 
theories well known in the field of gas adsorption on the free surfaces of solids. The 
theories will be presented as a series with successively more complex segregation 
conditions. The free energy term giving rise to segregation will then be analyzed and 
finally the kinetics of the segregation process will be presented. 


4.2. The Langmuir-McLean theory 

The earliest derivation was made specifically for grain boundaries, (McLean [1957]), 
but is also valid for free surfaces. In this classical derivation, McLean proposed a model 
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Fig. 12. Free energy diagram for solutes segregated at surfaces and grain boundaries. 

of P solute atoms distributed at random amongst N lattice sites and p solute atoms 
distributed at random amongst n independent grain-boundary sites. The total free energy 
due to the solute atoms is then 

G = pe + PE- kT[ In n! N\- ln(n - p)\ p\(N - P)\ Pi], (9) 

where E and e are the energies of the solute atom in the lattice and in the grain bound- 
ary, respectively, and the final term is the configurational entropy for the arrangement of 
the solute atoms in the bulk and grain boundary. The equilibrium state of the system 
occurs at the minimum value of G. This may be determined by differentiating G with 
respect to p, noting that the sum of p and P remains constant. Corresponding to this min- 
imum, the grain-boundary analogue of Langmuir adsorption at free surfaces is obtained: 

X X f A.G\ 

x°-x = T^F exp r^J’ Langmuir_McLean ’ 0°) 

where the notation of Hondros and Seah [1977a] is now used: here X° is the fraction 
of the grain-boundary monolayer available for segregated atoms at saturation, X b is the 
actual fraction covered with segregant, X c is the bulk solute molar fraction and AG is the 
free energy of segregation per mole of solute. AG contains all the entropy terms such as 
vibrational and anharmonic, but not the configurational term which is written separately 
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in eq. (9). Segregation measurements obeying the form of eq. (10) have been reported for 
several systems at both free surfaces and grain boundaries. Figure 13, for example, re- 
produces the measurements by Kumar and Eyre [1980] which show that the Langmuir- 
McLean relation is obeyed for the segregation of 8-12 at. ppm of oxygen to grain 
boundaries in molybdenum over a wide temperature range. For convenience eq. (10) has 
been rearranged as follows to obtain a linear plot: X b is assumed to be unity and X c « 1 
so that: 


— = ln ? — ^4—. 
RT {l~X b )X c 


( 11 ) 


We recognize the basic features of segregation behaviour from this form: the element 
segregation increases as the bulk solute content, X c , increases or as the temperature, T, 
falls. In this simple model, solute atoms are not interacting at the grain boundary and 
never exceed the saturation segregation level of one monolayer. At this point, we require 
some way of estimating the value of AG for segregation to both grain boundaries and 
free surfaces. This is discussed in the next sections. 


4.2.1. Prediction of the free energy of segregation to grain boundaries 

The value of free energy of grain-boundary segregation, A G b , was estimated by 
McLean [1957] from the elastic strain energy, E* of the solute in the lattice, all of 
which is assumed to be released on segregation. For a solute atom of radius r, in a site 
of radius r 0 , the elastic energy associated with one mole of solute is given by (Wyn- 
blatt and Ku [1979]): 


24mVA'Gr 0 r 1 (r 1 - r 0 ) 2 
3A>, + 4Gr 0 


(12) 


where K is the solute bulk modulus, G is the solvent or matrix shear modulus and N is 
Avogadro’s number. Two examples of the values for segregants in copper computed by 
McLean [1957] are shown in table 1. The final column shows the experimental values. 

Values estimated by this approach are generally correct to within a factor of two. 
However, much higher accuracy is required for the quantitative prediction of segregation 
behaviour. This was provided in the treatment by Seah and Hondros [1973]. Using the 
(BET) gas adsorption theory of Brunauer et al. [1940], they write the solid-state 
analogue as 

f exp[^j, (Truncated BET). (13) 

where X®, the solid solubility, is the important parameter. For slightly soluble sub- 
stances, X° = exp(AG so /RT) so that the Langmuir-McLean and Truncated BET theories 
are in fact identical in the dilute limit with 


A G b = AG' + A G sol , 


(14) 
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Fig. 13. Langmuir-McLean plot for oxygen grain-boundary segregation measured by AES in molybdenum 
(after Kumar and Eyre [1980]). 

as shown in the energy level diagram of fig. 12. The importance of the Truncated BET 
form is that one may define as a useful parameter the enrichment ratio, fi h which repre- 
sents the ratio X,/X c in the dilute limit. Since X° generally turns out to be unity, we find: 

A = Jr where K = ex pJ~rj- (15) 

Figure 14 shows a compilation of measurements of grain-boundary enrichment ratios, fi h , 
(estimated by the grain-boundary energy and surface-analysis approaches), in relation to 
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Table 1 

Values of the elastic distortion energy, E,„ for Sb and Bi in Cu. 


Element 

K 

(GN/m 2 ) 

r i 

(nm) 

(r, - r 0 )/ r , 

E d 

(kJ/mol) 

AG, 

(kJ/mol) 

Sb 

41 

0.1528 

+ 0.194 

37 

65 

Bi 

31.4 

0.157 

+ 0.266 

66 

100 


the solubilities of the segregant at the measurement temperature. Thus the grain-boundary 
enrichment for systems in which no measurements are available may be accurately 
estimated from handbooks of phase diagrams (Massalski etal. [1986]). It is clear that, in 
the dilute limit, eq. (15) provides the theoretical description of the experimental data of 



Fig. 14. Correlation of measured grain-boundary enrichment ratios with the atomic solid solubility (after Seah 
[1980d]). 
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fig. 14 quite accurately. Indeed it is seen that whilst A G s0 , ranges from zero to -80 kJ/mol, 
AG' has a mean value of -10 kJ/mol with a standard deviation of only 6 kJ/mol. Thus, 


AG,, = A G sol - 10 ± 6 kJ/mol . (16) 

Equation (13) is valid only for X C <X°. If excess solute is present, so that a second phase 
appears, the solute content is limited to X° and eq. (13) becomes 



(17) 


The translation from eq. (13) to eq. (17) as the solubility limit is reduced through the 
solute level, is illustrated by the measurements for sulphur in iron by Seah and Hon- 
dros [1973], shown in fig. 15. Equation (13) is shown for the higher temperatures, 
where the solubility is high, and eq. (17) at the lower temperatures. 

Watanabe et al. [1980] proposed to refine the dependence of grain-boundary 
enrichment ratio on solubility limit by considering the effect of grain-boundary orien- 
tation (see also § 4.5) and suggested the idea of constructing a grain-boundary segrega- 
tion diagram as a three-dimensional function of (3 b in dependence on X° and on the 
misorientation angle of adjacent grains. The first experimental grain-boundary segregat- 
ion diagram was constructed by Hofmann and LejCek [1991] for Si, P and C segre- 
gation at [100] symmetrical tilt grain boundaries in a-Fe. In this diagram, however, the 


Sulphur at grain boundary, fj moles m ' 2 



Fig. 15. AES measurements of sulphur grain-boundary segregation in iron as a function of temperature for two 
bulk compositions (after Hondros and Seah [1977b]). 
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grain-boundary enrichment ratio was replaced by a more general and independent 
thermodynamic quantity, A H°. Recently, LejCek and Hofmann [1993] theoretically 
derived the dependence of the segregation enthalpy on both the grain-boundary orien- 
tation and the bulk solid solubility. It was shown that this dependence can be expressed 
as a sum of two mutually independent terms 

Atf°(<i>, X ? ) = AH* (d>) + vtffr In X°] (1 8) 

where A H*(3>) is the enthalpy of segregation of a solute with unlimited solid solubility 
(i.e., X° = 1) in a given matrix at a given interface of orientation <t>, and v is the 
parameter characteristic for the matrix element. Expression (18) was derived under two 
basic assumptions, first, 

T In a] = const (19) 

where a’ is the activity of a solute at the solid solubility limit, and second, 

«: = (20) 

which relates the activity a c ' and concentration X c ° of any solute at the solid solubility 
limit in a given matrix. In fact, eq. (18) represents an extension of the model of Seah 
and Hondros [1973] by taking additionally into account (i) the structural dependence of 
interfacial segregation (i.e., AH*(3>) * const), and (ii) the non-ideal behaviour of the 
system (i.e., v* 1). The grain-boundary segregation diagram for the [100] symmetrical tilt 
grain boundaries in a-Fe is shown in fig. 16. 

4.2.2. Prediction of the free energy of segregation to surfaces 

The theory for the free energy of surface segregation, A G„ has developed along less 
phenomenological lines than that for grain boundaries although both are merely special 
cases of one general effect. Each of the two approaches leads to a reasonably accurate 
prediction for the case to which it pertains. The reader may blend the two approaches to 
gain a greater insight into the nature of the contributions to the free energy, however, at 
the present time it does not lead to a greater accuracy in calculation. 

The treatment for free surfaces is based on the derivation of Williams and Nason 
[1974] who use a layer-by-layer model of the surface region, with solute A atoms 
replacing the solvent B atoms substitutionally at the surface. They find that this model 
gives the Langmuir-McLean adsorption eq. (10) for the fractional coverage of A on the 
surface, which allows the calculation of A G s . The basic model adopts the quasi- 
chemical approach, shown in fig. 17, where the nearest neighbour bond energies are 
given by e M , e BB and e AB between AA, BB and AB neighbours respectively. AG, is 
calculated as the energy released on removing an A atom from a lattice site and 
exchanging it for a B atom on the free surface. The energy required to remove A from 
the bulk is: 

-(Z ( +2Z V )[^+(1-X C > /U) ], (21) 
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e°[i00] 

Fig. 16. Grain-boundary segregalion diagram for [100] symmetrical tilt grain boundaries in tt-Fe showing the 
dependence of enthalpies of Si, P and C segregation on misorientadon angle 8 of both adjacent crystals and on 
the bulk solid solubility data (after Lej£:bk and Hofmann [1993]). 


where Z ; is the atomic coordination number in the layer and Z v is the atomic coordination 
to one of the adjacent layers. The energy to replace A at the surface is similarly: 

2,[X,b m + (1 - *>„] + Z v [X c e M + (1 - (22) 

Equivalent terms occur for the B atom moving in the reverse direction so that by using 
the regular solution approximation in which 

e AB = "2 ( fi AA "*■ e BB ) (23) 

we find that the free energy of surface segregation per atom, AG/N, is given by 

AGJN = \ Z v (e ss - e M ) + 2m{z ( (Z c - X,) + Z V (X C - i)}. (24) 

In Williams and Nason’s theory, successive layers away from the surface are considered 
and for nonzero values of (o the composition of the second and further layers can also 
differ from that of the bulk. For instance, if <o is negative, there is a perceptible 
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Fig. 17. Schematic substitutional model for the quasi-chemical segregation theory. 

oscillation of composition decaying away from the surface in addition to the surface 
segregation. Here, however, to simplify the analysis and because it has not been possible 
to observe such small oscillations in practice, it will be assumed that all of the com- 
positional variation occurs in the topmost atom layer. 

Numerical evaluation of eq. (24) may proceed in several ways. Following Wynblatt 
and Ku [1979] the bond strengths e M and e BB may be derived from the molar surface 
energies of the two constituents. At absolute zero these energies may be expressed as 
follows: 

7 A - ~ 2 7b = ~~2^v e BB^' ( 25 ) 

The regular solution parameter, co [eq. (23)], may then be calculated from tabulated 
values (Hultgren et al. [1973]) of the enthalpy of mixing, H m \ 

0, = Noj = H m ! ZX c (l - X,), (26) 

where Z is the total coordination number. Williams and Nason in their original paper 
evaluated e M and e BB from the molar sublimation enthalpies, H 51 ^, where 

H™ b = - -j Ze u N, (27) 

where i stands for A or B. The above terms now give some insight into the kinds of 
elements that will segregate to free surfaces. If the regular solution parameter can be 
ignored, it is seen from eqs. (24) and (25) that, other things being equal, the element 
with the lower surface energy enriches the surface of a binary system. Alternatively, 
from eqs. (24) and (27), this will also be the element of lower sublimation enthalpy. 
The above is particularly true for liquid binary metallic systems as shown by 
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Hondros [1980a]. However, for solids McLean’s strain term given in eq. (12) is also 
important. The strain term is only significant for excess-sized solute atoms. Tsai et al. 
[1977] point out that, as there is little relaxation around vacancies, the strain energy 
associated with r 0 <r 1 , must be much less than the converse. 

Using the above approach, Seah [1979] gained an overall view of surface segregation 
by combining eqs. (24) and (25) and adding the strain term of eq. (12). From what we 
have seen in §4.2.1, this term will be of the order of 2.6 x 10 35 x r a (r l - r 0 ) 2 . In fact, 
using regression analysis of published data, Seah found for dilute binary systems: 

-AG, = (y s B - -y*) + 1.8ft + M2.66 X 10 35 r 0 (r, - r 0 )\ (28) 

where M is unity for r, > r 0 and zero for r, < r 0 . The r values are the atom sizes given by 
pN(2rf=A, where p is the bulk density and A the atomic weight of the element. Figure 
18 shows the resulting correlation for an improved version of eq. (28), using the relation 
/3 S =X/X c = exp(-AG/RT), and the experimental data for the surface enrichment ratio. 
This analysis confirmed the correctness of simply adding the bond and strain energy 
terms directly and that the latter is significant only for r, > r 0 . Thus we may write: 

AG, = [bond term, eq. (24)] + Af[strain term, eq. (12)] 

t \ (29) 

- T (entropy terms discussed in § 4.2.4). 

Several other models have been developed to estimate thermodynamic parameters of 
surface segregation. On the basis of a simple tight-binding theory of surface segregation 
in alloys of transition metals, Mukherjee and Moran-L6pez [1987] proposed a method 
to predict surface segregation assuming a rectangular shape of the d-band density of 
states. This method enables the equilibrium surface concentration to be calculated as a 
function of temperature. Another technique for calculating the heat of segregation is 
based on the BFS model (Bozzolo et al. [1992]). The energy of segregation which is 
composed of two basic contributions, the strain term and the chemical term, can be 
obtained using a zero-temperature Monte Carlo calculation. As the input data, pure 
elemental parameters and only two alloy properties have to be used (Bozzolo et al. 
[1993]). In contrast to these two models, a phenomenological approach was used in 
modem thermodynamic calculation of interface properties (Mezey etal. [1990]). In this 
approach, the free energy of segregation of a solute I at a surface in a binary M-I alloy 
can be expressed as a sum of two terms, the dangling bond contribution: 


AGf ={y,-y M )A M , 

and the real mixture contribution: 

(30) 

AG," = qApf - Apl. 

(31) 


Here y i are the surface energies of pure elements i, q=A M /A„ A, are the partial molar 
areas of species i, and Ayxf are the excess chemical potentials of mixing related to the 
elements i. The total free energy of segregation, AG,- A G, d + A G, m , can be determined by 
means of individual factors evaluated from elemental data, and from experimental values of 
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Fig. 18. Correlation of measured surface enrichment ratios with the predicted values from eq. (28) (after Seah 
[1979]). 

interfacial free energies and molar enthalpies of mixing. It was also shown that this approach 
can describe anisotropy of surface segregation of Pt in Ni (Mezey and Hofer [1992]). 

4.2.3. Segregation with adsorbate-adsorbate interactions 

In the above we have concentrated on evaluating A G in the limit of dilute X b , X s and 
X c to compare the overall behaviour with experiment. To study the effect of higher levels 
of segregation we may rewrite eq. (24) simply: 

A G s = AG® - 2 Z,NaX„ (32) 

where AG® = NZ v (e BB -e AB ) does not depend on the segregation level. This equation is 
based on the substitutional segregation model shown in fig. 17, which is a good 
description for many free surfaces and grain boundaries. A completely different 
description, valid only for free surfaces, is the Fowler theory in which adsorbed atoms 
on top of the free surface have, simply, the interaction energy e M . In this case: 
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AG S = AG° + Z,Ne M X s . (33) 

This is seen to be the same as the result for eq. (32) if the top atom layer B atoms are 
replaced by vacancies. 

Using either of the above equations, if the presence of a segregated A atom enhances 
the probability of the next site being occupied by an A atom, A G becomes more negative 
as the segregation proceeds. As e M becomes more and more negative, the segregation 
shows progressively sharper rises as the temperature falls until eventually the rise in 
segregation becomes discontinuous at a temperature given by the solution of 

C = -4RT, (34) 

where C is the coefficient of X s in eqs. (32) and (33). Measurements illustrating this form 
of segregation are shown in fig. 19 for Se and Te segregation in pure iron. There is also 
shown the simple Langmuir-McLean case with C - 0 and a case in which a dis- 
continuous rise occurs from a low to a high segregation level. As seen from the labelling 
of the ordinate, the lowest segregations occur at low bulk solute contents, X c , and high 
temperatures, T. The Fowler type of discontinuous grain-boundary segregation with 
C=-5RT was observed by Menyhard et al. [1989] for Bi in Cu. 

4.2.4. Temperature dependence of the free energies of segregation 

The temperature dependence of segregation occurs mainly through the explicit 
temperature term in eq. (13). As the temperature rises the segregant “boils off’, leading 
to a lower value. However, the free energy itself is temperature dependent through the 
customary relation: 

AG = AH - 7AS (35) 

where AS is the entropy change on segregation to a grain boundary or free surface. Seah 
and Lea [1975] write AS as the sum of three terms, A S v , A S a and A S m . A S v is associated 
with the changes in the vibrational behaviour on segregation, A S a with the anharmonicity 
of the potential of the segregant atom site and A S m with site multiplicity. The vibrational 
entropy term is given, in the Einstein model, as 

S v = 3fl[l + \n(kT/hv)\ (36) 

where kT»hv , v being the Einstein frequency, hv may be rewritten kd E or 0.115k0 D 
where d E and d D are the Einstein and Debye temperatures (Kittel [1986]) so that 

AS V =3R\n(0 D /0' D \ (37) 

where 0 D and 0* are the Debye temperatures for the solute atom in the matrix and at the 
distorted site of the grain boundary or free surface. Owing to the greater vibrational 
amplitude of atoms at surfaces, 0‘ D /6 D has been calculated for tin surface segregation in 
iron to be approximately 2/3. In this way, taking into account the matrix atom exchang- 
ing places with the segregant atom, A S v is about 3.5 R for the free surface and 3.3 R at 
grain boundaries. 
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Fig. 19. Grain-boundary Fowler isotherms with the AES measurements of Pichard etal. (after Hondros and 
Seah [1977a]). 


In a similar manner, the two further entropy contributions, A S a , due to the anharmon- 
icity of the atomic potential at the termination of a lattice, either by the surface or by a 
grain boundary (Ewing [1971]), and A S m , due to the possible site multiplicity for the 
segregating atoms, may be calculated. Both terms are much smaller than A S t „ and so will 
not be treated here. 

The correlation of the above full prediction of the temperature dependence of AG 
with the experimental data for tin in iron is shown in fig. 20 for both surface and grain- 
boundary segregations. This figure also serves to emphasize the generally observed fact 
that surface segregation at moderate temperatures is generally much greater than 
segregation at grain boundaries. 
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Fig. 20. The predicted temperature dependences of the free energies of surface- and grain-boundary segrega- 
tions of tin in iron, together with the experimental data at 823 K (550°C) and 1693 K (1420°C) (after Seah and 
Lea [1975]). 

4.3. Segregation in simple ternary systems: site competition 

Site competition in a ternary system occurs where there are two segregants which can 
fill the same sites at the free surface or grain boundary. Each segregant obeys the 
appropriate adsorption theory with the available empty sites reduced by the segregation 
of the other species. At saturation segregation, this leads to the situation where the sum 
of the segregants is constant at one monolayer. This has been observed on the free 
surface of iron for Sn, Sb and O in competition with S. Generally the Sn, Sb or O 
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segregate rapidly to an equilibrium level but, with time at temperature, S (which may be 
present at only a few ppm in solution), will replace much of the Sn, Sb and O. Such a 
competition is not generally expected at grain boundaries since the bond energy terms 
can continue to be reduced until two total monolayers are segregated, and this latter level 
of segregation is rarely reached. Grain-boundary site competition between N and S has 
been reported (Tauber and Grabke [1978]) in iron but not under all conditions (Jones 
etal. [1981]). More importantly, P is displaced by C (Erhart and Grabke [1981]). This 
is of great significance in steels, as discussed in § 6.3. 

4.4. Segregation in complex metallurgical systems 

In 1975 Guttmann extended the Fowler theory to allow for coupling interactions 
between two co-segregating species in multicomponent systems. This development, vital 
to explain the segregation behaviour resulting in intergranular failures in engineering 
materials, gives the analogue to the Langmuir-McLean equation. 

= X ci exp(-A GJRT) 

l + X; =1 ^[exp(-AG,/J?r)-l]’ 

where and X b2 are the molar fractional monolayers segregated by a noxious impurity 
and by an alloying element of bulk contents X cl and *c2 respectively. The AG’s are given 
by equations of the form of eqs. (32) or (33) but depend not on the level of self- 
segregation but on each other through the equations: 

AG, = AG,° - 2C 13 X m + a' l2 X b2 , A G 2 = AG° - 2C 23 X M + a' l2 X kv (39) 

where AG, and A G° are the free energies of segregation of the impurity and alloying 
elements separately in the matrix, and C 13 and £-23 are the respective Fowler-type 
interaction coefficients (cf. eq. (32)). The interaction coefficient a' 2 refers to the changes 
in nearest neighbour bond energies in forming the alloy-impurity bonds, 

**12 — ,2 — £,1 — E 21 )’ ( 40 ) 

and can be obtained from measurements of the effects of the alloying elements on the 
solubilities of the impurities (Guttmann [1976]). The stronger the reduction in 
solubility, the stronger the coupling of the alloy element to the impurity and the more the 
alloying addition enhances the impurity segregation. 

In a similar manner to that of the Fowler isotherm, as a' 2 becomes more and more 
negative the curves exhibit a more pronounced S shape and eventually attain a region in 
which the segregation shows a discontinuous increase from a low to a high level. The general 
effect of Guttmann’s coupling can be seen by reference to fig. 14. Alloy element additions 
with a negative value of a' 2 cause the impurity element solubility to fall. Simultaneously, 
through eqs. (38) and (39), the segregation level rises. Thus, the addition of the alloying 
element causes the point for the original binary system in fig. 14 to move diagonally upwards 
to the right but to stay within the general correlation (Guttmann and McLean [1979]). 

The general behaviour of this model is now well accepted and allows an accurate 
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description of some of the behaviour observed in embrittlement problems. The increase 
in the free energy of segregation of one species by a second, described in eq. (39), has 
been validated in careful experiments on grain-boundary segregation of Sb and Ni in 
steel by Gas et al. [1982], Although high -a' 2 values lead to a high enrichment ratio, 
the result in practice often will be a low segregation. This is because the reduction in 
solubility and hence the number of solute atoms available to segregate make the rate of 
build-up of the segregant at the surface or grain boundary very slow indeed so that, with 
practical heat treatments, a negligible segregation occurs. This is precisely the case, for 
example, with the addition of manganese to remove the sulphur embrittlement in steels. 
The problem of the kinetics of segregation is very important in practice and this is 
considered in the next section. 

Guttmann’s original theory was modified by Seah [1977] to remove the site 
competition between alloy elements and impurities which had not been observed in 
practice. In eq. (38) the sum over the j terms is simply replaced by the t ,h term. This 
simplifies the calculations and gives a better description of the experimental results as 
discussed in §6.3. Several more complex theories are detailed by Guttmann and 
McLean [1979] and by Guttmann [1980]. 

Solute segregation in multicomponent systems can alternatively be described in terms 
of chemical equilibrium (Briant [1990]). In this approach, the reversible chemical 
reaction r between two segregating elements I and J at the boundary (b) and in bulk (c), 

/ w + / (i,) =/ (i,) + J {c \ (41) 


can be characterized by an equilibrium reaction constant K r 


jr _ a bi a cj 
a cl a bJ 



(42) 


where a bi and a ci are the activities of individual solutes at the boundary and in the bulk, 
respectively. In contrast to the Guttmann’s approach in which the solute interactions 
modify the value of segregation free energy (eqs. (38) and (39); Guttmann [1975]), the 
value of A G r is constant for the reaction r at a given constant temperature in the Briant’s 
approach (eq. (42)) and only the changes in bulk activities and solid solubilities are 
responsible for segregation behaviour of any element in the system, i.e., no solute atom 
interactions are considered. By means of the latter model, Briant [1988a] interpreted the 
grain-boundary segregation of Sb in a-Fe enhanced by additions of Ni as observed by 
Gas et al. [1982], by increasing the bulk solid activity of Sb in a-Fe by Ni. Strictly 
speaking, both Guttmann’s and Briant’s approaches lead to rather similar expression 
(compare eqs. (38) and (39) with eq. (42)), supposing the activity a, to be expressed as 
the product of concentration X , and activity coefficient f, and the product of activity 
coefficients to be represented by combinations of interaction coefficients in eq. (39). 


4.5. Anisotropy of segregation 

In the above we considered that all sites are equivalent prior to segregation. This is 
generally not the case either at free surfaces or at grain boundaries. The simplest case to 
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understand is that of free surfaces. The driving force for segregation is the lowering of 
the overall system free energy. If the surface energy is already low as in the case of the 
low-index surfaces (111), (110) and (100), we should expect that the segregation here 
would be lower than that for a high index surface, and lowest for (111) surfaces in an 
fee lattice (Erlewein and Hofmann [1977]). Little direct work has been reported, but 
measurements by Johnson etal. [1978] for Au segregation on Ni, by Zhou etal. [1981] for 
C and Sn segregation on Fe, and by Suzuki and Oku [1990] for Cr segregation on Fe 
generally confirm this although Si segregation at Fe (111) was detected to be rather high 
(Zhou etal. [1981], Suzuki and Oku [1990]). Considerably more work has, however, been 
completed on the influences of the anisotropy of surface segregation on the anisotropy of 
surface energy, because of the technological significance, and this is discussed in § 5.1. 

4.5.1. Segregation at symmetrical grain boundaries 

In contrast to free surfaces which can fully be described by the orientation of surface 
normal (i.e., two degrees of freedom in a chosen reference system), five variables are 
necessary for a complete characterization of a grain boundary: Three of them determine 
misorientation of lattices of the two adjacent grains (rotation axis and misorientation 
angle) and two of them describe the orientation of the boundary plane (Wolf [1992]). 
Therefore, the orientation dependence of grain-boundary segregation cannot be displayed 
in the same simple manner as that for surface segregation. In principle, the complete 
anisotropy of grain-boundary segregation for a binary system should be constructed in an 
eight-dimensional space as an interdependence of interfacial composition (or grain- 
boundary enrichment ratio) on five grain-boundary parameters, and on temperature and 
pressure (Seidman [1992]). To simplify the problem, orientation dependences of grain- 
boundary segregation are displayed as plots of grain-boundary composition vs. one grain- 
boundary variable keeping the other four ones constant, in addition to constant tempera- 
ture and pressure. In this way, solute segregation was found to increase with increasing 
misorientation angle for both tilt (e.g., Biscondi [1982]) and twist (e.g., Seidman 
[1991]) low-angle grain boundaries. Among the high-angle grain boundaries, two types 
of interfaces can be distinguished: the majority of general grain boundaries with 
relatively high values of (} b , and a few special grain boundaries characterized by a low 
enrichment factor in comparison to the general ones (cf. fig. 21). From the viewpoint of 
atomic structure, the special grain boundaries are equivalent to so-called favoured or 
delimiting ones, the structures of which are created by single structural units whereas the 
structures of the general (or non-favoured) ones consist of a combination of structural 
units of the two neighbouring favoured grain boundaries (Fischmeister [1985], see also 
ch. 9, § 2.2). To characterize the special grain boundaries, various bulk geometric parameters 
are used. One of the most widespread geometric characterizations of grain boundaries is the 
reciprocal density of coincidence sites 2 which can be determined for cubic crystals by 

% = d(h 2 + k 2 + l l ) (43) 

where h, k and / are the Miller indices of the symmetrical grain boundary corresponding 
to the given misorientation 6 , and 5 = 1 if h 2 + lc 2 + ( 2 is odd and 5 =0.5 if tf + kf + l 2 is 
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Fig. 21. Orientation dependence of the Re enrichment factor for (Oil) twist grain boundaries of a W-25at%Re 
alloy (after Seidman [1991]). 


even. Other geometric parameters are the inverse planar coincidence site density T 
defined as the ratio of the unit-cell area A n and the smallest possible area per atom A l 

T = (44) 

and the interplanar spacing d(hkl) defined for cubic lattices by 


d(hkl) = 


ea 

V/t 2 +k 2 + 1 2 


(45) 


where a is the lattice parameter and e has the value of either 0.5 or 1 depending on the 
particular combination of odd and even Miller indices and on the type of the cubic lattice 
(Wolf [1992]). Usually, it is accepted that special grain boundaries possess low values 
of 2 and r, or high values of d(hkl)\ however, because these parameters are bulk and 
geometric characteristics, there is no one-to-one correlation between them and the 
structure of grain boundaries. Therefore, these parameters can serve as rough estimates 
only (Sutton and Balluffi [1987]). 

Owing to the great difference in the behaviour of special and general grain bound- 
aries and because of the great importance of grain-boundary properties in the complex 
behaviour of a whole polycrystal, a large effort has been spent in the last two decades to 
learn the character of all possible special grain boundaries, their atomic structures and 
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properties. As regard grain-boundary segregation, low values of /3 fc were found at 2 = 11, 
50.5° [110], {113} and 2 =3, 70.5° [110], [112] symmetrical tilt grain boundaries in a 
Mo-O system (Biscondi [1982]) or at the 2 =3, 70.5° (Oil) twist grain boundary in a 
W-Re alloy (fig. 21; Seidman [1991]). A low tendency to P, C and Si segregation was 
detected for 2 = 13, 22.5° [100], [015], 2 =5, 36.9° [100], {013} and 2 =5, 53.1° [100], 
[012] symmetrical tilt grain boundaries in an Fe-Si alloy (Hofmann et al. [1992]). In 
this case, grain-boundary segregation was represented by segregation enthalpies A H° of 
individual elements as shown in fig. 16. This parameter is more general than the 
enrichment ratio for example, it is independent of temperature and bulk composition 
of the alloy. This is very important in the case of complex multicomponent systems in 
which site competition and solute interaction can modify the grain-boundary composition 
to such an extent that fi b loses the ability to characterize grain-boundary segregation in 
a binary system (see also §§4.3. and 4.4). The above mentioned interfaces represent 
special grain boundaries selected on the basis of geometric classification of symmetrical 
tilt grain boundaries (Paidar [1987]). 

4.5.2. Segregation at asymmetrical grain boundaries 

For asymmetrical grain boundaries, the relative orientations of both crystals to the 
grain-boundary plane have to be considered and therefore their description is more 
complex. In a neat study by Suzuki et al. [1981] of phosphorus grain-boundary 
segregation in iron, they show that after fracture, the level of phosphorus observed may 
be associated with the Miller indices of the fracture surface, as shown in fig. 22a. The 
measurements of the segregation in relation to the stereographic triangle are shown in 
fig. 22b. It is clear that the lower segregations are associated with the lower index poles 
since, as one may expect, the boundaries involving these surfaces are more ordered than 
the general high-index boundaries. The preponderant role of the orientation of the grain- 
boundary plane in solute segregation was also experimentally observed for sulphur in 
nickel. No segregation effects were detected at symmetrical [111] and [112] facets of 
a 2 = 3 twin boundary but some sulphur enrichment was observed at asymmetrical ones 
such as, for instance, (11 1)/(1 13) (Bouchet and Priester [1986, 1987]). Similarly, a 
symmetrical {113} grain boundary did not exhibit any S segregation in a nearly 2 = 11 
Ni bicrystal whereas asymmetrical boundary facets with orientations close to e.g. 
(112)/ (116) and (ll4)/(225) (i.e. with a deviation of the order of ~2° from ideal 2 = 11) 
were found to be enriched with sulphur (Swiatnicki et al. [1990]). In contrast to the 
{113}, the other 2 = 11 symmetrical grain boundary {332} was found to be very unstable 
and to split into numerous asymmetrical parts considerably enriched with sulphur (Juhas 
et al. [1994]). The conclusions of Suzuki et al. [1981] were directly supported by 
systematic measurements of grain-boundary segregation in carefully prepared 2=5, 36.9° 
[100] tilt bicrystals of an a-Fe-Si alloy. Low tendency to Si, P and C segregation (low 
absolute values of segregation enthalpy) was observed at the boundaries created by at 
least one low-index plane, i.e., symmetrical {013} and {012}, and asymmetrical 
(001)/ (034) and (017)/ (011) ones. On the other hand, high solute enrichment was 
detected at the high-index asymmetrical boundaries (01 8)/ (047) and (03 11)/(097) 
(Lej/ek et al. [1992]). In addition, different amounts of segregants were found at both 
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Fig. 22. The anisotropy of phosphorus grain-boundary segregation in Fe-l%P showing (a) the segregant atoms 
(dark squares) in relation to the fracture process and (b) the level of segregation in relation to the relative 
orientation of each grain to the grain boundary (after Suzuki etal. [1981]). 


halves of asymmetrical grain boundaries, suggesting an asymmetry of solute segregation 
in the boundary core (LejCek [1994]). A good correlation exists between the segregation 
enthalpy and the effective interplanar spacing d e £. This form of spacing was introduced 
by D. Wolf for boundaries where two well defined planes join, e.g., (001) A /(011) B . d eff 
is then given by d ej /a = (d A + d B )/2. The boundaries characterized by high values of d eff 
possess low absolute values of segregation enthalpy for a particular solute as is demon- 
strated in fig. 23 (Hofmann and LejCek [1991], LejCek and Hofmann [1995]). 

4.53. Computer simulation of grain-boundary segregation 

Measurements of chemical composition of individual grain boundaries in well- 
characterized bicrystals by AES represent a substantial step forward in understanding the 
anisotropy of grain-boundary segregation. However, the information provided by such 
measurements is still averaged over a relatively large interfacial area (at present, the 
lateral resolution limit of AES is about 20 nm) as compared to individual boundary sites 
which may possess different ability to solute adsorption. Due to the present limitations 
of experimental techniques (although methods such as high-resolution transmission 
electron microscopy, HREM, combined with electron-energy loss spectroscopy, or AP 
FIM, approach atomic resolution), the only method of evaluating solute segregation at 
individual boundary sites remains computer modeling. 

Computer modeling has been successfully applied to the determination of atomic 
structure of grain boundaries in pure solids and to the development of the structural unit 
model (cf. ch. 9, § 2.2.7). Later, techniques such as molecular statics and dynamics 
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d ef/ a 


Fig. 23. Plot of segregation enthalpy A/fJ of carbon (□), phosphorus (A), and silicon (o) in a-iron vs. effective 
interpianar spacing d tJf normalized by the lattice parameter a, for some symmetrica] (open symbols) and 
asymmetrical (solid symbols) tilt grain boundaries (after LejCek and Hofmann [1995]). 


or Monte Carlo calculations using many-body potentials of either the embedded-atom or 
Finnis-Sinclair types, and molecular orbital models, were extended to theoretically 
simulate equilibrium atomic structures of segregated grain boundaries. 

Computer modeling of the structure of the 2 =5, 36.9° [100] {012} grain boundary 
in Cu-Bi system revealed large differences in the segregation energy for particular sites. 
Because Bi tends to occupy the sites of the highest segregation energy, a two-dimensio- 
nal (2-D) ordered structure occurs in the boundary layer where each Bi atom is sur- 
rounded by Cu atoms and vice versa (Vitek and Wang [1982]). The values of segregat- 
ion energy of phosphorus and boron at {013} and {114} grain boundaries in a-Fe also 
show a high site sensitivity; however, their values calculated by Hashimoto etal. [1984] 
are much higher than those expected from experiments. More realistic values of the 
segregation energy of Mg, Al, Si, P, S and Cl at the sites at the exact plane of a 2 = 5, 
{013} symmetrical tilt grain boundary in a-Fe were calculated using a tight-binding type 
electronic theory of s-, p-, and d-basis orbitals by Masuda-Jindo [1986]. Segregation 
energies of P and Si obtained in this way (-14.6 kJ/mol and -9.8 kJ/mol, respectively) 
are in excellent agreement with the values of segregation enthalpy of these elements 
experimentally determined at the same grain boundary by LejCek and Hofmann [1991] 
(-13.3 kJ/mol and -8.5 kJ/mol, respectively). This agreement suggests that both these 
solutes segregate in a dilute Fe-Si(P) system at a specific type of sites at the boundary 
at all temperatures and therefore are characterized by specific segregation enthalpies. 
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A very efficient and straightforward approach is based upon a point approximation 
for the configurational entropy, an Einstein model for vibrational contributions to the free 
energy with respect to atomic coordinates and composition of each site (Najafabadi et 
al. [1991]). The approach was applied to the study of Cu segregation at 2 = 5, (001) twist 
boundary in Ni-Cu alloys and revealed an oscillatory depth distribution of Cu in the 
equiatomic alloy (Wang etal. [1992]). 

Computer modeling can also reveal an orientation dependence of grain-boundary 
segregation as is demonstrated in fig. 24 for the example of Au segregation at (001) twist 
grain boundaries of Pt. Absolute values of segregation enthalpy were found to increase 
with increasing rotation angle 0 from 1.1 kJ/mol for 8 = 5° to 11.7 U/mol for 8 =36.9° 
(2 = 5). The average grain-boundary enrichment ratio can be expressed by 

ft(T,0) = l + m(r)sin(0/2) (46) 

where the slope m{T) depends only on temperature (Seki etal. [1991]). 

There are numbers of experimental studies on grain-boundary segregation on the one 
hand and computer simulations of segregated boundaries on the other hand. Nevertheless, 
there is still a lack of direct correlations between these two approaches. The first attempt 
to make a direct comparison of HREM observation of a segregated grain boundary and 
its computer-simulated structure was performed very recently to interpret the atomic 
structure of the 2=3, (1 11)/ (1 1 1) grain boundary in the Cu-Bi system (Luzzi [1991]). 
In a non-segregated state, this boundary is the symmetrical coherent twin boundary with 
the stacking 

ABCABCBACBA {Al) 

where A, B and C represent individual [111] atomic planes in an fee lattice and£ 
denotes the boundary layer. Usually, this boundary stays intact during solute segregation. 
However, Bi segregates at other grain boundaries and its segregation at non-coherent twin and 
2=9 boundaries induces their transformation to (1 1 1)/(1 1 1) facets in which Bi atoms remain 
involved. The Bi-containing grain boundaries can be represented in the same way as the pure 
boundary but in the boundary layer C one third of the Cu atoms is replaced by hexagonally 
arranged Bi atoms. However, the Bi atoms are centered outside this atomic plane so that 
the structure of this boundary can be represented more appropriately by 

ABCAB C' - C(Bi) BACBA (48) 

where C denotes a plane of Cu atoms in which one third of atoms is replaced by 
hexagonally arranged vacancies, and C(Bi) the plane of Bi atoms positioned above these 
vacancies. Due to the splitting of the [111] boundary plane C into two planes C and 
C(Bi) containing Cu and Bi, respectively, this boundary is not symmetrical in the sense 
of mirror symmetry across the geometrical boundary plane (Yan and etal. [1993]). 

4.5.4. Correlation between grain-boundary and free-surface segregation 

Owing to the similar nature of solute segregation to grain boundaries and to free 
surfaces, measurements of free-surface segregation have often been used to estimate 
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Fig. 24. Orientation dependence of the Gibbsian interfacial excess P Au at (001) twist grain boundaries in an 
Pt-lat%Au alloy at 1900 K (O), 1500 K (□), 1300 K (A), 1000 K (•) and 900 K (■) (after Seki etal. [1991]). 


solute segregation at buried grain boundaries qualitatively or semiquantitatively. Because 
of the similar nature of solute segregation to both types of interfaces, the same formalism 
is generally used for a thermodynamic description of solute segregation. Since a high- 
angle grain boundary can roughly be considered as consisting of a complicated mixture 
of free surfaces and bulk binding states, the average free energy of general grain 
boundaries is about 1/3-1/2 of that for the free surface. Because solute segregation is 
associated with a lowering of the interfacial free energy, it is clear that grain-boundary 
segregation enthalpies are generally lower than those of surface segregation for the same 
system (Hondros and Seah [1977b]). On the basis of this similarity, Seah and Lea 
[1975] proposed a method of quantification of grain-boundary segregation in a binary 
system. This method consists of the construction of a conversion diagram from the 
temperature dependence of surface segregation and a single measurement of grain- 
boundary segregation at suitable temperatures (see fig. 20, § 4.2.4.). 

However, significant differences in segregation behaviour of the two types of 
interfaces can occur in three- or multicomponent systems resulting from site competition 
and solute interaction. For example, boron segregates significantly at grain boundaries in 
Ni 3 Al whereas no sulphur is observed there. On the other hand, extensive segregation of 
sulphur but low segregation of boron occur at its free surfaces (White etal. [1984]). 

Qualitatively more pronounced differences between surface and grain-boundary 
compositions can be observed in systems with strong attractive interaction of two 
segregants resulting in the formation of two-dimensional surface compounds. In an 
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Fe-6at%Si alloy containing P, S, N, and C as trace impurities, cosegregation of Si and 
N results in a stable Si surface compound at the (013) free surface thus preventing 
significant segregations of other elements. In contrast, enrichment of P and C, and 
depletion of Si characterize the segregation behaviour at the {013} grain boundary while 
N was practically not detected there. In the denitridized and decarburized alloy, phos- 
phorus enrichment and silicon depletion were again observed at the grain boundary but 
pronounced sulphur segregation, accompanied by weaker phosphorus and silicon 
enrichments, dominates at the free surface (LejCek etal. [1993]). From a phenomenolog- 
ical point of view these qualitative differences may be considered as a generalized 
anisotropy of interfacial segregation. 


4 . 6 . The kinetics of segregation 


Most models of the kinetics of segregation follow McLean's [1957] approach. Solute 
atoms are assumed to segregate to a grain boundary from two infinite half crystals of 
uniform solute content or to a surface from one infinite half crystal. Diffusion in the 
crystals is described by Fick’s laws and the ratio of the solute in the grain boundary to 
that in the adjacent atom layer of the bulk is given by the enrichment ratio, /3, assumed 
to be constant and independent of the actual values of X b and X c . The kinetics of the 
segregation are thus described by: 


X h (t)-X b (0) 

x„H-x b ( o) 


1 - exp 


FDt 

P 2 f 


erfc 


~ FDt 

P 2 f 


(49) 


where F=4 for grain boundaries and 1 for the free surface, X b (t) is the boundary content 
at time t, D is the solute bulk diffusivity, and /is related to the atom sizes b and a of the 
solute and matrix, respectively, by f=a 3 b~ 2 . For short times, eq. (49) approximates to 


X b (t)-X b (0) _2 b 2 

X„H-X b ( 0 ) Pa 3 


FDt 


7 T 


(50) 


Equations (49) and (50) are, in fact, limiting extremes of a general problem. In practice 
P is only constant for dilute systems with low segregation levels. As segregation 
proceeds ft generally falls as a result of saturation in X b . If P starts high and falls rapidly 
as the segregation saturates, eq. (50) is valid up to saturation (Lea and Seah [1977]). A 
detailed analysis for the saturation occurring in the Langmuir-McLean adsorption theory 
has been presented by Rowlands and Woodruff [1979]. Their analysis, re-interpreted 
in fig. 25, shows how the time dependence of the segregation changes from eq. (49) to 
eq. (50) as the final equilibrium segregation level X b (°°) approaches the saturation level 
X b . Calculations for particular systems in relation to temper brittleness are given in § 6.3. 
Measurements of X b (t) at surfaces by AES at different temperatures allow the determin- 
ation of the bulk diffusion constant through eq. (50), as shown by Hofmann and 
Erlewein [1976] for Sn in Cu. 

The more complex problem of the kinetics of segregation in ternary systems has also 
been studied by Tyson [1978], Tyson uses McLean’s approach but incorporates Gutt- 
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Fig. 25. The kinetics of segregation in binary systems for varying degrees of saturation in the final equilibrium 
level (after Rowlands and Woodruff [1979]). 


mann’s theory to define the grain-boundary concentration. For a ternary system with a 
dilute impurity and a non-dilute alloying element he finds that eq. (50) is often a good de- 
scription if the final segregation level is near saturation and similarly eq. (49) is valid if 
the final level is low. However, for certain critical cases, in which the two interacting spe- 
cies have similar values of Z)/3“ 2 , the segregation curves exhibit an intermediate plateau. 

The effect of site competition and repulsive interaction between solutes on segregat- 
ion kinetics in multicomponent alloys was analyzed by Militzer and Wieting [1986]. 
This effect results in a replacement of a quickly segregating element by another, slowly 
diffusing but more surface active element. As a result, a non-monotonic course of 
segregation kinetics appears. Under strong repulsive interaction, the replacement can 
reach such an extent that an interface miscibility gap occurs. The time necessary for 
nucleation of new phases is generally only very short and does not delay the kinetics of 
segregation (Militzer and Wieting [1987]). 

The linear relationship between X h and t W2 (eq. (50)) was derived assuming the bulk 
diffusion to be a controlling mechanism of segregation kinetics. In some cases, however, 
deviations from this behaviour have been observed. The diffusion can be accelerated e.g. 
via pipe diffusion. Then, the kinetics of interfacial segregation is composed of several 
contributions, e.g., the bulk diffusion term, X v (t), the pipe diffusion term, X p (t), and the 
term reflecting the enrichment of dislocations, X pv (t), 

x*(/) = x,(0 + *,(») + *,,(')■ (51) 
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In this way, Militzer et al. [1992] explained a complex segregation behaviour of 
sulphur at the (001) surface of an Fe-Si alloy. 

The process of solute segregation generates composition changes in the vicinity of the 
interface. The generally encountered situation is illustrated in fig. 26, where this in-depth 
distribution was calculated for surface segregation of Sn in Cu for different annealing 
times. The course of the depicted curves was experimentally confirmed by sputter depth 
profiling (Hofmann and Erlewein [1978]). 

The kinetics of desegregation have, in the past, been considered as much faster than 
the kinetics of segregation. This should not be so. The relationships are as given in eqs. 
(49) and (50), with X fc (0) and X b (o°) transposed (duPlessis [1990]). 

4.7. Non-equilibrium segregation 

The above segregation process leads to a well defined equilibrium value of the 
segregation as time approaches infinity if the temperature and constituents are kept 
constant. This equilibrium is approached at a rate defined by the relevant diffusion 
processes as discussed in the previous section. 

In non-equilibrium segregation phenomena, the enrichment disappears as time 
approaches infinity but it may be very significant at intermediate times. These non- 
equilibrium segregations arise through a number of discrete processes but all depend 
ultimately on the fact that changes in the materials environment cause the chemical 
potential of one the constituents to vary from place to place at a given concentration. 
This constituent then redistributes at a rate governed by diffusion theory, concentrating 
in certain regions in order to attain a constant chemical potential everywhere. In the 
meantime the spatial dependence of the chemical potential generally weakens also at a 
rate governed by diffusion theory, so that the constituent, having segregated, disperses 
again to become homogeneous throughout the material. This type of segregation is thus 
transient and is termed non-equilibrium but, in fact, includes many different separate 
mechanisms. (See also ch. 7, § 8.3.2). Another important point is that each of these 
mechanisms leads to segregation in a zone of considerably greater width around the 
appropriate interface than occurred with the equilibrium mechanism discussed earlier. 
Depending on the mechanism, this zone may range from nm to fim. It should now be 
recognized that, having established a zone of enrichment, equilibrium partitioning 
between this zone and the interface will inevitably occur so that the real situation will 
incorporate contributions from both equilibrium and non-equilibrium segregations. Below, 
the non-equilibrium aspects are discussed but the above caveat should be borne in mind. 

4.7.1. Solute pile-up at growing precipitates 

An early form of non-equilibrium segregation concerns the solute pile-up at the face 
of a growing carbide in steels. Owing to their low solubility in carbides, certain solutes 
such as Sn and Sb will be rejected as the carbide grows and will pile up on the matrix 
side of the matrix/precipitate interface. Immediately this happens, the excess atoms start 
to diffuse into the matrix. Thus, if the carbide grew a constant rate, the solute pile-up 
would grow asymptotically to a steady level. However, the carbide precipitate growth 
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Fig. 26. Calculated time dependence of concentration-depth distribution near the surface for Sn (X = 5 x 10 3 ) 
at a Cu (111) surface (after Hofmann and Erlewein [1978]). 


rapidly slows down and stops so that the pile-up disperses eventually to zero. In a 
practical situation, with the carbides growing as the material cools, the pile-up may be 
“frozen in”. Precise calculations are difficult since the extent of the pile-up has been 
shown, by Rellick and McMahon [1974], to be critically dependent on the cooling 
rate, the carbide growth rate and the solute partitioning between the carbide and the 
matrix. With large, rapidly grown carbides, enrichments as high as a factor of 100 may 
be encountered although much weaker effects are likely to be observed in practice. 

4.7 2 . Quench-induced segregation 

Another form of non-equilibrium segregation which also occurs at interfaces has been 
analyzed by Williams et al. [1976] on the basis of a model proposed by Aust and 
Westbrook [1965], In this model, cooling of the material causes the equilibrium 
vacancy concentration to fall so that vacancies flow to interfaces where they may be 
readily annihilated. Because solute atoms in the crystal lattice have an associated strain 
energy, as discussed in §3.2.1, which may be relieved by pairing the solute atom with a 
vacancy, the vacancy flow has an associated solute flow. In their calculations for boron 
segregation during the quenching of 316 steel (17% Cr, 11% Ni stainless steel) from solution 
temperatures in the range 1173-1633 K, Williams etal. write the vacancy concentration as: 

v = exp(-E / /w) (52) 

where £,(= 4) contains geometric and entropy terms and E p the vacancy formation energy 
is 1.4 eV (1 eV per atom = 96.5 kJ/mol). The concentration of boron-vacancy com- 
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plexes, v B , obeys a similar relation but also depends on v and the boron bulk con- 
centration, B, 

Vfl = B\k c exp( EjkT\ (53) 

where k c (= 12) contains geometric and entropy terms and E B , the vacancy-solute binding 
energy, is 0.5 eV. In this system a significant population of complexes occurs at 1623 K 
but not at 1173 K and, since the diffusivity of the boron-vacancy complexes is higher at 
all temperatures than that of the vacancies or the boron atoms, significant boron 
segregation from the bulk level of 90 at ppm could be observed by autoradiography, as 
shown in fig. 27, for the higher temperature quenches at 50 K/s. These results were 
confirmed later by Karlsson et al. [1988], who found in a similar material (17% 
Cr-13% Ni, 316L stainless steel) that the amount of segregated boron increases with 
increasing starting temperature and is larger for lower cooling rates (-10 K/s). It was 
also detected after the highest cooling rate 530 K/s. This was also confirmed by 
computer simulations (Karlsson [1988]). Non-equilibrium segregation is observed after 
cooling from temperatures above 1350 K whereas equilibrium segregation dominated 
after rapid cooling from 1073 K. Effects of non-equilibrium segregation seems to be 
independent for the majority of grain boundaries but no segregation was detected at 
coherent twin boundaries (Karlsson and Nord£n [1988]). To differentiate between 
equilibrium and non-equilibrium segregation at grain boundaries of a Ni-In alloy, Muschik 
et al. [1989] found AES depth profiles of the shape for intermediate time in fig. 26 to be 
indicative for non-equilibrium segregation above 1000-1100 K quenching temperatures. 



Fig. 27. Optical autoradiograph of austenitic 316 stainless steel containing 90 ppm boron solution heated at 
1623 K for 1/2 hour, furnace-cooled to the temperature T, in the range 1 173 to 1623 K, held for 1/2 hour and 
cooled at 50 K/s to room temperature. Strong boron non-equilibrium segregation is visible for the higher 
temperature quenches (after Williams etal. [1976]). 
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Detailed calculations for Cr segregation in 2^CrlMo steel by Doig and Flewitt 
[1981] show that, in that case, narrow distributions for Cr enrichment are obtained 
around the grain boundary for quench rates in excess of 2000 K/s with significant 
segregation occurring for quenches from temperatures in the range 1300-1400 K. 
Experiment and theory agree in indicating an excess Cr level of some 2% over a zone of 
5 nm on either side of the boundary. The limiting conditions for the quench-induced 
segregation are presented in fig. 28 as a function of the initial temperature before quench 
and the quench rate. At quench rates above line 1 there is insufficient time for the 
solute-vacancy pairs to diffuse an adequate distance, whereas at quench rates below line 
2 there is sufficient time for the solute to diffuse away from the segregation region and 
so outside these lines little segregation will be observed. Again, at initial temperatures 
below line 3 there is neither sufficient kinetics or sufficient vacancy population for the 
segregation to be significant. No high temperature limit is appropriate if the quench goes 
through the temperature range to line 3. Observations of significant quench-induced 
segregations are reviewed by Harries and Marwick [1980]. 



Log quench rate 


Fig. 28. Diagram of the limiting conditions for appreciable non-equilibrium segregation as a function of quench 
rate and temperature prior to quenching. 
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4.7.3. Stress-induced segregation 

A third form of non-equilibrium segregation occurs when the vacancy flux is not 
produced from a change in temperature but from an applied stress at constant temperat- 
ure. The vacancy flux is directed by the stress to relieve the strain and consequently it in 
turn directs excess solute atoms, by the vacancy-solute coupling discussed above, to 
grain boundaries parallel to the stress axis. The source of the solute atoms is thought to 
be the other grain boundaries rather than the lattice, since the activation energy for grain- 
boundary diffusion is much lower than that for lattice diffusion. In steels the problem is 
complicated by the stress-assisted dissolution and reprecipitation of grain-boundary 
carbides which produces the first non-equilibrium segregation mentioned above due to 
pile-up. Changes in grain-boundary phosphorus segregation as a function of tensile and 
compression stresses in 2{Cr steels have been observed and interpreted by Shinoda and 
Nakamura [1981] although, at present, the interpretative model is not fully complete. 

4.7.4. Segregation at moving grain boundaries 

It has been known for a long time that an atmosphere of solute atoms exerts a drag 
on a moving grain boundary (ch. 28, §3.4.1). However, up to now only very little is 
know on the reverse effect, i.e. on the non-equilibrium segregation of solutes at a moving 
grain boundary during recrystallization. This is despite the fact that the solute enrichment 
effects can be much stronger than those of equilibrium segregation. For example, a non- 
equilibrium segregation of boron was observed on moving grain boundaries in an Fe-3 
mass%Si alloy (fi b = 1.6) although no equilibrium segregation was detected at static 
boundaries under the same conditions (Zhang et al. [1994a]). Boron segregation at 
moving grain boundaries is intensified with continuing recrystallization and persists with 
the new boundaries moving forward. With decreased velocity of moving boundaries at 
the end of the recrystallization process boron enrichment at grain boundaries decreases 
and eventually disappears. Obviously, a property responsible for segregation changes due 
to grain-boundary movement is the width of the boundary. 

The non-equilibrium segregation is facilitated by increasing width of a moving 
interface. The broadening of the boundary results from its interaction with lattice 
dislocations disappearing there during recrystallization. The width S of the moving 
boundary can be expressed by: 

8 = d 0 + Vrd T kp (54) 

where d 0 is the width of the static boundary, V is the boundary velocity, r is the relaxation 
time for dislocation annihilation by moving grain boundary, d T is the increment of average 
width of unit area of the boundary during r, and A p is the difference of the dislocation 
density in deformed and new grains. The enrichment ratio at moving boundaries is then 

ft =l + |exp^-ij (55) 

where W is the width of grain-boundary area of increased solute concentration, and U 0 
is the potential of solute atoms near the grain boundary. According to eqs. (54) and (55), 
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one obtains fi b = 1.7 for boron segregation at moving grain boundaries in Fe-3mass. %Si 
during recrystallization at 1273 K after 20% deformation (Zhang etal. [1994b]). 

4.7.5. Radiation-induced segregation 

This topic is treated in chapter 7, §8.3.2 and chapter 18, §4.7. 

5. Segregation-related physicochemical properties 

The influence of the interfacial microchemical state on physical metallurgical 
properties, such as creep-rupture life and temper embrittlement, operates through 
associated effects on certain basic physicochemical properties which in turn determine 
the metallurgical behaviour. The main physicochemical properties which are influenced 
by segregation processes fall into three categories: interfacial energetics, kinetics and 
cohesion. These are considered in turn below. Wherever these basic interfacial para- 
meters are found to be involved with metallurgical phenomena, clearly these phenomena 
are influenced by segregation even though no explicit mention of the fact may occur in 
this chapter or, indeed, elsewhere. 

5.1. Interfacial energetics 

The close association between interfacial free energies and interfacial composition is 
discussed in § 2 in terms of the Gibbs Adsorption Theorem. We note that the thrust of 
much of the effort on the determination of interfacial free energies was originally 
intended to derive the values for interfacial composition. Both this thermodynamic 
approach and the new direct spectroscopy techniques can now be used interchangeably 
by way of parameters such as the interfacial activities or enrichment ratio, through which 
patterns of enrichment behaviour for a wide range of systems and for various interfaces 
may be compared and interrelated. 

However, there are situations where the treatment of a microstructural or metallur- 
gical problem requires a knowledge of the exact interfacial energy values relevant to the 
conditions which prevail. Examples of this are the nucleation of precipitates or second 
phases at grain boundaries, where a determining parameter is the actual interfacial free 
energy between the boundary plane and the second phase that is being nucleated. Again, 
in the study of the nucleation and stabilization of cavities formed along grain boundaries 
during creep, an important quantity is the value of the surface free energy pertaining to 
the cavity wall. The surface and grain-boundary free energies of most pure metals have 
been measured experimentally by the techniques discussed earlier and fair assessments 
have been made for others, on the basis of a knowledge of bulk thermochemical proper- 
ties with which empirical relationships exist. These have been summarized elsewhere 
(Hondros [1969, 1978]). We note, as expected from eqs. (25) and (27), that y^O.16 H sub 
at around 1500 K, where y s and H sub , the heat of sublimation, are measured per mole 
(Overbury etal. [1975]). At 0 K y s increases to 0.25 H SIlb , as expected (Tyson [1975]). 

In multicomponent crystalline solids, the main determinants of the interfacial energies 
are the presence of segregants, the temperature, and the crystalline orientation. The 
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evaluation of the interfacial free energy relevant to the modelling of a metallurgical event 
is difficult because of its sensitivity to the above parameters, in particular, adsorption of 
solute species. The direct experimental determination of that quantity is technically often 
very difficult and indirect. 

For binary dilute alloys, reasonable experimental data exist for several systems on the 
change in absolute surface free energies with increasing bulk content of the alloying 
element: these are summarized in fig. 29 (Hondros and McLean [1974]). The depres- 
sion in y s up to the limit of solid solubility is system dependent, however, the estimate 
usually adopted is about 40%. Furthermore, we note that the data were measured on 
polycrystalline samples and therefore they refer strictly to the average surface free 
energies. The anisotropy of surface energy is small for pure metals, but it may be quite 
significant for alloys in which surface enrichment is involved. 

In some cases, extremely low quantities of very surface-active elements in the vapour 
phase may produce significant effects on the interfacial free energies. Figure 30 shows 
as an example, the sensitivity of the surface free energy of Fe-3mass%Si at two 
temperatures to the potential of oxygen in the vapour phase. In order to obtain and hold 
such low values of oxygen potential, it is required to work with complex gas phase 
mixtures involving H 2 0 in H 2 . Furthermore, these experiments were conducted on foils 
showing a (100) texture so that the values refer substantially to this crystal plane. (See 
also ch. 28, §4.4.2). 

In a subsequent study aimed at providing fundamental data on the surface-energy- 
driven secondary recrystallization in sheet alloy, Mills et al. [1973] showed that the 
surface energies of crystals exposing the principal poles differed in a complex but 
significant manner as a function of temperature and oxygen adsorption, so that by 
manipulating these variables, each of the textures (111), (110) or (100) could be 
controllably grown. 

For the effect of temperature on the interfacial free energy, considering a single 
component isotropic solid, eq. (68), ch. 5 reduces to: 

— = (56) 

where S r is the interfacial entropy. Thus for pure elements, the temperature dependence 
of the interfacial free energy is negative. Theoretically, this is computed to be about -0.5 
mJ/m 2 K which is close to the value conventionally adopted from experimental data, 
-0.33 mJ/m 2 K (Hondros [1969]). 

In contrast to the above, fig. 30 clearly shows that in a system of more than one com- 
ponent the temperature dependence of the surface free energy can be positive. This is 
consistent with the thermodynamics of surfaces. By differentiating eq. (68), ch. 5 with 
respect to T, a complex expression may be obtained containing two terms which describe 
the temperature dependence of the interfacial free energy in a binary system. The first 
term, relating to entropy, is always negative, while the second term relates to adsorption 
and is positive. (The latter term refers to the effect of “boiling off’ of a segregate with 
increasing temperature, where it is expected that the amount of adsorption would decrease 
with increasing temperature, i.e., dF/dT is negative.) This shows that in a system of more 
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Fig. 29. Dependence of the surface free energy on bulk composition for a number of dilute systems: the host element is shown first in each pair. 
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Fig. 30. Dependence of the surface free energy of the (100) plane in Fe-3%Si on oxygen potential and 
temperature (after Hondros and Stuart [1968]). 

than one element, the interfacial energy may increase or decrease with increasing temper- 
ature, depending on the relative strengths of the two terms of entropy and adsorption. 

The above discussion indicates the complexities involved in assessing the precise 
interfacial free energy value for a given physicochemical situation. There are very few 
systems for which this quantity has been determined at the level of rigour shown in fig. 
30. For more complex interfaces, such as those between a second phase and the matrix, 
the experimental problems are even more severe. As a general guideline for binary 
systems, a knowledge of the amount of surface excess of an identified species from a 
know bulk concentration provides an estimate of the associated change in interfacial 
energy, through the application of the Gibbs Adsorption Theorem in a reversed manner. 

Where the form of the adsorption is known, an approach that has been used for both 
free surfaces and grain boundaries is to integrate the isotherm. Thus, if the Langmuir 
isotherm holds, this may be combined with the Gibbs Adsorption Theorem, which for a 
dilute system yields the equation for grain-boundary free energy: 



-14 -13 -12 -11 -10 -9 

log pO 2 (Pa) 
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Fig. 31. Surface self-diffusion in Cu, Au and Ag (solid curves) and the effects of adsorption of a number of 
species (broken curves) (after Delaware and Rhead [1971]). 


y b = yl + RTF° b In 



(57) 


where the quantities in T are defined as above and y° b is the grain-boundary energy of 
the pure metal. This approach was used by Petch [1956] in a similar form (but retaining 
the gas pressure term), in order to determine the effect of adsorbed hydrogen in lowering 
the surface energy of iron in a theory of hydrogen-induced fracture, and also by Seah 
[1980c], in a study of impurity-aided creep cavitation embrittlement. The enrichment 
quantity r t required for eq. (57) may be obtained directly by spectroscopic measurements 
or assessed through the enrichment factor >8* and its dependence on the inverse of the 
solid solubility (see eq. (15) and fig. 14). This yields a change in free energy Ay which 
in many modelling studies is often the quantity required: an absolute value of y for the 
appropriate conditions is assessed by referring to the standard state value for the energy 
of the pure component. 
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5.2. Surface and grain-boundary kinetics 

The free surfaces of solids offer rapid diffusion paths for the transport of matter. For 
example, the grooves that are formed at the intersection of grain boundaries with free 
surfaces when metals are held at high temperature (commonly referred to as “thermal 
etching” (see fig. 5)), owe their origin to the interchange of matter by surface diffusion. 
In general, this mechanism predominates in solid state phenomena where the microstruc- 
tural features involve dimensions of the order of /u.m. Thus surface diffusivity in an 
important mechanism in the early stages of sintering, where the necking of the spherical 
particles takes place. 

Considering the effects of surface-adsorbed elements, a good body of evidence 
affirms that adsorption will generally accelerate the surface diffusivity, while in some 
specific cases suppression will occur (Gjostein [1966]). The interesting data in fig. 31 
for example, compiled by Delamare and Rhead [1971], show the effects of a number 
of impurities, either arriving from the vapour phase or segregating from the bulk, on the 
surface diffusivities of copper, gold and silver. The Arrhenius plots for the pure metals 
are shown as solid curves, whereas each of the broken curves refers to the metal 
substrate containing an adsorbed layer of impurity under controlled conditions. The 
second element of each pair denotes the adsorbed species. Clearly, in some cases such 
adsorbed layers induce spectacular increases in the self-diffusivity of the substrate metal 
and at the highest temperatures, these values can approach a factor 10 4 above that of the 
clean substrate. As yet there is no agreement on the mechanism for this remarkable 
effect: one theory proposes that the uppermost atomic layer of the surface consists of a 
mixture of adsorbate and adsorbent, constituting in effect a two-dimensional compound 
with liquid-like properties. From this theory, it follows that adsorbates which form 
compounds with low melting points should lead to this accelerated surface diffusivity. An 
earlier theory proposed a “skating” mechanism in which the substrate atom is thought to 
climb on top of the adsorbed layer, where it may migrate freely with long jump distances 
of the order of 100 atom spacings, a hypothesis which could explain increases in surface 
diffusivities of the order of 10 4 . This overlayer model of adsorption diffusion now 
appears to be superseded by the mixed layer or two-dimensional liquid model, which is 
seemingly supported by Low-Energy Electron Diffraction (LEED) studies on adsorbed 
monolayers of lead and bismuth on copper. 

Whereas the copious studies on surface self-diffusion (ch. 7, §7.4) fall within the 
context of a fundamental understanding of mechanisms probably relevant to the 
important process of catalysis, the somewhat fewer studies of the effect of segregation 
on grain-boundary self-diffusivity have a more immediate applicability to physical- 
metallurgical phenomena. We now turn to consider the latter. 

Grain boundaries are paths for rapid self-diffusion and it is commonly understood that 
the transport of matter along such interfaces should be rate-controlling in a number of 
important metallurgical processes, in particular in temperature regimes of about 0.5 T m or 
below. In general, grain-boundary diffusivity data which refer to the pure elements have 
been employed in the modeling of various processes in metallurgy. Clearly, the use of such 
readily available data could introduce serious distortions if transport along grain boundaries 
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Fig. 32. The measured and predicted dependence of the grain-boundary self-diffusion in Fe-Sn alloys at 
888 K on the amount of tin segregated at the boundary (after Bernardini etal. [1982]). 

were to be sensitive to the microchemical aspects of the interface. Below we examine the 
theory and evidence for the role of solute segregation on grain-boundary diffusivities. 

A pure grain boundary is characterized by a high free energy, being a structural zone 
with a low volume density of matter compared with the bulk lattice. From such simplistic 
considerations, we might expect that this should lower the activation energy barrier for 
substitutional self diffusion, so that atom transport is easier along these paths than in the 
bulk crystalline lattice. This concept has been expressed in a semi-quantitative manner by 
Gibbs and Harris [1969] who related the activation energy for grain-boundary transport 
Q b to that for volume diffusion Q v by the expression 
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ft = ft - Na'y b a\ (58) 

where a 2 is the cross-sectional area of an atom*, N is Avogadro’s number and a' a 
structure factor which depends on whether diffusion is substitutional or interstitial. This 
predicts sensibly that increasing y b will lower the activation energy for grain-boundary 
transport and that for a highly coherent interface such as a twin boundary, with a low 
free energy, the activation energy for diffusion would be little different from that of the 
lattice. 

Consider the presence of a solute in the base solvent with accompanying high grain- 
boundary enrichment of solute. If the segregation-associated change in grain-boundary 
energy is ky b , it follows that the activation energy for grain-boundary diffusivity in the 
presence of segregation, Q b eg , is related to that for the pristine boundary by: 

ft"* =Q b + Na'a 1 Ay b . (59) 


Hence, according to this general model, segregation increases the activation energy for 
grain-boundary diffusion compared with the pure state. In effect, the segregation permits 
the sites of high compression and tension to be relaxed, reducing the net free energy of 
the boundary and increasing the volume density of the boundary by the occupation of 
vacant sites. Thus, from the diffusional point of view, the material in the boundary zone 
approaches the character of the material in the bulk lattice. The above approach allows 
estimates of the magnitude of the effect. Thus, when phosphorus segregates to the grain 
boundaries of pure iron, the grain-boundary energy is reduced by about 400 mJ/m 2 at its 
maximum (Hondros [1965]); substituting in the above equation and including a value 
of 2 for the structure factor appropriate to vacancy diffusion, the activation energy for 
grain-boundary diffusion in pure iron is raised from 175 kJ/mol to 200 kJ/mol. 

The above account is probably satisfactory for rough estimates of expected effects, 
and supports the view that the consequence of segregation is the elimination of the 
boundary sites having a high degree of disorganization, thereby decreasing grain- 
boundary transport with respect to that in a pure boundary. The subject has been 
developed analytically by Bernardini et al. [1982] who derived a more rigorous 
expression linking the change in diffusivity to the amount adsorbed at the grain bound- 
ary. This treatment takes as its point of departure of the theory of Borisov etal. [1964] 
who derived the basic relationship between grain-boundary energy and grain-boundary 
diffusivity, given by the expression 


y„ = 


mkT 


2 at 


In 


^-ln» 


D v a sv 


(60) 


where D b is the grain-boundary diffusivity, D v is the volume bulk diffusivity, a sv is the 
size of the solvent atom, S is the width of the grain boundary, and S = ma sv . This equation 


* In this chapter the atom size, a , is derived from pNi i 3 = M, where p is the bulk density and M the atomic 
weight of the solid. 
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is transformed by expressing y b in terms of r t , the amount of segregation. By combining 
eqs. (58) and (60), we effectively extend the Borisov treatment to a binary system in 
which grain-boundary segregation occurs. This yields: 


Pi 

d; 




i2a£/m 


(61) 


where D b is the grain-boundary self-diffusion coefficient in the presence of segregation, D v ' 
is the coefficient due to the presence of the solute in the bulk, D b and D v are the 
corresponding values in the pure solvent, a su is the atomic size of the solute in the 
particular solvent, and m is the quantity, retained from the Borisov equation (60), which 
relates to the number of effective atom layers in the boundary. The above equation has 
now been confirmed experimentally in iron containing tin, by directly measuring the 
diffusivity constant using radio-tracer techniques and also by determining the amount of 
grain-boundary segregation using Auger electron spectroscopy on exposed grain-boundary 
surfaces. Figure 32 shows how the measured grain-boundary self-diffusivity of iron 
decreases with increasing levels of tin present at the boundary, at 888 K. Similar data 
were obtained for a number of temperatures and for both solvent and solute diffusivities 
along the boundary. In the same figure, the straight line indicates the effect predicted by 
the application of eq. (61), the treatment necessarily restricting the prediction to the 
dilute approximation. 

Equation (61) may be simplified further by expressing D' v as a linear function of the 
bulk concentration of solute, X c , following a number of theoretical and experimental 
investigations on systems involving substitutional self-diffusion, as discussed in ch. 7, 
§4.1: 

D; = D v (l + b v X c ), (62) 


where b v is a temperature dependent coefficient representing the variation of D b with the 
bulk solute concentration X c . Thus eq. (61) can now be expressed as 

Di=D b [l + {b,-2a)X e \ (63) 


where a is a concentration ratio. This defines the quantity actually measured in boundary 
diffusion measurements in the presence of segregation as aD b S, and not the standard D b S 
where S is the boundary width. To express eq. (63) in a more accessible form, we 
introduce the grain-boundary enrichment factor defined earlier, fi b (eq. (15)), which is 
closely related to a by: 

« = (64) 
ma su 


Therefore, the overall effect may be expressed by: 
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(65) 
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In summary, this equation expresses the effect of segregation (contained in the enrich- 
ment factor f3 b ) on grain-boundary diffusivity. This is offset by the effect of the solute on 
volume self-diffusivity. The equation then predicts that grain-boundary transport of both 
solvent and solute atoms may be reduced or increased depending on the balance between 
the opposing effects implicit in fi b and b y . Since b v is generally of the order of 10-100 
and fi b for highly surface active species is of the order of 1000-10000, in general, the 
effect of the high enrichment factor will dominate in the presence of strongly segregating 
species and this will reduce the grain-boundary diffusivity. An enhancement of the solute 
diffusivities is caused by the increased matter transfer due to accumulation of the solute 
atoms in the grain boundary. It is only an apparent enhancement due to a being greater 
than unity and is not due to a real increase in D b . The magnitude of the enhancement 
depends on the interaction between grain boundary and solute (Kaur and Gust [1988]). 

The ability to predict the effect of segregation on grain-boundary diffusivities is a 
development which permits the quantitative examination of a number of metallurgical 
phenomena in which these kinetics are known to be rate-determining: some of these 
applications will be considered in § 6. 

5.3. Grain-boundary cohesion 

As was noted in the Introduction, historically the most important manifestation of 
grain-boundary segregation has been the effects on grain-boundary cohesion as expressed 
in a number of well known forms of intergranular fragility. This continues to provide 
much of the focus for work in this field. The central problem is the role of solute atoms 
on atomic cohesion at the interface. This is now recognized to be a wider issue, extend- 
ing to aspects of orf-hesion (i.e., cohesion between dissimilar pieces of matter), as in that 
between a metal and an oxide or another phase such as a precipitate or even a polymer. 

The theoretical and experimental framework for understanding adhesion in such 
systems and the role of interfacial microchemical interactions is hardly developed and 
this section will be restricted to grain-boundary cohesion which is now quite well 
advanced. However, in connection with adhesional phenomena in metals, Hondros 
[1980b] has attempted to systematize the data available for metals adhering to oxides and 
other phases, and it appears that the classical concept of work of adhesion, expressed in 
the formulation of Young and Dupr6 as the algebraic sum of the two free surface ener- 
gies and the interfacial energy, may be applied to these solid state systems. This would 
predict that segregation to metal/oxide interfaces, by decreasing the interfacial energy 
should increase the work of adhesion but scant data are available to test this hypothesis. 

Because of its technological importance, much of the work on grain-boundary 
cohesion has been concerned with effects of additions in iron, and it is now well 
established that the elements Cu, Zn, Si, Ge, Sn, P, As, Sb, Bi, S, Se, Te, and Mn all 
cause intergranular weakness in iron. A similar set also weaken copper and nickel. We 
now consider the two questions “why do certain elements weaken grain boundaries?” and 
“what is the relative embrittling potency of these elements?”. 

A number of theories have been proposed to account for the effect of segregants on 
grain-boundary cohesion. All have a common approach but differ in their interpretation 
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of the way to carry out the numerical calculations. In 1957 McLean first proposed that 
in low-temperature intergranular fracture, the total work of fracture is the sum of the 
ideal work of fracture, y, representing the energy to disrupt the bonds across the 
interface, and the irreversible plastic work of deformation, y p , which is absorbed in the 
crystal lattice as dislocations and phonons over a region well away from the boundary. 
Although y p is much larger than y it is easy to see that if y is increased, the forces at a 
crack tip must also be increased to propagate the fracture and hence y p will increase. 
Thus, for a given grain boundary, y and y p are directly related. The precise relation will 
depend on the orientations of the two grains and the grain boundary to the stress axis as 
well as on the microstructure. Indeed, analyses by McMahon and Vitek [1979] and by 
Jokl etal. [1980] show that y p is proportional to y" where n is in the range 2-5. Thus, 
to answer the above questions we must understand the effects of segregants on the ideal 
work of fracture of the grain boundary. 

The ideal work of fracture of a clean grain boundary with energy y° b was given by 
McLean [1957] as: 

y°=2y° s -yl (66) 

where y° is the fracture surface energy of the clean material. An analysis of y° andy° 
values for many metals by McLean and Hondros (Seah and Lea [1975]) shows that, 
typically, y° b ~ y°/3 so that the ideal work of fracture of clean grain boundaries is 
approximately 5/6 of that of an average plane in the crystal. This does not mean that the 
grain boundaries will always fail in preference to cleavage, since the orientation of the slip 
planes and the surface energy of the low-index cleavage planes make cleavage generally more 
likely than intergranular failure in clean bcc metals (Hondros and McLean [1976]). 

More important than the choice of brittle fracture modes is the balance between 
brittle and ductile failure. The fracture behaviour of a 0.2% carbon steel, as a function 
of the test temperature, is shown schematically in fig. 33. At high temperatures, where 
dislocations move easily, the yield stress is low and the steel fails in a ductile manner. 
As the test temperature is reduced, the dislocation movement, governed by a term of the 
form exp(-Q/RT), becomes more difficult and the material eventually fails in a brittle 
manner. The temperature at which this occurs is the ductile-brittle transition temperature 
(DBTT). If the grain boundaries contain the embrittling segregants, the boundary 
cohesion may be sufficiently lowered so that the transition temperature is raised and the 
otherwise ductile material becomes brittle. 

An analysis of the ideal work of fracture of grain boundaries with segregants, y, has 
been made by Hirth and Rice [1980] using thermodynamic arguments. They show that 
the reduction in the ideal work of fracture, where there is no redistribution of the 
segregant species, is given by: 

(67) 

where n b /I") are the chemical potentials of the solute species in equilibrium with a level 
segregation T mol/m 2 at the grain boundary and the free surface, respectively. This 
relation may be re-expressed: 
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Fig. 33. The effect of temperature on the tensile properties of a 0.2% carbon steel showing, schematically, the 
lowering of grain-boundary cohesion and the resultant increase in the ductile-brittle transition temperature (after 
Seah [1976]). 


P pmax _ p 

7 = 7° + r*(AG s - AG„) + /?rj; In ^ max — dT (68) 

where Tj, is the segregation level at the grain boundary, are the maximum segregat- 
ion levels at the boundary and surface, respectively, and the terms in AG refer to the 
binding energies of segregants, which, it should be noted, are negative quantities (Rice 
and Wang [1989]). In the dilute limit, the last right-hand term is equal to -RT In 2 and 
usually, it can be neglected (RT In 2 = 5.76 kJ/mol at 1000 K). Thus we see that the ideal 
work of fracture falls linearly at first with the level of segregation, and that elements 
which segregate more strongly to free surfaces than to grain boundaries reduce the ideal 
work of fracture and vice versa. By measuring A H s , A H b and P 4 under identical 
conditions in Ni 3 Al bicrystals with artificial cavities at the boundary, Otterbein et al. 
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[1993] applied eq. (68) to obtain the decrease of the grain-boundary fracture strength by 
segregation of sulphur. 

An alternative approach which gives the same result in the quasi-chemical pair 
bonding approximation has been given by Seah [1980b]. In this approach the actual 
bond energies across the boundary before and after fracture are calculated in the manner 
described in §4.2.2. In this way: 


o . 

y = y + 




-m 



(69) 


where the terms are as described in § 4.2.2. and, in addition Z g corresponds to Z v across 
the grain boundary (approximately |Z V ), is the value of r 4 at one monolayer, a A is the 
atomic diameter of the segregation atom A and is the sublimation enthalpy of A 
atoms in J/m 2 . Calculations may be easily carried out for eq. (69) in the regular solution 
approximation, but only small errors are involved if fi is ignored. In this case it is clear 
that the lower the value of H A b ' for the solute, the more will a segregant embrittle. 
Figure 34 shows a compilation of H sub, values for all elements. It is clear that the well 
recognized segregants, Sb, Sn, S, P, Mn, Si and Cu all with fF ub * values below that of 
iron shown by the dashed horizontal line, will all embrittle iron, whereas Mo and C will 
improve the cohesion; this is confirmed experimentally (Seah [1981,1990]). 

Detailed studies of the fracture properties of Ni 3 Al (Liu et al. [1985]) show that B 
has a strong effect inhibiting intergranular failure. The results of fig. 34 show that, if B 
segregates to grain boundaries, this is exactly what should occur. White et al. [1984] 
show that B does segregate to the boundaries and that they are strengthened. The theory 
of Hirth and Rice [1980] predicts the same result if A G s has a smaller magnitude than 
A G b . This too is shown by White et al. ’s [1984] data where no surface segregation is 
observed. 

The effect of individual segregants on boundary cohesion can also be considered from 
the viewpoint of atomic bonds at an interface. Presence of a foreign element at a grain 
boundary leads to a redistribution of electron charge density. In the case of electro- 
negative sulphur or phosphorus atoms the charge is drawn from the surrounding metal 
atoms to the impurity atom and consequently, the adjacent metal-metal bonds are 
weakened (Briant and Messmer [1982]). The bond parallel to the boundary is stronger 
than that perpendicular to the interface. This results in a reduction of cohesion across the 
boundary and thus, in grain-boundary embrittlement. In the case of the less electro- 
negative elements such as boron and carbon, relatively strong covalent-like bonds exist 
with metal in direction perpendicular to the boundary whereas much weaker bonding 
occurs with metal atoms in parallel direction thus enhancing grain-boundary cohesion 
(Wu etal. [1992]). 

As will be shown in the next section, it is not sufficient to consider solely the effect 
of segregants on the ideal work of fracture, as described above. At a crack tip it is the 
choice between dislocation emission and bond breaking that decides whether ductility or 
brittle fracture occurs. It can be shown that embrittling segregants increase the ease of 
dislocation emission at the crack tip, so offsetting the total embrittling effect discussed 
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above. The correction is small but allows the correct prediction of the absolute embrit- 
tlement of cooper by bismuth (Seah and Hondros [1983]). 



a,n m 


Fig. 34. The general embrittlement-ductility plot for matrix and segregant elements in the ideal solution 
approximation. Any element may be chosen as matrix and then those elements with higher values of fl”*’ 
increase the grain-boundary cohesion and vice versa. As an example iron may be taken as the matrix material 
(after Seah [1980b]). 
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6. Metallurgical phenomena affected by segregation 

Interfacial segregation influences metallurgical phenomena through its impingement 
on several basic physicochemical properties, involving the energetics, kinetics or the 
cohesion of the interfaces, which were treated in the previous section. This is illustrated 
schematically in fig. 35. For example, grain-boundary segregation will in general retard 
the boundary diffusivity and this in turn will affect the rate of diffusion creep or the rate 
of grain-boundary cavity growth involved in creep embrittlement. Again, because it 
retards grain-boundary ionic mobility in polycrystalline oxides, this should also inhibit 
the rate of surface oxide growth. In some instances, the particular metallurgical property 
may be affected through changes in more than one of the basic physicochemical proper- 
ties considered: for example, the rate oxidation of nickel alloys containing aluminium 
may be affected through the kinetics of ion transport across or through the boundaries, 
and in addition the integrity of the oxide layer is enhanced by improved interfacial 
adhesion resulting from the presence of segregated species at the metal/oxide interface. 

As fig. 35 clearly indicates, numerous metallurgical and microstructural phenomena 
in alloys are influenced by interfacial microchemistry: in some cases this may be a 
determining factor while in others it may be simply one of a number of contributory 
factors. In the space of this chapter, we cannot consider all of these phenomena in detail 
and so some key references are cited in the caption of fig. 35. Below we consider for 
each of the important basic physicochemical properties treated in § 5 (i.e., the first three 
columns in fig. 35) a significant example of its impact in physical metallurgy, following 
which we mention briefly a number of other metallurgical phenomena for which 
evidence is accumulating of effects related to microchemistry. 

6.1. Surface free energy change: role in creep cavitation 

In microstructural terms, the creep-rupture life of alloys serving at high temperature 
under stress may be controlled by the nucleation, growth and coalescence of micro- 
cavities which develop at grain boundaries, in general those normal to the stress axis. A 
number of contending micromechanisms have been proposed for the nucleation of these 
cavities, and while there is very strong evidence that in certain engineering alloys these 
are initiated at sites where slip planes intersect second phase particles such as carbides 
or manganese sulphide inclusions, cavities have nevertheless been observed in quite pure 
metals. Many observations in the literature strongly affirm that impurity adsorption at 
both the grain boundary and the surface of the cavity wall will affect creep-rupture 
properties. Depending on the temperature and stress regime, grain-boundary segregation 
may affect the process of cavity growth through its influence on the grain-boundary 
diffusivity, but in addition, because segregation to the inner surface of the cavity reduces 
its surface energy, it will also affect the nucleation propensity. 

Among the many examples, we cite the work of Tipler and McLean [1970] who 
showed metallographically and by creep measurements that small quantities of antimony 
in cooper increase the amount of grain-boundary cavitation and also lower the creep 
rupture life compared with that of the pure material. Kraai and Floreen [1964] showed 
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SURFACE AND GRAIN BOUNDARY SEGREGATION 


BASIC PHYSICO-CHEMICAL PARAMETERS AFFECTED 
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Fig. 35. Some metallurgical phenomena affected by physicochemical parameters influenced by surface- and 
grain-boundary equilibrium segregation. Key references for the interested reader are: (1) Batte et al. [1980]; 
McMahon [1992]; (2) Tipler [1980]; George and Pope [1989]; (3) Mills etal. [1973]; (4) Nicolet [1978]; 
Hayes and Grieveson [1975]; (5) Whittle and Stringer [1980]; (6) Hondros and Henderson [1983]; 
Matsuko etal. [1991]; (7) Briant [1988b]; McMahon [1991]; (8) Nicholas and Old [1979]; Lynch 
[1989]; (9) Latanision and Opperhauser [1974]; Lynch [1988]; (10) Suresh etal. [1981]; Sundarshan 
and Louthan [1987]; (11) Kupper etal. [1981]; Wei and Grabke [1986]; (12) Lea and Hondros [1982]; 
Bruemmer [1989]; (13) Nanni etal. [1976]; German and D'Angelo [1984], 


that the ductility of nickel is reduced by a factor of ten by the presence of sulphur of up 
to 20 ppm in the bulk, while Thomas and Gibbons [1980], and Osgerby and Gibbons 
[1992] demonstrated the remarkable effects of trace impurities on the creep performance 
of nickel-base superalloys. Figure 36 shows the substantial reduction of creep rupture 
ductility in these alloys caused by ppm levels of lead and tellurium. At the maximum 
impurity level shown in this figure, the deleterious effect of each impurity was associated 
in every case with a much higher level of grain-boundary cavitation, of the order of 5-10 
times greater than that observed in the pure alloy. In Nimonic 105 containing 34 ppm of 
lead, the amount of grain-boundary damage as measured by the number of cavities per 
mm 2 was ten times greater than in the alloy with the base composition. Similar effects 
were observed under presence of bismuth. 

In addition to the considerable metallurgical evidence in which impaired creep life is 
associated with the presence of impurities, which presents circumstantial evidence for an 
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Fig. 36. Reduction in creep rupture ductility in nickel-base superalloy induced by trace levels of impurities. 


effect through the interfacial microchemistry, there are important observations using 
Auger Electron Spectroscopy on the creep grain-boundary fracture surface which show 
a considerable accumulation of the suspected contaminant. An elegant example of 
selective segregation through the application of the high resolution (0.2 /tm) Auger 
spectroscopy in shown in fig. 37 (Franzoni et al. [1981]) for a commercial 2;jCrlMo 
steel, creep tested at 873 K and subsequently fractured at liquid nitrogen temperature for 
examination. The figure shows the high magnification scanning electron image of a 
grain-boundary area and the corresponding Auger spectroscopy maps for the elements Sn 
and Sb, indicating a simultaneous segregation of these elements only on the cavity 
surfaces. The non-cavitated areas of the boundary showed only P which was in turn 
significantly lower within the cavity. 

In considering the role of interfacial microchemistry on cavitation, we assume that the 
cavities have been successfully nucleated while the precise mechanism for nucleation will 
not concern us. We consider an applied tensile stress, <r, which performs an amount of 
work ^7 tP(t in generating a spherical cavity of radius r. The associated surface energy of 
the cavity is Airily at it is simple to show that as a cavity grows from a very small size, 
the net energy that has to be supplied first increases and then, after reaching a balance 
point, decreases, at which critical point the radius is given by r c , where 

a = 2 yj r c . (70) 

At this value, the competition between the surface energy which tends to sinter the pore 
and the applied stress is in balance. When the pore radius reaches a value greater than 
r c , energy will be saved as it grows to a microscopical cavity. If the surface segregation 
of the active species is sufficiently rapid, a pore nucleus with a radius smaller than r c 
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Fig. 37. SEM image of a creep cavitated grain-boundary area in 2jCrlMo steel and the corresponding AES 
maps for Sn and Sb showing segregation on cavity surfaces (after Franzoni et al. [1981]). 

may survive according to the above equation, because of the reduction in y s , typically by 
a factor of two. The net effect of this impurity segregation is that a greater density of 
nuclei per unit area of grain boundary will achieve stability and hence they will be able 
to grow: the experimental evidence cited above in connection with the incidence of 
segregation to cavity walls strongly affirms this view. 

The detailed mechanism of surface-energy induced cavitation damage has been 
analyzed by McLean [1981], specifically for the alloy Nimonic 80A. The steep function 
of the curve reproduced in fig. 38 shows the quantity N, the number of grain-boundary 
cavities in a cubic millimetre of material per 1% creep strain, as a function of r, the 
cavity radius. For the pure alloy, substituting for y s at a stress of 150 MPa, the critical 
value of r is 27 nm. At this stress, the cavities which are stable and which will contribute 
to the creep damage are those which fall to the right of r c under the solid line, shown as 
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r , nm 


Fig. 38. Theoretical distribution for the density of cavities, N, in Nimonic 80A as a function of the cavity 
radius, r indicating the areas (hatched) in which cavities will contribute to the creep damage. (Note, there are 
approx. 100 mm 2 of grain boundary per mm 3 of material here (McLean, private commun.)). 


the cross-hatched area ABC in the diagram. McLean integrated the equation representing 
the curve between r c -21 nm and r— » °°, and found that the number of cavities in this 
area is about 1.6 x 10' 3 /m 3 per 1% strain. In the first instance, segregation to the cavity 
wall surface affects this process as we have seen by reducing y s , thereby reducing the 
critical stable pore size to r' c . This is marked in the diagram at 15.6 nm, corresponding 
to a 42% reduction in y s . There is also a second effect to be considered, associated with 
segregation to the grain boundary itself. The consequent reduction in grain-boundary 
energy leads to a reduction in intergranular cohesion (eq. (66)), and, irrespective of the 
precise mechanism of nucleus generation, a lower stress concentration is required to 
nucleate a cavity in a segregated boundary than in a clean boundary (Hondros and 
McLean [1974]) so that more cavities will be generated for a unit strain. Thus the curve 
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Fig. 39. Theoretical deformation map as a function of tensile stress and temperature where Coble creep 
dominates and the effect of low levels of sulphur on the boundaries (broken lines), for a grain size of 32 /xm, 
and strain rate of 1CT 8 s. 


for nucleation of cavities in the presence of grain-boundary segregation lies at a high 
value and is shown as the dashed curve in fig. 38. The two effects of segregation are 
cumulative with respect to cavity stabilization, and all cavities which fall in the hatched 
area A’B’C to the right of r' now contribute to the creep damage. It is difficult to 
ascribe accurately the position of the dashed curve representing the role of grain- 
boundary segregation and the ease of decohesion. For an order of magnitude increase in 
the creep cavity density in the presence of a segregant, the dashed curve corresponds to 
N values about three times those associated with the solid curve. Thus, while at this stage 
the model appears somewhat qualitative, the quantities are plausible and consistent with 
some experimental observations. 

6.2. Grain-boundary diffusivity: role in diffusion creep 

It is now well established that at lower temperature regimes and under light stresses, 
the predominant mode of deformation of polycrystals is dominated by diffusional creep, 
in this case, by mass transport along the grain boundaries. In the course of studying 
sintering mechanisms in alumina, Coble [1963] recognized this deformation mode and 
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accordingly modified the equation for diffusional creep associated with the names of 
Nabarro [1948] and Herring [1950] in which volume diffusivity dominates: this is 
analogous to the Herring equation and is of the form 


B 2 ailSD b 

d 3 kT 


(71) 


Here e is the strain rate, a the tensile stress, fl the atomic volume, d the grain size, and 
B 2 a numerical constant which varies with grain geometry. It is appreciated that both 
lattice- and grain-boundary diffusion contribute independently to diffusional creep, and 
the general equation involving contributions from both volume- and grain-boundary 
diffusion is given by: 


e = 


ail 

d 2 kT 


B l D v + 


B 2 SD b j 


(72) 


Clearly, because the activation energy for boundary diffusion is less than that for lattice 
diffusion and because of a greater sensitivity to grain size, the Coble creep mechanism 
predominates at the lower temperatures and in fine grained material. Ashby [1972] 
showed that a generalized deformation mechanism map may be produced by plotting 
stress as a function of homologous temperature (see ch. 22, § 11). Applying a version of 
the above constitutive equation, the Ashby map indicating the fields of predominance for 
various deformation mechanism is reproduced in fig. 39 and shown as solid lines, for 
nickel. This indicates clearly the important tracts where Coble creep predominates. Such 
maps have been computed on a basis of diffusional data derived from pure metals. If 
impurity species in the bulk alter the lattice diffusivity, or more importantly, if by 
enriching at the grain boundaries they affect the boundary transport, this should be 
reflected in significant shifts in the boundaries on such maps. We examine here to what 
extent the Coble creep mechanism is affected by the presence of interfacially active 
impurities and how this is reflected in the positions of the boundaries on the Ashby map. 

The predictive eq. (65) developed earlier shows the dependence of grain-boundary 
diffusivity on grain-boundary segregation as expressed in the enrichment factor /3 b . We 
insert this into eq. (72) above and limit it to the deformation regime where grain- 
boundary transport is rate-determining. 

From these equations we compare the strain rates e and e' resulting from a change 
of D b to D b and D v to Dj ; hence: 


— = 1 + b v X c 


2 at 


ma c , 




(73) 


where the symbols are as defined in §5.1. This may be simplified further: noting that in 
general b v has a value of 10 2 and considering very dilute concentrations of highly surface- 
active species (X c of the order of ICC), the second term containing b v may be ignored in this 
instance. Hence in a binary system in which the second component is very dilute and also 
very surface-active, the grain-boundary diffusion creep rate at a given temperature is related 
to the diffusional creep rate of the pure solvent in the first approximation by: 
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(74) 


There exist practical systems in which the dilute approximation implicit in the above 
equation should apply. For example, consider the effect of small impurity additions of 
sulphur on the nickel deformation map shown in fig. 39 in connection with the boun- 
daries of the Coble creep mechanism. For the application of eq. (74), it is required to 
know the solid solubility of the impurity in the parent lattice at the temperature of 
interest, and also the relevant atomic sizes. Thus the quantity j 8 b is derived by substi- 
tution in the solubility law of eq. (15) described earlier. In the case of nickel, a number 
of dilute impurities at the ppm level may introduce significant effects on Coble creep 
rates. The computed effect of trace additions of sulphur on the extent of the Coble creep 
regime in nickel is superimposed on the Ashby map in fig. 39 as the dashed lines, here 
specifically for a grain size of 32 pm and a critical strain rate of 10' s /s. This shows that 
by introducing quantities as low as 2 and 10 ppm of sulphur in nickel, the boundaries of 
the Coble creep regimes are shifted substantially. These large effects predicted for 
sulphur, magnesium and other dilute impurities (Hondros and Henderson [1982]) 
demonstrate the important role of interfacial microchemistry on grain-boundary diffusion 
phenomena and how in this particular case, the impurities induce a beneficial effect by 
inhibiting grain-boundary transport. 


63. Interfacial cohesion: role in temper-brittleness 

Temper-brittleness is one of the oldest known forms of brittle intergranular failure. 
It occurs in low alloy steels incorporating alloying elements such as Cr, Ni and Mn when 
they are slowly cooled from temperatures above 773 K. High-purity alloys do not exhibit 
this embrittlement and it has been shown that the cause can generally be associated with 
the adventitious levels of P, Sn, Sb and As impurity, which are typically in the 50-200 
ppm range. It has been recognized that Cu is also deleterious and this is important since 
much present day low alloy steel has copper contents as high as 0.15%. Analyses of 
commercial steels show that temper-brittleness is, in practice, mainly caused by the 
grain-boundary segregation of phosphorus whereas in nickel-containing steels, tin also 
contributes. The phosphorus comes from the original steel making, and is expensive to 
remove to very low levels. The tin arises from recycled scrap plated steel. The role of 
individual elements in temper embrittlement is summarized by Briant [1988b] and 
McMahon [1991], 

It is instructive to consider the analysis of one very well documented instance of 
temper brittleness investigated at the Watertown Arsenal (Carr and etal [1953]). As we 
have mentioned earlier, as segregation increases in a steel, the ductile-brittle transition 
temperature (DBTT) also increases until it rises above the ambient temperature and 
brittle failure will occur in service. Studies of the DBTT through Charpy impact measure- 
ments therefore provide a characterization of the heat treatments that will cause temper 
brittleness. Figure 40 shows the DBTT measurements for SAE 3140 steel (1.26% Ni, 
0.77% Cr, 0.39% C, 0.015% P) heated to 948 K and water-quenched, followed by iso- 
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Fig. 40. The lime-temperature diagram for embrittlement of SAE 3410 steel corrected to constant hardness 
(after Carr etal. [1953]). 


thermal heat treatments at increasing times in the critical temperature range 648-848 K. 

In fig. 40 we observe several of the characteristic features of temper-brittleness. The 
embrittlement occurs most rapidly around 773-823 K, and reheating above 873 K rapidly 
removes the embrittlement. Material quenched through the temperature range 673-873 K 
will not be embrittled, whereas large-section castings or forgings necessarily cooled 



Fig. 41. The time-temperature diagram for the segregation of P in the SAE 3410 steel of Carr etal. [1953]. 
The numbers against the curves denote the P segregation occurring as a result of the embrittling treatment, i.e., 
in excess of the 0.06 monolayers segregated during 948 K (675°C) temper, (after Seah [1977]). 
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slowly through this range will be severely embrittled. The embritdement is often called 
reversible temper-brittleness since samples embrittled trough slow cooling can be de- 
embrittled by reheating above 873 K followed by a quench, and then again embrittled by 
a slow cooling treatment. 

The reconstruction of fig. 40 has been made by a calculation of the segregation of 
phosphorus in the steel using Guttmann’s modified theory as described at the end of 
§4.4. Seah [1977] used this approach with A =-11.5, A G° =-44.0 and a^ j7 =-26.0 
kJ/mol to predict a low phosphorus equilibrium segregation above 800 K but rapidly 
rising to near the saturation of one monolayer at 748 K. Combined with this prediction 
are the kinetics of segregation discussed in §4.5 with D P = 1.58 x 10^xexp(-219000//?7) 
m 2 /s. These permit the generation of the segregation contours of fig. 41 which correlate 
very closely with the DBTT curves. This correlation shows the dramatic DBTT shift of 
275 K per monolayer of segregated phosphorus. 

An alternative to the above interpretation of temper-brittleness via the alloy-impurity 
coupling model of Guttmann, has been proposed. Studies of phosphorus segregation in 
pure iron, and alloy with up to 0.1% carbon and 2.2% chromium by Erhart and 
Grabke [1981] show that the phosphorus segregation in pure iron is as high as custom- 
arily observed in steels but that carbon, if present, will also segregate, reducing the 
phosphorus segregation through site competition, and improving the cohesion as 
described in § 5.3. The action of added chromium is shown to precipitate the carbon, so 
that it no longer segregates and hence the phosphorus segregation returns to the initial 
high level. Thus, in steels, the presence of chromium appears to enhance the phosphorus 
segregation, but in reality it is the carbon which the chromium influences and this, in 
turn, affects the phosphorus segregation. This view is supported by Briant [1981] who 
also finds that the alloying elements have no effect on phosphorus segregation in 0.3% 
carbon steels. If this model is correct, the lower part of fig. 41, below 723 K, should 
remain largely unaltered but the top part above 773 K, should arise as a result of carbon 
competitive segregation deriving from, say, dissolution of carbides at temperatures above 
773 K. This is clearly not the situation in figs. 40 and 41 since the Cr can only precip- 
itate a small fraction of the C in the steel. We conclude that, for the SAE 3 140 steel, we 
must be concerned with Guttmann’s alloy-impurity coupling. However, it is possible that 
in higher-Cr steels such as 2{CrlMo, with lower carbon contents, the Erhart and Grabke 
mechanism may operate. In other steels both mechanisms may be important. 

Finally, such steels as the Mo-containing 2]CrlMo and 3Cr^Mo alloys incur 
additional effects in that Mo and P react readily to form (FeMo) 3 P precipitates. Thus, in 
untempered steels quenched from above 1123 K, there is little P segregation and most of 
the Mo is in solution. As a function of time in the tempering range 723-923 K, the Mo 
that is free in solution couples with the P and according to the Guttmann theory this 
leads to high equilibrium segregation. However, the coupling is strong, causing the 
formation of Mo 3 P microprecipitates so that free P level is low, thereby considerably 
retarding the rate of build-up to high segregation at grain boundaries. This is clear from 
eqs. (49) and (50) where the initial rate of segregation is proportional to the free solute 
content. Mo retards the diffusivity of P in pure iron by an order of magnitude at these 
temperatures. Thus Mo initially retards the segregation of P. With time at temperature, 
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Fig. 42. The effect of increasing bulk tin content on the rate of coarsening of nitride precipitates in iron 
at 643 K. 


the Mo nucleates Mo 2 C carbides which precipitate and release the locked-up P. Thus, as 
time evolves the P segregation may eventually return to the level expected in Mo-free 
steels, as observed in practice (McMahon et al. [1977]). The initial retarding effect of 
Mo improves as the Mo content of the steel increases but reaches an optimum in the 
range 0.4-0.7%, after which the embrittlement deteriorates again (Dumoulin et al. 
[1980], Yu and McMahon [1980]). Part of the initial improvement is, as discussed 
above, due to the reduced P segregation. An additional effect arises through the 
improvement in cohesion caused by Mo segregation predicted in fig. 34 and observed by 
Dumoulin et al. [1980], However, as the Mo content increases, the propensity to Mo 2 C 
precipitation increases and above 0.7% content the remedial effect disappears. Later 
experiments of Menyhard and McMahon [1989] suggest that the entire molybdenum 
effect could be that of cosegregation with carbon which offsets the embrittling effect of 
phosphorus, and the role of molybdenum as acting as traps for phosphorus seems to be 
only additional. 


References: p. 1284. 



1274 


E.D. Hondros et al. 


Ch. 13, §6 


6.4. Further examples of metallurgical phenomena influenced by microchemical 
processes 

The case studies of metallurgical properties affected by microchemical processes 
presented above are being increasingly supported by a growing body of theoretical and 
experimental studies. Some of these are now fairly mature examples. As fig. 35 (above) 
indicates, there exist many other examples of metallurgical phenomena at various stages 
of understanding and different levels of significance, which are being progressively 
scrutinized in terms of the contribution of the interfacial microchemistry. Below we 
consider, in brief, a selection of case studies which are being currently expanded by work 
in several laboratories and which represent, to a certain extent, fresh directions in the 
understanding of traditional metallurgical problems. 

6.4.1. Microchemical barrier layers 

There is a category of metallurgical phenomena in which interfacial microchemistry 
induces its effect through an impedance layer across the interface. The phenomena refer 
mostly to ripening or microstructural shape modification processes (ch. 9, §4.2.2.; ch. 
15, § 2.4.2), which depend on the ease with which atoms can transfer across the interface 
between solid phases, but they may also operate at a free surface (as in nitridation or 
carburization from gas phase equilibria), involving atom transfer across the free surface 
from the vapour to the interior of the solid. 

Consider second phase particles or precipitates in a solid matrix at service or pro- 
cessing temperatures. There exists an inherent tendency for such particles or precipitates 
to coarsen in order to reduce the total interfacial free energy associated with them. The 
rate-determining step in normal ripening processes is the volume diffusivity, however in 
the presence of segregants at this type of interface, the atom transfer step may become 
rate-determining. This may lead to a significant retardation in the kinetics of ripening. 

There exists much indirect evidence for the segregation of impurities to these 
interfaces. Boyd and Nicholson [1971] showed that the addition of 0.1% Cd to an 
Al-4%Cu alloy produced a remarkable reduction in the ageing kinetics of O' precipitates 
at 473 K as indicated by the rate of growth of the mean particle size with time. In 
addition, they were able to measure by isothermal calorimetry a considerable reduction 
in interfacial free energy, from 1530 mJ/m 2 to 250 mJ/m 2 , produced by the cadmium 
additive. This is clear evidence for the segregation of the cadmium to the interface. This 
subject has been considered in most detail for the ripening of y'-Fe 4 N precipitates in an 
Fe-N alloy. Low bulk additions of antimony, oxygen, phosphorus and tin produce a 
remarkable inhibition in the rate of coarsening of the nitride. This is illustrated in the 
work of Miller [1979] in fig. 42 in which the increasing bulk tin content is seen to 
reduce the mean nitride particle coarsening rate by an order of magnitude compared with 
the behaviour in pure iron. 

In considering the theory of the segregation-induced retardation of ripening, the 
starting point is the Lifshitz-Wagner equation (Lifshitz and Slyozov [1961]; Wagner 
[ 1961]) for volume-diffusion controlled growth of a dispersion of particles of initial mean 
radius r 0 : 
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= B,y,V 2 C 0 D v 
RT 




(75) 


where r is the mean particle radius after a time y, the interfacial free energy, V 

the molar volume of precipitates, C 0 the molar concentration of solute in equilibrium 
with a particle of infinite size and 5, is a numerical parameter which can vary according 
to the shape of the precipitates. Segregation to such interfaces will reduce y t typically by 
about 50%. However, much larger inhibitions in coarsening rates have been observed, 
and a reduction in interfacial energy cannot in itself account for the observed behaviour. 
It is believed that the most likely mechanism is that the segregate inhibits the rate of 
transfer of solute atoms across the interface. In this situation the relevant ripening 
equation becomes: 


2 _ 2 _ ^y,v 2 c 0 6 


r - r n = 


RT 




(76) 


where Q is a reaction rate parameter associated with the ease of transferring a species 
across the interface. Thus, according to this theory, the mean particle size varies as t l/i 
for volume-diffusion control and with t w 2 for interface-reaction control. In studies of this 
nature it has not been easy to make a clear distinction in the mechanisms on the basis of 
this time-dependent growth behaviour because these functions both lie within the 
experimental scatter. Thus, although these studies do not give a clear confirmation on 
this mechanism, neither do they contradict it. Miller [1979] has re-examined the theory 
and has shown that it predicts a time-invariant size distribution profile depending on the 
above growth mechanisms and that the data shown in fig. 42 can be fitted more closely 
to the particle-size profile predicted for interface control. 

The surface-active elements which inhibit the ripening of precipitates are now shown 
to be able to form barrier layers at iron surface, thereby strongly retarding nitridation and 
carburization reactions. The inhibition can be quite significant as demonstrated by 
Grabke et al. [1977] with sulphur impurity adsorption up to a level of a monolayer. In 
this type of atom transfer process, the alloying species is initially present in a molecular 
form, as C0 2 , CH 4 or NH 3 , in the vapour phase, and for its entry into the metal the basic 
mechanism is the stepwise catalytic dissociation of the surface-adsorbed molecule, allow- 
ing the C or N ions to penetrate into the interior. The strongly bound impurities at the 
surface impair the dissociation step, as in a poisoning reaction. In the reverse process, the 
surface denudation of nitrogen or carbon in iron alloys has also been shown to depend 
strongly on the existence of chemical barrier layers which inhibit the formation of the 
molecular species at the free surface. The precise mechanisms appear to be strongly re- 
lated to the catalytic poisoning of surfaces by impurities, which operate in these instances 
in a manner that effectively retards the rate of ingress or expulsion of the elements. 


6.4.2. Creep-embrittlement 

It was noted earlier that as a consequence of surface segregation, the rate of grain- 
boundary creep cavity generation is increased, leading to a deterioration in creep rupture 
life. This pertains to typical weldable creep-resistant steels of the Cr-Mo-V series where 
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the impurities present in the bulk exercise their role in the heat-affected zone of the weld, 
following a stress-relief heat treatment at temperatures of the order of 973 K. Thus in a 
stress-relief operation in the average commercial UK steel of the type ^Cr^MojV 
(impurities in ppm: P, 170; Sn, 120; Sb, 30; As, 160; Cu, 1200; Ni, 1200), the most 
active embrittling impurities are Sn, Cu, As, Sb and P in that order with relative fragility 
potencies given by the ratios 3:2.5:2:1:1, respectively (Lea etal. [1980]). 

We now consider these alloys in service conditions, where stress and temperatures are 
lower than in the above, and under longer-term creep durations. Analysis of existing data 
by Seah [1980c] reveals a different effect. Here it appears that the rate-determining 
mechanism is the transport of matter or vacancies along the grain boundaries in order to 
feed the growing cavities. For a given strain associated with cavity growth, a high 
density of small cavities is much more harmful than a lower density of large cavities. 
The formed condition may be promoted either by increasing the cavity nucleation rate 
(which we have seen is induced by surface segregation to the cavity walls) but it may 
also be promoted by retarding the growth rate. These effects are noted in the analysis of 
creep cavitation by Skelton [1975], and are described by the equation 


e. = 


0.65 


2i TD b So{l 
NkT 




(77) 


where e r is the strain to rupture, d is the average grain diameter, D b is the grain-boundary 
diffusivity and N the continuous rate of creation of cavities; the other symbols have been 
defined earlier. 

A regression analysis on the effects of various impurities on the rupture ductilities 
during long-term creep at 823 K on ^CrlMojV commercial steels has shown a decrease 
in the rupture ductility with increasing impurity content as indicated in fig. 43 (Seah 
[1980c]). The dependence of rupture ductility upon impurities is given by: 

-A(rupture ductility) 1.5P + 0.36Cu + 1.6Sn + 1.2S, (78) 


where the element symbols refer to solute concentrations in the bulk. For the above 
typical commercial-purity steel, the relative damaging potencies of the impurities are 
different from that of the stress-relief case of embrittlement mentioned above: the order 
is now Cu, P, Sn and S with relative fragilities of 2.5: 1.5: 1:1, respectively. 

Thus, in both forms of creep embrittlement pertaining to the generation and growth 
of grain-boundary cavities, such as those illustrated by stress-relief cracking or long-term 
creep-embrittlement, the impurity sensitivity arises from different modes of interfacial 
segregation. This difference appears to be associated with the very different strain rates 
are experienced. 


6.4.3. Intergranular stress-corrosion cracking 

The notions of interfacial microchemistry and the effects on interfacial cohesion are 
being applied currently to explain various forms of intergranular cracking in the presence 
of both stress and a corrosive environment. Thus, pure iron is resistant to stress-corrosion 
cracking in nitrate environments, however in commercial carbon steels, failure is 
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Fig. 43. Regression analysis of creep ductility in JCrj-MojV steels as a function of a combined impurity 
parameter involving P, S, Cu and Sn (after Seah [1980c]). 


invariably intergranular. Many attempts have been made to trace the role of alloying 
elements, for example carbon, aluminium, and chromium, but there is seemingly little 
systematic behaviour. The importance of the impurity contents of these commercial steels 
has been recognized. Lea and Hondros [1982] prepared samples from high-purity mild 
steel stock, each sample containing a single impurity at a dilute level comparable with 
that found in commercial material. Here the free residual sulphur, which is very active, 
was first rendered harmless by the addition of manganese. Each impurity element was 
added separately at a bulk level which was predicted to give rise to a grain-boundary 
segregation content of about 20% of a monatomic layer following an isothermal heat 
treatment. The samples were tested to failure in a controlled environmental cell con- 
taining ammonium nitrate, at a constant strain rate. In each case, a control sample was 
tested under exactly the same conditions except for a neutral solution. Here it was seen 
that every specimen exhibited stress corrosion in the nitrate environment, failing in a 
shorter time than that in the control environment. The copious data shown diagram- 
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matically in fig. 44 refer to the time for intergranular failure for alloys stress-corroded at 
a constant strain rate, noting that the data on the right-hand side of the diagram refer to 
the tests in the controlled solution, here paraffin. In the nitrate solution the fracture was 
almost always entirely intergranular, while in the control solution ductile fracture 
predominated. The figure shows that in the aggressive environment, with all other 
conditions identical, the reduction in life depends on the impurity present. Taking the 
time to failure under constant strain-rate testing, normalized to the value in a benign 
environment as the measure of the reduction in the resistance of the steel to cracking, the 
data can be presented in the form 

-A R = 20P + 1 ,9Cu + 1 .OSn + 0.9Sb + 0.9(others). (79) 

Here the element symbols represent the bulk content in wt%. The changes in R thus 
provide a ranking order of the potent segregating impurities which affect the stress- 
corrosion propensity. We note that in commercial steels S, Ca and Al are mainly present 
as precipitates. To optimize the performance of carbon steel in nitrate environments, the 
clearest metallurgical guideline from this is a reduction in bulk impurity levels, par- 
ticularly that of residual phosphorus. 

Although the detailed micromechanism for this effect awaits elaboration, the principal 
observations can be described fairly adequately by a dissolution model in which the grain- 
boundary material is preferentially removed, the driving force for this being the diver- 
gence of the segregant’s electrochemical potential from that of iron. In an oxidizing 



Fig. 44. The times to failure of 1 1 specimens of carbon steel, each containing a different impurity element, 
stress-corrosion tested in ammonium nitrate solution at a constant strain rate. The control tests in a neutral 
environment are shown to the right (after Lea and Hondros [1982]). 



Ch. 13, §6 


Interfacial and surface microchemistry 


1279 


environment such as aqueous ammonium nitrate, the driving force for atomic dissolution 
is related to the potential difference between the matrix and the segregants, which form 
a galvanic cell. Thus for segregants which have a more positive potential than iron, these 
atoms themselves are dissolved more rapidly and a very sharp grain-boundary crack is 
exposed for further dissolution. On the other hand, for segregants having a more negative 
potential than iron, atoms adjacent to the boundary are dissolved, thus enhancing the 
intergranular corrosion but with a different crack tip profile. Grain-boundary segregation 
during stress-corrosion cracking furthermore promotes dissolution by reducing the rate of 
repassivation after slip has ruptured protective oxide films and crack tips (Lynch [1989]). 

6.4.4. Intergranular hydrogen-embrittlement 

An aspect of the ubiquitous problem of the hydrogen-embrittlement of steels is of 
interest here. This is the intergranular mode of failure, which has been considered 
recently in relation to the microchemical nature of the grain boundaries. Thus Latani- 
sion and Opperhauser [1974] proposed that hydrogen penetration along the grain- 
boundary regions should be promoted by hydrogen-recombination poisons which are 
present as grain-boundary segregants. These elements, such as P, S, As and Sb, which 
have a strong tendency to segregate to grain boundaries in steel also have a stronger 
intrinsic binding to hydrogen than does iron (Lynch [1989]). A probable effect of this 
is that following the dissociation of a hydrogen molecule which arrives from the vapour 
or from solution, the stronger bonding available at the solute atom sites causes the 
hydrogen to reside at these sites for much longer times than at normal iron sites, and 
therefore there is a greater chance for the hydrogen to be absorbed into the lattice. Hence 
the role of these solute species is effectively to increase the amount of hydrogen 
occluded along the grain-boundary zones. If the mechanism of hydrogen-embrittlement 
requires the concentration of hydrogen in the metal just ahead of the crack tip, the above 
grain-boundary segregants should therefore enhance this embrittlement. 

6.4.5. Inhibition of surface oxidation on alloys 

In certain alloys, especially those containing aluminium, surface oxidation can be 
inhibited by the introduction of dopants which both reduce the rate of grown of the oxide 
and also inhibit “breakaway” oxidation by improving the scale adhesion at the metal/ 
metal-oxide interface. This complex subject has been reviewed in detail, for example by 
Whittle and Stringer [1980] who assessed various mechanisms for the improvement 
in performance. Several microchemical phenomena have been proposed to account for 
this. One proposal, which requires experimental verification but which is consistent with 
the analysis given in § 5.2, is that the dopant segregates to the oxide grain boundaries, 
thereby reducing the boundary diffusivity. In their study of oxidation behaviour of 
■y-y'-Cr 3 C 2 eutectic composite containing additions of yttrium at a low level, Bullock 
et al. [1974] observed a remarkable improvement (i.e., reduction) in oxidation weight 
gain at 1100°C and in addition showed by AES that the yttrium level increased with 
distance in the oxide bulk and reached a peak level at a depth which suggested that the 
interface between the oxide and the alloy was enriched in yttrium. In addition to the 
probable improvements in growth kinetics by segregation to the boundaries, it was 
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considered that the formation of a microchemical barrier layer at the metal/metal-oxide 
interface improved the adhesion of the alumina in the mixed metal-oxide scale to the 
metal substrate and hence increased the resistance to oxide scale sparling. 

A third possibility for inhibited high-temperature corrosion in through the formation 
of a microstructural barrier layer at the metal-oxide/vapour interface. The presence of 
boron in the vapour phase is known to improve greatly the oxidation resistance of 
chromium-containing steels by increasing the time to “breakaway” oxidation. Lea [1979] 
showed by AES that the boron was present essentially only at the outer oxide surface 
layer and it was also accompanied by a high chromium enrichment. The mechanism for 
this type of inhibition thus has similarities with the processes considered above (§ 6.4.1). 
In particular, it was considered that the introduction of O-B-O covalent bonds in the 
outer oxide surface structure would impede the transfer of O ions onto the lattice, thus 
forming a film which in turn allows slowly diffusing protective chromium from the alloy 
to reach the free surface (ch. 14, § 3). 

7 . Interfacial microchemistry and materials design theory 

All the important themes of physical and chemical metallurgy contribute collectively 
to the theory of materials design — including the thermochemistry of phase equilibria, 
crystal defect theory, fracture theory, oxidation and corrosion resistance and so on. This 
chapter has placed emphasis on the importance of the relatively new subject of interfacial 
microchemistry for a wide diversity of metallurgical properties, and the insight and 
information now made available should be viewed in terms of its contribution to 
materials design. Here we mention two basic approaches to the materials design theory, 
the alloy design and the grain-boundary design. In the former one we distinguish between 
deliberate additions of specific elements which procure beneficial effects on certain 
properties, and the planned control of known elements which are detrimental. In the latter 
approach, effort is focused on tailoring the structure of a polycrystalline material with 
improved properties using a specific arrangement of appropriate grain boundaries. The 
advanced knowledge in these fields provides many opportunities to manipulate material 
properties, often in the context of “fine tuning” but equally often to gain significant 
improvements in service life. In the past two decades, several lines of advance have been 
made in this subject matter which are valuable to alloy design: (i) the identification of 
potent impurity species (both harmful and beneficial); (ii) the basic mechanisms 
(sometimes only partly understood) by which they operate; (iii) the enrichment levels as 
well as bulk contents at which the effects are significant. As for grain-boundary design 
approach, progress has been made in: (i) identification of the types of grain boundaries 
characterized by different properties; (ii) processes for controlling the grain-boundary 
character distribution in polycrystalline materials. 

This chapter provides several guidelines. Thus, on the basis of a retardation of grain- 
boundary transport, or on improved interfacial cohesion, certain types of elements can be 
beneficial — as in the improvement of oxidation characteristics or in the rate of Coble 
creep. Certain elements can be predicted to offer an improvement in sintering behavior 



Ch. 13, §7 


Interfacial and surface microchemistry 


1281 


through a segregant stabilization of fine grain structure. In an example of benefit to 
mechanical properties, White et al. [1981] demonstrated that the room-temperature 
strength and ductility of a Pt/Rh/W alloy can be improved by the addition of 75 wt ppm 
of B which was shown to enrich, in a typical manner, the grain boundaries. 

Where the impurities produce deleterious effects, clearly the alloy design aspects 
involve the elimination of the impurities or, more practicably, their control to within 
tolerance levels. The identification of potent surface-active species and the assigning of 
bulk tolerance levels have been the specific objectives of many studies in this field. The 
information is couched often in formulae of the type presented above for intergranular 
stress-corrosion cracking (eq. (79)) or creep rupture ductility (eq. (78)). This permits the 
design of alloys of closely controlled compositions for improved performance in the 
defined conditions of use (Hondros [1986]). 

In a previous discussion (Hondros and Seah [1977b]), we presented a number of 
procedures which might offer a control over the effects of harmful impurities — 
including the judicious use of the principles of competitive segregation, the selection of 
remedial segregants, the thinning out of the embrittling species by spreading them out 
over a large interfacial area, i.e., a very fine grain size, reducing the kinetics of the 
embrittling species, and the used chemical fixation or precipitation reactions. In the 
meantime, at least one of the above approaches has been explored in depth, in connection 
with affording a simple and economical control over temper-embrittlement. Seah et al. 
[1979] demonstrated that temper-embrittlement induced by P and Sn in 2jCrlMo and 
3yNi-Cr-Mo-V steels can be reduced by additions of La or Ce at a predicted level. The 
selection of La, Ce and mischmetal (approx. 50% Ce, 25% La, 25% other rare-earth 
metals) are based on a comprehensive thermochemical assessment of the possibility of 
forming stable microprecipitates in order to reduce the bulk levels of active P and Sn. 
The optimum addition of La required to remove the embrittling effects of P and Sn is 
given by the formula 

La = 8.7S + 2.3Sn + 4.5P (80) 

where the element symbols represent the wt% of the element in the steel. Figure 45, 
taken from this work, shows the high DBTT for a pure 2{CrlMo steel with 0.017% P 
added. The further addition of various levels of La lower the transition temperature 
significantly, to the limit of a high purity material. 

In the future, the increasing knowledge of structure/property relationships for grain 
boundaries (including anisotropy of grain-boundary segregation and thus, for instance, 
anisotropy of brittle intergranular fracture) may be practically used to control materials 
properties. The differences in grain-boundary character and geometric configuration 
enhance the heterogeneity of all types of deformation and fracture in polycrystals. This 
effect is schematically shown in fig. 46. The presence of low-energy grain boundaries 
(2-boundaries in the figure) in a polycrystal will not contribute much to the deformation 
and fracture while the presence of general grain boundaries (R-boundaries) will exert a 
significant contribution. Therefore, a heterogeneity of the fracture process occurs which 
can either lead to an easy fracture (figs. A, B, D or F) or to a difficult fracture (figs. C 
and E) under the same stress. On the basis of these considerations, Watanabe [1984] 
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Fig. 45. Improvement in the ductile-brittle transition temperature in a steel susceptible to temper embrittlement, 
resulting from low level additions of lanthanum which combat the effect of phosphorus (after Seah et al. 
[1979]). 


pointed out the importance of the effect of the type and frequency of grain boundaries to 
the behaviour of a polycrystal and proposed the concept of grain-boundary design. The 
basic idea of grain-boundary design is to produce polycrystalline materials with grain 
boundaries of such character and distribution as will suppress a “detrimental effect” of 
the grain boundary and conversely, increase its “beneficial effect” as much as possible. 
In this way, the properties of a polycrystal could be improved to a significant extent. 

To fabricate a polycrystalline material by means of grain-boundary design, two 
fundamental steps have to be performed. The first step consists in prediction of the 
performance and thus, in the proposal of character and distribution of grain boundaries 
for the given purpose. The other one represents the processing method to achieve the 
predicted grain-boundary character distribution and configuration in order to obtain the 
proposed structural material. Practically, of course, the latter step seems to be the 
limiting one because no method has commonly been developed to produce such 
structures till now. Thermomechanical processing based on working and annealing has 
widely been applied to control microstructure of polycrystals such as grain size, second 
phase dispersion, and texture. Unfortunately, these processes are not developed to such 
an extent at present that they completely control the character and distribution of grain 
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Fig. 46. Intergranular fracture processes in polycrystals with different grain-boundary character distributions and 
boundary configurations (after WaTANabe [1993]). 


boundaries. From this point of view, magnetic annealing of ferromagnetic materials 
seems to be promising to introduce specific types of grain boundaries with a character- 
istic frequency in a given material. Of course, novel procedures have to be developed in 
future (Watanabe [1993]). 

There is an aspect of the general subject matter of this chapter which has wider 
ramifications in the context of metals recycling, and which implies that it will be a 
subject of growing importance in the future. It is clear that because of geo-political, 
economic and social factors there will be in the future a greater emphasis on the 
recycling of used materials in industrialized countries. This will reduce their vulnerability 
to external sources of raw materials and also, there are inherent energy savings in 
reclaiming metals from used articles rather than obtaining them from raw supplies of ore. 
Because of the complex mechanical systems used by industry and society and the wide 
combination of metals involved, inevitably metals derived from recovered scrap will 
contain certain unwanted residuals which bypass the refinement process (Hondros 
[1980b]). This chapter has underlined that such small quantities of noxious elements can 
provoke remarkable effects on physical and mechanical properties, and more specifically 
the accumulation of residual elements, such as copper and tin in steel from scrap, will 
have effects on the performance. Thus, an increased effort devoted to the recycling of 
metals will, in general, exacerbate the effects of unwanted tramp elements on the 
properties of alloy aimed for particular end uses. On the basis of a better metallurgical 
understanding, there will be effort devoted to a control tramp element effects, involving 
remedial measures, of the type discussed above. 

However, we envisage the need for a broader perspective, leading to a new metallur- 
gical or engineering subject matter encompassing the theory of design involving recycling. 
In this global approach, we consider the final end use of the material as the primary 
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parameter, and recyclability should be built in to the design of the final material. Thus we 
may plan for the evolution of secondary alloy products which may not be adequate for the 
original property requirements, but which may be suitable for other applications. An 
interesting example of this is a carbon steel with high levels of tin and cooper arising 
from the use of scrap containing much tin plate — this has been demonstrated to have an 
improved corrosion resistance associated with the presence of these elements in sub-scale 
layers. Such an alloy may not have adequate mechanical properties for the original 
purposes, however it can be used in situations where the improved corrosion resistance is 
advantageous. Thus, in this theory of design with recycling we can envisage recovery 
epicycles in which certain grades of alloy might be blended together to produce new 
alloys which are more suitable for specific applications. The science of interfacial 
microchemistry will make basic contributions in the future in connection with material 
design in the context of recycling, improved reliability and advances in performance. 
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1. Introduction 

1.1. Definitions 

Oxidation and hot corrosion of metallic materials are oxidation reactions (in the 
chemical sense) that result in the conversion of metals into compounds such as oxides, 
sulphides or carbides when exposed to high temperatures above about 400°C. Oxidation 
processes as the name implies refer to formation of oxide layers and occur primarily 
during exposure to air or steam. Hot corrosion is the more general term where products 
other than oxides can form during exposure to air containing contaminants such as 
sulphurous gases from the combustion of fossil fuel or specific process conditions in 
petrochemical plants where high levels of carbon activity are common. In some cases 
exposure to molten salts occurs either directly or by condensation reactions, and in that 
case the term hot-salt corrosion is used. 

1.2. General 

Protection of material exposed to aggressive environments at high temperatures 
depends on the properties of the corrosion product which should act as a diffusion barrier 
to prevent further attack by the atmosphere. Thus the layer should be chemically stable 
in the environment, prevent diffusion of both the reacting gases and the metal species 
and finally, of course, in order to maintain its effectiveness as a diffusion barrier, the 
layer should be mechanically stable, that is it should not crack or spall. The importance 
of each one of these properties will become clear as various forms of attack are 
discussed in this chapter. 

Oxidation and hot corrosion are important forms of materials degradation that often 
limit process conditions, which can have serious implications for plant reliability and 
efficiency. For example, power station boilers, petrochemical plant, gas and steam 
turbines, waste incinerators, diesel and internal combustion engines and nuclear reactors 
are all prone to degradation by oxidation and hot corrosion. The current drive toward 
energy saving by improved efficiency while at the same time minimising emission of 
pollutants places a heavy demand on the materials scientist, since in most cases these 
requirements mean higher operating temperatures and exposure to more aggressive 
environments. Clearly the high temperature behaviour of materials, therefore, can be a 
major factor inhibiting progress, and it is for this reason that the topic has such impor- 
tance for our society. 

Clean coal technology is a good example of the sort of developments in process 
technology that have important implications for the construction materials of the plant. 
Alternative processes to conventional combustion have been developed which involve 
fluidised bed combustion with additions of limestone as a sulphur capture agent, or 
gasification processes where materials are exposed to reducing environments which are 
relatively rich in sulphurous gases. In the case of fluidised bed combustion, heat 
exchange tubes must be resistant to the erosive action of particulates in the fluidised bed 
(sand, coal ash and limestone) and, since the sulphur is retained within the bed, also 
resistant to attack by the calcium sulphate formed by reaction of the limestone and the 
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sulphurous gases released during the combustion of coal. For coal gasification, on the 
other hand, process conditions are reducing to promote formation of CH 4 , H 2 and CO 
from the coal/steam mixture, so that the metallic components only form a protective 
oxide layer if highly alloyed materials are used, and, furthermore, there is a heightened 
risk of sulphidation because of the relatively high sulphur potential. Again the process 
designer wants high operating temperatures to achieve high overall efficiency. 

1.3. Outline 

The thermochemistry of oxidation is considered and illustrated by means of Elling- 
ham diagrams. Kinetics of oxidation of pure metals is developed and discussed in 
relation to the mechanisms of oxide formation. Alloy oxidation allows for the possibility 
of selective oxidation, and the criteria controlling the rate controlling processes are 
described. The importance of the strain tolerance of the protective oxide is then 
considered in relation to the mechanisms that lead to stress in the layer and the possibil- 
ity of stress relief. Here the effects of thermal cycling and the mismatch in thermal 
expansion coefficient of the oxide layer and the metallic substrate are dominant. 

Materials are not always exposed in simple oxidising environments, so that there is 
the possibility of competing reactions where oxide, sulphide or carbide may form. The 
conditions controlling these reactions are described. Again, simple gaseous attack can be 
further complicated by the presence of a deposit which can be molten or solid; here the 
chemical stability of the first formed layer may be inadequate, so that, for example, the 
oxide can dissolve when in contact with a molten layer. 

Test and measurement methods are an important aspect of any technology, so that a 
brief introduction is given to this topic before considering life prediction and modelling. 

Finally, protection is considered where alloy design is inadequate, or where the prop- 
erties of the bulk (creep strength, for example) are incompatible with the corrosion resis- 
tance. Here a range of options is available, and the different processes for the application 
of protective coatings are described. Coating design is considered: mechanical and chem- 
ical properties must be matched with those of the substrate. Design philosophies have 
evolved that exploit graded coatings to optimise both surface and interface properties. 

For a more detailed account of oxidation and hot corrosion the reader should consult 
reference books on the topic: Kofstad [1988a]; Birks and Meier [1983]; Kubaschew- 
ski and Hopkins [1966]; Hauffe [1967], Rapp [1983] and Evans [I960], 

2 . Fundamentals of oxidation 

2.1. Thermodynamics 

Classical thermodynamics can be used to determine the temperature and gas pressure 
required to cause the following reaction to take place: 

M + ■£• 0 2 -» MO (1) 

The Gibbs free energy for this reaction is: 


References: p. 1357. 
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AG = AG° + RT\n (2) 

a M -a^ 

where: AG° =-AG° M - + AG° m 

a - activity 

A G° = standard free energy 

If A G° > 0, spontaneous forward reaction will occur. At equilibrium, AG = 0 and equation 
(2) reduces to the Van ’t Hoff isotherm: 

AG° = -RT In ° M0 . n (3) 

Since the standard free energy of elements is zero, equation (2) becomes: 

AG° mo = —RT In (4) 

Equations (3) and (4) describe the conditions at equilibrium where both the metal and 
oxide are stable, or, in other words, where the formation of oxide on a metal just begins. 
The equation can be further simplified since the activity of a material in the pure state 
is unity and under conditions of low pressure and high temperature, the activity of 
oxygen can be approximated to its partial pressure, thus: 

RT 

AG° mo = In Pc>1 (5) 

Therefore, knowing the Gibbs standard free energy of oxide formation at a given 
temperature, we can readily calculate the oxygen partial pressure required for oxidation 
to occur. Richardson and Jeffes have compiled the standard free energies of formation of 
oxides as a function of temperature, and fig. 1 shows these data. These diagrams are 
commonly known as Ellingham diagrams and allow the oxygen pressure for oxide 
formation at any required temperature to be determined directly. This is done by aligning 
the point O at the upper left of the diagram with the reaction and temperature of interest, 
and reading off the pressure on the scale on the right of the diagram. For example, for 
A1 2 0 3 at 1000°C, align the point O with the line representing the reaction 4/3A1 + 0 2 = 
2/3Al 2 0 3 to intersect it for 1000°C and read off the p(0 2 ) of 10“ 37 atm on the p(0 2 ) scale 
at the bottom of the diagram. Also included on this diagram are scales which give similar 
information about oxidation in two commonly encountered gas mixtures, C0/C0 2 and 
H/HjO; in these cases, however, the critical gas mixture is given by aligning the 
temperature and reaction of interest with the points C and H, respectively. Similar 
diagrams can be constructed that indicate conditions for formation of sulphides, carbides, 
halides or any other reaction product of interest. (See also ch. 5, § 5). 

In considering the oxidation of an alloy it is useful to decide which components 
oxidise preferentially. This may be determined by comparing the free energies of oxide 
formation; the oxide with the most negative free energy of formation forms preferential- 
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Fig. 1. Ellingham diagram for some oxides of importance in the high temperature oxidation of metals and 
alloys. 


ly. Whether a separate phase forms depends upon the concentration of the components 
and the mutual solubility of the various oxide phases. If no new phases form, the 
oxidation rate may be changed by solution of the oxides, and as will be discussed later, 
this alters the number of defects in the oxide, thereby changing diffusion rates. 
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2.2. Oxide structure 

It is necessary to have an understanding of oxide structure before proposing models 
for the formation and further growth of the oxide layer. An oxide growing on a metal or 
alloy may be amorphous or crystalline; generally the very thin films are amorphous and 
thicker films crystalline. As will be shown later, oxide growth involves the movement of 
ions and electrons and it is important therefore to understand the restrictions imposed by 
the structure of amorphous or crystalline oxides upon the movement of ions and electrons. 

2.2.1. Amorphous oxides 

These oxides can be classified according to their bond strengths; i.e., glass or network 
formers with covalent bond strengths greater than 300 kJ mol -1 , intermediates with bond 
strengths between 300 and 200 kJ mol -1 and network modifiers with ionic bond strengths 
of less than 200 kJ mol' 1 . Zachariasen [1932], proposed certain rules for glass-forming 
oxides, which are useful although not universally applicable because of kinetic consider- 
ations. Oxides of the form MO and M 2 0 do not form networks; M 2 0 3 can form a 
network if the oxygen ions form triangles around each metal atom; M0 2 and M 2 0 5 form 
networks if oxygen tetrahedra form around each metal atom; M0 3 , M 2 0 7 and M0 4 are 
also probable network formers. The cations are tightly bound in a network former 
because of the high bond strengths. A large ion moves more easily than a small ion in 
such an open network because the small ions are mainly incorporated in the network so 
that anion transport would be expected in network formers. Since network modifiers do 
not form extensive networks and the bond strengths are generally low, cation transport 
is expected in these oxides. 

2.2.2. Crystalline oxides 

Crystalline oxides which form as the result of oxidation of a metal are generally non- 
stoichiometric (Kofstad [1972]) and thus exhibit both ionic and electronic conductivity. 
The non-stoichiometric oxide may be either metal-rich or metal-deficit. Oxides having an 
excess of metal ions maintain electrical neutrality by incorporating additional electrons; 
this type of compound is known as an n-type semiconductor. Oxides deficient in metal 
maintain electrical neutrality by the formation of positive holes; a positive hole occurs 
where the valence state of the cation in the oxide is higher than usual; these compounds 
are p-type semiconductors. The non-stoichiometry of these oxides can be accommodated 
in a number of ways. Figure 2 depicts an n-type semiconductor with the excess metal 
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Fig. 2. Zinc oxide as an example of a metal excess (n-type) semiconductor. 
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located at interstitial positions in the lattice. Alternatively, the oxides may have oxygen 
ion vacancies. Figure 3 illustrates the structure of a p-type semiconducting oxide which 
can in principle have both oxygen interstitials and metal vacancies. Generally only metal 
vacancies are found, although in compounds of the CaF 2 type interstitial anions occur, for 
example Th0 2 , U0 2 , etc. The defect structure that has been described allows easy 
movement of ions, cations via interstitials or vacant cation lattice sites and anions via 
vacant anion sites or occasionally via interstitials. 

It can be seen, therefore, that an n-type semiconducting oxide may allow anion or 
cation transport, whereas p-type oxides generally permit only cation transport. Table 1 
classifies some oxides according to their semiconducting nature, where amphoteric oxides 
may have either metal excess or metal deficit (n- or p-type). 

2.3. Kinetics 

An essential concept in oxidation of metals and in hot corrosion generally is that the 
reaction product can act as a barrier between the metal and the environment thereby 
controlling rates of attack. Thus, the properties of the reaction product often determine 
subsequent rates of reaction and the mechanism of the oxidation process is reflected in 
the kinetic laws that are followed. By accurately determining the kinetics of the process 
from relatively short-term experiments it should be possible to determine long-term 
behaviour by extrapolation. However, this can only be done if there is no change in the 
process and, as will be discussed subsequently, discontinuities are frequently observed in 
oxidation kinetics. There is greater certainty about the reliability of such an extrapolation 
if the initial data represent a significant fraction of the lifetime to be estimated. 

The movement of ions within an oxide layer, necessary for oxide growth, may involve 
a thermally activated diffusion process, or assistance by an electric field may reduce the 
overall activation energy to zero. The two basic mechanisms of oxide growth convenient- 
ly divide the subject into (i) the thin oxide film region (i.e., growth by a field assisted 
mechanism), and (ii) the thick film region (i.e., growth by thermal activation). The transi- 
tion between the two processes usually occurs at a limiting thickness of about 100 nm. 

In general, the kinetics of oxidation can be described by four laws, viz., logarithmic, 
cubic, parabolic and linear (fig. 4). All four kinetic laws are found in the thin film region. 
Linear for the first few monolayers of oxide growth, logarithmic up to about 50-100 atom 
layers of oxide and cubic or parabolic at still greater thicknesses. In the thick-film region 
a parabolic law is commonly encountered as well as linear growth laws. Often a transition 
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Fig. 3. Nickel oxide as an example of a metal deficit (p-type) semiconductor. 
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Table 1 

Electrical conductivity of oxides (after Kubaschewski and Hopkins [1966]) 


Metal-excess semiconductors (n-type) 

BeO, MgO, CaO, SrO, BaO, Ce0 2 , Th0 2 , U0 3 , U 3 O a , Ti0 2 , Zr0 2 , V 2 O s , 
Nb 2 0 5 , Ta 2 O s , Mo0 3 , Mn0 2 , Fe 2 0 3 , MgFe 2 0 4 , NiFe 2 0 3 , ZnFe 2 0 4 , W0 3 
ZnCo,0 4 , ZnO, CdO, A1 2 0 3 , MgAl 2 0 4 , ZnAl 2 0 4 , T1 2 0 3 . Si0 2 , Sn0 2 , Pb0 2 

Metal-deficit semiconductors (p-type) 

U0 2 , Cr 2 0 3 , (< 1250°C), MgCr 2 0 4 , FeCr 2 0 4 , CoCr 2 0 4 , ZnCr 2 0 4 , MnO, 
Mn 3 0 4 , Mn 2 0 3 , FeO, NiO, CoO, PdO, Cu 2 0, Ag 2 0, CoAl 2 0 4 , NiAl 2 0 4 

Amphoteric conductors 

TiO, Ti 2 0 3 , VO, Cr 2 0 3 (> 1250°C), Mo0 2 , (Ir0 2 ), Ru0 2 


from parabolic (protective oxidation) to linear (breakaway oxidation) is observed and this 
is normally associated with mechanical failure of the protective oxide. 

2.3.1. Thin him region 

At low temperatures, electrons from the metal are able to pass through an oxide film 
to the oxygen adsorbed at the outer surface; electrons do this by the tunnel effect (a 
quantum-mechanical process by which electrons of lower maximum energy than that of 
a barrier possess a finite probability of penetrating that barrier). Electrons pass through the 
initial chemisorbed film to the outer surface and form oxygen anions, and metal cations form 
at the metal/oxide interface. It has been shown that the potential drop across the film is 
approximately 1 V so that with a film thickness of 5 nm, the field strength would be of the 
order of 10 7 V cm' 1 . The strong electric field generated is able to pull ions through the film. 



Fig. 4. Oxidation kinetics. 
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Logarithmic kinetics have been interpreted as resulting from the rate-controlling step 
being either anion movement or the electron tunnelling process which creates the field 
across the oxide layer initially (Mott [1947]; Hauffe and Ilschner [1954]), and an 
inverse logarithmic law can result when cation transport is the slow step (Hauffe [1966]; 
Evans [I960]; Cabrera and Mott [1949]). Cubic or parabolic kinetics follow from the 
prediction that the rate of diffusion of ions is proportional to the magnitude of the 
electric field, which under constant potential decreases with increasing thickness; a 
parabolic law follows for p-type semiconductors and a cubic law for n-type semiconduc- 
tors (Cabrera and Mott [1949]; Engell etal. [1954]). Linear kinetics are observed in 
the formation of the first few monolayers of oxide where the growth process is controlled 
by surface diffusion and adsorption. 

2.3.2. Thick film region 

In the thick film (thermally activated) region, parabolic kinetics are derived by 
assuming that film thickening is controlled by diffusion in a concentration gradient 
(x 2 = 2k p 't=kpt where w is the oxide thickness, t is the time and k^ (or kp) is the 
parabolic rate constant). Linear kinetics are observed when the barrier provided by the 
oxide layer fails, so that either the oxide fails completely and there is direct contact 
between the metal and the gas, or partial mechanical failure occurs so that a constant 
thickness of protective oxide remains. Most practically important oxidation problems 
relate to the thick oxide film region and thus most emphasis will be given to the 
consideration of oxidation processes in this technically important region: 

a) Wagner’s theory 

Wagner [1975] derived an expression for the rate of thickening of the oxide expressed 
in terms of the specific conductivity of the oxide, the transport numbers of cations, 
anions and electrons and the free energy decrease during the oxidation reaction. This 
simplified derivation assumed that the conductivity is independent of the oxygen pressure 
and in fact this is not the case, so that a more complex expression results. There are only 
three metal oxide systems that accurately follow Wagner’s parabolic equation; viz. 
Cu/Cu 2 0, Co/CoO and Fe/FeO, and the very good agreement between the calculated and 
observed rate constants is evident from Table 2. 

Wagner’s theory of parabolic oxidation refers only to monophase scales, but in 
practice most oxides layers are multiphase, for example FeO, Fe 2 0 3 and Fe 3 0 4 all occur 
on iron when oxidised at temperatures above 600°C. It has been shown, however, that 
the rate of growth of each phase obeys parabolic kinetics and that their relative thickness 
is independent of time but temperature-dependent. 

b) Deviations from Wagner’s theory 

One of the main assumptions in Wagner’s model is that the scale is compact, but 
frequently scales are observed to be porous, and therefore under these conditions 
deviations from Wagner’s theory are to be expected. A porous scale can form during 
cationic or anionic growth. For example, consider the case of nickel where upon 
oxidation a compact outer layer of NiO and a porous inner layer, also of NiO, are 
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Table 2 

Measured and calculated rational rate constants* 


Temp. 1000°C 

Pressure, atm 

2Cu + 0 2 —> Cufi 

k, calc, 
eq cm -2 sec -1 

k, obs. 
eq cm -2 sec' 1 

8.3 x 1(T 2 

6.9 x 10' 9 

6.2 x 10- 9 

1.5 x 10~ 2 

4.8 x 10-* 

4.5 x 10' 9 

2.3 x 1(T 3 

3.4 x 10' 9 

3.1 x 10- 9 

3.0 x 10^ 

2.1 x 10^ 

2.2 x lO" 9 


2Co + 0 2 —>2CoO 


Pressure, 1 atm 

k, calc. 

k r obs. 

Temp. °C 

eq cm -2 sec' 1 

eq cm' 2 sec' 1 

1000 

1.25 x lO" 9 

1.05 x 10^ 

1148 

5.15 x 10- 9 

1.30 x 10- 9 

1350 

3.15 x 10' 8 

4.65 x 10' 9 


2Fc + 0 2 — ^ 2FeO 


Pressure, 1 atm 

k, calc. 

k, obs. 

Temp. °C 

eq cm -2 sec -1 

eq cm" 2 sec" 1 

800 

2.5 x 10 -9 

2.6 x 10' 9 

897 

1.1 x 10" 8 

1.2 x 10" 8 

983 

2.8 x 10" 8 

3.3 x 10“* 


* The rational parabolic rate constant, kp is used when the reaction rate is expressed in terms of equivalents of 
M,O b per square centimetre per second. 


formed. A single layer of nickel oxide forms during the early stages of growth of the 
oxide, but as the layer thickens a duplex structure develops, which consists of an inner 
region of equiaxed, fine-grained crystallites and an outer region of large columnar 
crystals. The inner layer is generally more pronounced on less pure materials, and is 
believed to be due to the presence of impurities segregating to the grain-boundaries, 
thereby inhibiting grain growth. Tracer diffusion studies have shown that the outer layer 
grows by movement of nickel vacancies along the grain-boundaries, and the inner layer 
by molecular oxygen penetration along microcracks and fissures which are present in the 
outer layer due to the build-up of stress in that layer. In the case of nickel oxide, 
compressive stresses result because of the constraints imposed on the oxide layer by the 
receding metal. 

A dissociative mechanism has been proposed by Mrowec [1967] for the formation 
of scales of this type. Metal atoms enter the scale at the metal/oxide interface and diffuse 
through the existing scale and react with oxygen at the oxide/oxygen interface to form 
new oxide. There must be corresponding movement of cation vacancies in the opposite 
direction so that a cationic vacancy flux arrives at the metal/oxide interface and must 
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somehow be removed. Whilst the film is thin, plastic deformation of the oxide is 
sufficient to allow continued adherence to metal, but as it thickens the oxide becomes 
stronger and can no longer deform, so that longitudinal cracks form at the metal oxide 
interface and the rate of transport of metal from the substrate to the oxide decreases. 
However, because of the chemical potential gradient in the scale, outward diffusion of 
metal can still occur. This disturbs the equilibrium at the inner surface of the scale which 
leads to an increase in chemical potential of oxygen at that surface and eventually oxide 
decomposes to form molecular oxygen, cations and electrons. The growth of an inner 
layer due to the presence of molecular oxygen is then possible and the processes 
described are illustrated in fig. 5 which shows the dissociative mechanism for double 
layer formation in a single-phase scale. This explanation is disputed by Kofstad [1985] 
who proposed that pores form due to plastic deformation and creep caused by growth 
stress in the scales. For scale growth controlled by diffusion, a gradient in defects will 
be developed through the scale such that a lower concentration of defects will occur in 
the outer layers of the scale; this will have the effect of promoting faster diffusional 
creep in these outer layers, so that there will be greater stress relief there and will thus 
avoid the development of the porous structure observed in the inner layers. KOppen- 
bender and SchOtze [1994] have demonstrated that “pseudo”-plasticity can occur by a 
micro-cracking and healing mechanism. Pseudo-plasticity requires compressive defor- 
mation in one or two directions and the possibility of oxide grain movement in at least 
one direction perpendicular to the compressive deformation axis. 

The other major assumption often used in applying Wagner’s theory is that bulk 
lattice diffusion processes occur. This is only valid at high temperatures (> -0.6 T mp ), and 
most industrially relevant materials are used at lower temperatures. Thus, for polycrystal- 
line oxides, grain-boundary diffusion becomes the dominant process. This is shown very 
clearly by referring to fig. 6 which shows the parabolic rate constant for the oxidation of 
nickel as a function of temperature compared with calculated lattice and grain-boundary 


0 2 M z *.2e' 



Fig. 5. Schematic representation of the dissociative mechanism. 
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Fig. 6. Comparison of the oxidation rate of nickel with lattice and grain-boundary diffusion of Ni in NiO 
(Atkinson [1987]). 


diffusion constants (Atkinson [1987]). By extrapolation of the experimental data shown 
in fig. 6, it can be seen that it is only at temperatures of about 1200°C and above that the 
activation energies for lattice diffusion and oxide formation would be similar (230-250 
kJ mol -1 ). At the lower temperatures the rate of oxidation is increasingly greater than 
would be predicted assuming control by bulk diffusion in the oxide lattice, activation 
energies being reported in the range 155-170 kJ/mol. The effect of prior cold-work in 
the nickel is to increase the oxidation rate, but the observed rate law is usually less than 
parabolic. Both these observations suggest that the rate of oxidation is controlled by 
grain-boundary diffusion in the oxide; the less than parabolic rate observed in the cold- 
worked material occurs because the initially fine-grained oxide coarsens during the 
oxidation process, thereby eliminating some short-circuit diffusion paths. Models have 
been developed (e.g., Smeltzer etal. [1961]) to describe the oxidation reaction where 
the rate is controlled by dual lattice and grain-boundary diffusion in which the effective 
diffusion coefficient is given by: 

D e (r = D, + 2(D'8)/g (6) 

where g is the grain size normal to the growth direction, 8 is the grain-boundary width, 
and D' and D, are the diffusion coefficients of the boundary and the lattice, respectively. 
Atkinson [1987] used this model to calculate parabolic rate constants for the oxidation 
of nickel where grain-boundary diffusion dominates, and there was good agreement 
between the calculated values and those obtained experimentally, as shown in fig. 6. 
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2.4. Properties of oxide layers 

It will have become clear from the preceding discussion that the electrical and 
mechanical properties of an oxide layer are of paramount importance in controlling the 
course of the oxidation reaction. We will now consider how these properties may be 
modified in order to control the rate of oxidation of metals. 

2.4.1. Electrical properties (diffusion) 

The defect structure of an oxide may be modified by a small addition of other 
elements and, if the oxidation process proceeds by the Wagner mechanism, one might 
expect changes in the oxidation rate to occur. For example, the addition of a monovalent 
metal to NiO, a cation-deficit p-type semiconductor, would be expected to reduce the 
cation vacancy concentration and hence also decrease the oxidation rate. Conversely, if 
ions of a higher valency are added to a p-type oxide, then the cation vacancy con- 
centration will increase and thus also the oxidation rate. The reverse situation would be 
expected for n-type oxides. 

Similarly, if the anion sub-lattice is altered, for example, by the incorporation of N' v , 
this would increase the number of vacant anion sites in an n-type semi-conductor and 
hence the oxidation rate would be increased if anion transport were the rate controlling 
step. This mechanism has been proposed to explain the differences in oxidation rates of 
titanium in pure oxygen (where a low value is observed) and in air (a high value). 

For a dopant to be effective the solubility, which is determined principally by ionic 
size, must be sufficiently high. A further probable restriction is that for growth processes 
controlled by cation diffusion the dopant should be added from the gas phase since new 
oxide is formed at this interface, and for anion diffusion the dopant should be added 
from the alloy. 

Thus it can be seen that the addition of minor amounts of other elements can cause 
large differences in the oxidation rate where the Wagner mechanism is operating. It has 
already been noted that the Wagner mechanism has only been quantitatively established 
in three cases. However, there are many more examples where qualitative effects of the 
type just described confirm the validity of this mechanism. 

In the ease where grain-boundary diffusion is rate-controlling, the effect of impurities 
can be to alter the diffusion rates per se or to influence the microstructure. In the latter 
case segregation effects are likely to dominate, leading to a reduction in grain size and 
a consequent increase in the oxidation rate. Very few data exist to indicate which is the 
controlling effect, but, in the case of nickel, theoretical modelling of the grain-boundary 
structure by Duffy and Tasker [1986] and experimental measurements of grain- 
boundary diffusion rates (Atkinson and Taylor [1986]) indicate that impurities often 
decrease grain-boundary diffusion rates (fig. 7). Thus it would appear that the effects of 
impurities in increasing the oxidation rate of nickel most probably results from a 
reduction in the oxide grain size with a consequent increase in the number of short- 
circuit diffusion paths. 
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Fig. 7. Arrhenius plot summarising the diffusion coefficients for various impurity cations in the NiO lattice, 
gTain-boundaries and dislocations (Atkinson and Taylor [1986]). 


2.4.2. Mechanical properties (stress generation and relief) 

Very recently a critical review of this topic was undertaken in a Workshop on the 
Mechanical Properties of Protective Oxide Scales which was published in Volume 12 of 
the Journal Materials at High Temperatures, 1994. 

It was pointed out some time ago by Pilling and Bedworth [1923] that, depending 
upon the growth processes involved, an oxide layer could develop internal stresses as a 
result of the growth process. If, for example, cation transport is the rate-controlling 
process, new oxide would form at a free surface, that is, the oxide/gas interface, and thus 
no stress would be imposed. However, if the oxide grows by oxygen transport, new 
oxide would form at the metal/oxide interface and the metal lattice would then have to 
expand or contract to accommodate the newly formed oxide. If the volume of oxide were 
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larger than the volume of metal consumed, the oxide would be in compression, and, if 
vice versa, the oxide would be in tension. The ratio, oxide volume/metal volume, is 
called the Pilling-Bedworth ratio, and for most metals it is greater than one. Growth 
stresses have also been observed in oxides which form by cation transport, so that other 
factors need consideration. It is clear that unless some stress-relief mechanism operates, 
build-up of growth stresses will ultimately result in cracking or blistering of the oxide 
layer, and, as was discussed earlier, this would result in a change of kinetics from 
parabolic (protective oxidation) to linear (non-protective or breakaway oxidation). 

In addition to stresses arising from the oxide volume/metal volume ratio, other 
mechanisms for stress generation have been proposed and recently reviewed by Stott 
and Atkinson 1994. These include: epitaxial stresses, recrystallisation within the scale, 
the composition of the metal or oxide may change during oxidation, point defect stress, 
oxide growth within the oxide layer, coulombic forces due to the electric field produced 
by differing mobilities of the ions, and finally thermal stress (i.e., stress arising from 
thermal cycles where a thermal expansion coefficient mismatch exists between the oxide 
and substrate) which is usually by far the most important cause of stress generation in 
oxides formed on engineering iron- and nickel-base alloys. 

That zirconium oxidises to produce a coherent oxide is surprising, since anion trans- 
port is favoured in the oxide and, with a large Pilling-Bedworth ratio, one would expect 
considerable stresses to be developed which could cause the oxide to crack or spall. It is 
evident, therefore, that various mechanisms which allow stress relief by plastic deformat- 
ion must operate. The following processes have been proposed; dislocation glide, grain- 
boundary sliding, mechanical twinning, viscous flow in amorphous oxides, Herring- 
Nabarro or diffusion creep, and finally, dislocation climb processes are often observed 
and are probably the most generally applicable mechanism for stress relief in oxides. 

Where plastic deformation of oxides is insufficient for benign stress relief, catastroph- 
ic stress relief will take place by spallation and scale cracking. Evans [1994] has 
developed spallation maps which can be used to predict regions of oxide stability as a 
function of the temperature drop during thermal cycling. These maps are based on two 
possible modes of scale spallation in which buckling can occur when the interface is 
weak and the oxide relatively strong (usually when the oxide is thin) or the wedging 
mode of failure where the interface is strong and the oxide is relatively weak (usually for 
thick oxides). Figure 8 is an example of such a spallation map, calculated on the basis 
of measured oxide properties for alumina formed on a ferritic steel. 

In simplistic terms, scale failure occurs when the in-situ stress exceeds some critical 
value (<7 C ). The critical stress can be evaluated using simple fracture mechanics con- 
siderations to predict (Evans and Cannon [1987]; Hancock and Nicholls [1988]), 
whether through-thickness cracking (tensile failure) or scale spallation (compressive 
failure) results. For the case of tensile failure, the critical stress is given by: 

= K u {irll)' Vl (7) 

where K lc is the fracture toughness and a is a measure of the defect size present in the 
oxide. K lc is related to the energy of fracture by K lc 2 = 2 E 0 -y 0 where E 0 is the oxide 
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Fig. 8. Notional spallation map for alumina on Fecralloy steel (Evans [1994]). 


elastic modulus and y 0 is the energy required to form a unit area of new oxide surface. 
Hence fracture mechanics analysis (Evans and Cannon [1987]; Hancock and 
Nicholls [1988]; Nicholls and Hancock [1989]) and the assessment of fracture strain 
energies (Evans [1988]) are alternative approaches to solving the same problem. 

The in-situ stress acting at a defect, within the oxide or at the oxide/metal interface, 
is the summation of all stresses acting on the defect. This stress will include growth 
stresses, thermally induced stresses and directly applied stresses which may be tensile, 
compressive or bending. Hence to quantify oxide fracture, the state of stress, fracture 
toughness and defect distributions present within the oxide or at the metal/oxide interface 
(depending on the mode of failure) must be capable of being assessed. This topic was 
recently reviewed in depth by Hancock and Nicholls [1994]. 

3. Oxidation of alloys 

The same principles apply to the oxidation of alloys as have been discussed for pure 
metal oxidation. The problem is more complex however, because there are at least two 
constituents which can be oxidized. The added complexity makes prediction of alloy 
oxidation rates extremely difficult. 

3.1. Selective oxidation 

Selective oxidation of a component of an alloy will occur when the free energy of 
formation of its oxide is more negative than that of the oxide of the other alloy constit- 
uents. Dependent upon the difference in free energies, temperature, oxygen pressure and 
concentration, it is possible for the oxide of the minor constituent to form to the 
exclusion of all other oxides. Wagner [1956] derived an expression for determining the 
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minimum atomic fraction N A(min) in an alloy AB for exclusive formation of a coherent 
layer of the oxide AO, given that the AO will form in preference to BO: 


N 


A(min) 


16 Z A c 


irk. 


\'/ 2 


(8) 


where d is the interdiffusion coefficient in the alloy, Z A is the valency of A in the oxide, 
kp is the parabolic rate constant for growth of oxide AO, and c is the number of gram 
atoms of metal per unit volume. 

This equation should be treated as predicting only the necessary minimum con- 
centration; it is not a sufficient condition in itself. For example, for the copper-aluminium 
system at 800°C the calculated value N A(min) = 0.004 (0.0008 wt% Al); however, in 
practice even when N A = 0. 1 1 (5 wt% Al) some copper oxides still form. Examination of 
the compositional changes that occur at the alloy/oxide interface as a result of selective 
oxidation shows that the concentration profile is determined by two opposing processes, 
consumption by the oxidation process and replenishment by diffusion within the alloy. 
Clearly, if a protective oxide is removed for any reason and a depleted alloy is exposed 
to the oxidising environment, the protective layer may not be able to reform; it is this 
situation which is usually responsible for breakaway oxidation. Exclusive formation of 
A1 2 0 3 , Cr 2 0 3 or less frequently, Si0 2 , is most commonly used as a means of protecting 
many industrially important alloys from oxidation. Generally, the oxidation rate of 
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alumina-forming alloys is lower than that of chromia-forming alloys, but the lowest rates 
of all result when silica scales form (fig. 9). In the latter case this is due to the fact that 
silica is amorphous at temperatures up to about 1200°C and thus grain-boundary 
diffusion is no longer a factor controlling mass transport. 

In cases where several different oxides form, the scale forms in such a way that the 
oxide with the highest dissociation pressure (most positive A G°) forms at the oxide/gas 
interface and that with the lowest dissociation pressure at the metal/oxide interface. In 
other cases selective oxidation results in an enrichment of an alloy constituent in the 
oxide layer relative to the alloy concentration without exclusive formation of its oxide. 
If the oxides are mutually soluble the major constituent of the scale will have its defect 
structure altered in a manner previously described by incorporation of the solute. Where 
the oxides do not form solid solutions, composite oxide layers result. A commonly 
encountered situation is the formation of complex oxides (double oxides, spinels, etc.). 
This may happen when oxides of the alloying components react with one another to form 
a more stable oxide. Ni-Cr, Ni-Al and Fe-Cr alloys provide good examples of this 
situation where spinel layers result. It is often observed that diffusion rates are much 
lower in spinels than in normal oxides, and protective oxide scales may consist of these 
complex oxides. Wood and Stott [1983] have summarised the oxide growth mech- 
anisms for these cases and the various possibilities are illustrated schematically for the 
nickel-aluminium system as shown in fig. 10, where alloys rich in aluminium eventually 
form an inner layer of alumina, or for more dilute alloys where the alumina forms only 
as an internal oxide. In the first case, after simultaneous formation of both oxides, the 
slower growing oxide (A1 2 0 3 ) forms in the inner regions by the displacement reaction 

Ni-Al rich inAl Ni-Al low in Al 



Alloy Alloy 

Al,0, NiO Al,0, NiO 



Alloy «Al 2 03 subscale 


Fig. 10. Schematic representation of transient oxidation of Ni-Al alloys (Wood and Stott [1983]). 
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3NiO + 2A1 3+ — » A1 2 0 3 + 3Ni 2+ , while at the interface between the A1 2 0 3 and the NiO 
layers a solid solution reaction occurs to allow formation of the complex oxide NiAl 2 0 4 . 
The case of internal oxidation is discussed in more detail in the next Section. 

3.2. Internal oxidation 

When oxygen dissolves in the alloy, the more active component (most negative AG°) 
may form a precipitate of oxide particles within the alloy. This phenomenon is known as 
internal oxidation or sub-scale formation. Internal oxidation requires a higher diffusion 
rate of oxygen in the alloy than that of the alloying component. This means that an 
oxygen gradient is established and the dissolved oxygen reacts with the most reactive 
constituent to form the oxide. Wagner [1959] studied the transition from internal to 
external oxidation and has derived a model based on the fact that the cross-section 
available for oxygen diffusion is reduced by the presence of internal oxide. Above a 
certain alloy concentration, oxygen diffusion slows down to the extent that only an 
external oxide forms. Lowering the oxygen partial pressure also encourages external 
oxidation since the solubility of oxygen in the alloy is reduced. If internal oxidation 
occurs, the oxidation rate is usually high because a protective oxide layer is not 
established. The transition to external oxidation will result in a reduction of the oxidation 
rate since now a complete barrier between the alloy and the environment exists. 

3.3. Intermetallic alloys 

Intermetallic materials are increasingly considered for engineering applications 
because of their high temperature strength and low density, and a comprehensive review 
has recently been published by Doychak [1994]. Generally these materials are based on 
the aluminide or silicide systems where protective alumina or silica scales can form. A 
particular problem relating to intermetallics, especially NbAl 3 and MoSi 2 , has been the 
so-called “pest”, where accelerated oxidation occurs at an intermediate temperature 
(550-850°C) after a brief incubation period. Early work on the mechanisms of this 
process proposed that it was stress-induced, but more recent work (Grabke etal. [1991]; 
Doychak et al. [1991]) suggests that the rapid degradation of NbAl 3 is the result of 
inward diffusion of oxygen through an outer layer of Nb 2 0 5 , continued inward diffusion 
of oxygen along the aluminium-depleted intermetallic grain-boundaries and precipitation 
of internal alumina within the intermetallic in the region unaffected by oxidation, as 
illustrated schematically in fig. 11. 

3.4. Scale adhesion 

As was mentioned in the introductory section, a major problem that must be 
addressed once a protective film has been established is the control of adhesion or 
mechanical integrity. It has already been pointed out that the protective oxide layer can 
be subjected to stresses dependant upon the operating growth mechanisms, which usually 
results in compressive stresses. Additionally, however, thermal cycles (cooling to room 
temperature) also result in compressive stress being applied to the oxide scale since most 
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(a) Amorphous external scale 




Fig. 11. Schematic of a) the accelerated oxidation mechanism on NbAl 3 at 727°C and b) the initiation of pest 
degradation (Doychak and Hesbur [1991]). 


oxides have a lower coefficient of thermal expansion than the metal or alloy. The 
additive effect of these compressive stresses is often sufficient to cause scale spalling. 

Small additions of “ active elements” (i.e., elements with a high affinity for oxygen) 
and notably the rare earths, are known to be very effective in promoting the formation 
of an adherent oxide layer that is resistant to thermal cycles. The active element can be 
added in elemental form or as an oxide dispersoid. In the latter case, a novel series of 
alloys produced by mechanical alloying — the so-called oxide dispersion strengthened 
(ODS) materials which have been developed primarily for enhanced high-temperature 
strength, also show good oxidation resistance (Michels [1977]). The alloy MA6000 
(Ni-15 Cr, 4.5 Al, 4 W, 2.5 Ti, 2 Mo, 2 Ta, 1.1 Y 2 0 3 , by weight) is now being used in 
some gas turbine applications. Early versions of this type of material, e.g., thoria- 
dispersed (TD) alloys, were evaluated some time ago and Stringer et al. [1972] 
reported that TD-Ni20Cr had excellent resistance to spalling and reduced oxidation rates 
compared with the simple binary alloy. 

As well as improving adhesion of the scale, the active element addition also reduced 
the growth rate and the concentration of chromium or aluminium required for preferential 
formation of the scale, particularly for the chromia-forming alloys. Whittle and 
Stringer [1980] reviewed the various theories that have been proposed to account for 
this effect, which include enhanced scale plasticity, formation of a graded seal, modificat- 
ion to the oxide growth process, stronger chemical bonding at the interface, elimination 
of voids by inert oxide particles acting as vacancy sinks and, finally, oxide protrusions 
into the alloy which act as “pegs” to improve adhesion. More recently, Luthra and 
Briant [1986] Smeggil etal. [1984] and Less [1987] have proposed that segregation of 
sulphur to the scale/alloy interface is responsible for the poor adhesion of the oxide, and 
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that the effect of the “active element” is to scavenge the sulphur present in the alloy and 
so restore the intrinsically strong bond between the oxide and the substrate. Luthra and 
Briant [1986] have been unable to confirm this effect experimentally, but both 
Funkenbusch etal. [1985] and Smialek [1987] have reported results that indicate that 
for high-purity alloys with sulphur contents of less than about 10 ppm, adherent oxide 
scales were formed on alumina-forming nickel-base alloys without rare earth alloying 
additions. Pieraggi and Rapp [1994], proposed that the active element blocks active 
sites, i.e., interfacial dislocations, that support diffusion growth at the scale/alloy 
interface, and thereby alter the growth mechanism and hence also the adhesion of the 
scale. All these various mechanisms were recently reviewed by Strawbridge and Hou 
[1994], and it is clear that no one theory can satisfactorily explain all the experimental 
observations. 

4. Multi-component atmospheres 

4.1. Phenomenology 

In many industrially relevant cases, alloys are exposed to environments containing a 
mixture of gases so that there exists the possibility of forming phases other than oxides. 
Commonly sulphides, carbides and to a lesser extent chlorides are formed, and in almost 
every case formation of these phases results in greatly increased rates of attack. This is 
because diffusion rates in sulphides and carbides are higher than in the corresponding 
oxide, and chlorides are often volatile, thus providing no barrier to further reaction. 
However, in most practical cases, alloys are used that contain reactive elements such as 
aluminium or chromium and there is usually sufficient oxygen present in the environment 
to allow formation of an initially protective oxide layer. Nevertheless, ingress of sulphur, 
carbon or chlorine through this protective layer can occur, leading to formation of the 
sulphide, carbide or chloride of the oxide-forming constituent within the alloy so that it 
becomes depleted in that constituent. Thus, should that oxide layer be damaged by 
growth stresses or differential contraction induced by thermal cycles, the alloy would be 
unable to reform a protective oxide. Increased rates of attack may not always be entirely 
due to alloy depletion effects. For example, oxide doping could occur, and it has been 
suggested that in the case of Cr 2 0 3 , S~ is introduced which leads to an increase in the 
chromium vacancy concentration and a corresponding increase in diffusion rates. In 
addition, the size of the impurity ion may cause a dilation of the crystal lattice and hence 
also encourage diffusion. Since most metallic sulphides, carbides or halides are less 
stable than the oxide, continued reaction results in the conversion of these second phase 
particles to the oxide which does not form a coherent protective layer, as shown, for 
example, in the following reaction: 

2MS + 0 2 -> 2MO + S 2 (9) 

It can be seen from this reaction that sulphur is released and is then available for further 
attack upon the alloy. This form of cyclic sulphidation/oxidation/sulphidation is fre- 
quently observed and leads to catastrophic rates of attack. 
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The process by which these aggressive species penetrate the protective oxide layer is 
often greatly disputed in specific cases, but in principle there are two possibilities: (a) 
bulk diffusion or (b) short-circuit diffusion via grain-boundaries or microcracks within 
the oxide layer. It will be shown in a following section that the nature of the diffusion 
path can sometimes be determined from the reaction products. 

4.2. Prediction of reaction products 

A powerful method of determining the nature of the product when an alloy reacts 
with a complex gas mixture is to construct the relevant phase stability diagram, and with 
a knowledge of the composition of the gas mixture the thermodynamically stable phases 
are indicated. For example, the phase stability diagram for the Fe-O-S system at 800°C 
plotted as a function of the partial pressures of oxygen and sulphur is shown in fig. 12. 
Thus, for a typical combustion gas mixture containing C0 2 , 0 2 , N 2 , H 2 0 and about 400 
ppm S0 2 the composition can be determined in terms of p(0 2 ) and p(S 2 ) and is shown 
as “X” in the diagram. It can be seen that Fe 2 0 3 is the stable phase. However, if p(0 2 ) 
is reduced and p(S 2 ) is increased as, for example, in a coal gasification environment, the 
composition of this gas mixture given as “Y” in the diagram indicates that FeS would be 
the stable phase. It should be noted that the axes selected for these diagrams do not relate 
to the major constituents of the gas mixture, but because equilibrium is assumed in the 
calculation, the same result would be obtained no matter what oxygen- or sulphur- 
containing species were used as a variable. For simplicity, oxygen and sulphur are 
usually chosen when plotting diagrams of this nature. 

The phase stability diagram can be constructed by considering all possible equilibria in the 
three component system and writing equations in terms of these components, for example: 



Fe 2 (S0 4 ) 3 


log N Fe = 0 


Fig. 12. Phase stability diagram for the Fe-O-S system at 800°C (partial pressures in pascals). 
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Fe + j 0 2 -» FeO 
where the equilibrium constant. 


K = “FeO = 10 9 at 800° C 

a (Fe) ' 


( 10 ) 


thus, knowing the equilibrium constant, the partial pressure of oxygen at the Fe/FeO 
phase boundary can be calculated and plotted on the diagram. Similarly all other relevant 
equilibria can be calculated at the temperature of interest. Use of these diagrams clearly 
shows the benefits of addition of elements such as aluminium and chromium in high- 
temperature corrosion-resistant alloys. Figures 13 and 14 are phase stability diagrams of 
Cr-O-S and Al-O-S systems, respectively, for the same conditions as were plotted in fig. 12 
for Fe-O-S. It can be seen that oxide formation covers a much greater range of conditions, 
so that for an iron-aluminium or iron-chromium alloy, protective oxides could form in cases 
where, in the absence of these alloying additions, iron sulphide would appear. For example, 
the region delineated by a rectangle in figs. 13 and 14 defines typical conditions encountered 
in a coal gasification system, where normally iron sulphide would form (point “Y” in fig. 12). 



log N Cr = 0 


Fig. 13. Phase stability diagram for the Cr-O-S system at 800°C (partial pressures in pascals). 
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An even more complex problem, however, is to consider relevant phase diagrams of 
an alloy in a multi-component gas mixture where oxygen-, carbon- and sulphur-contain- 
ing species may result in oxide, carbide and sulphide formation. Commonly all the phase 
stability diagrams for individual elements are superimposed so that an approximate 
description is obtained. Clearly, a complete representation of the equilibria could only be 
depicted by using a three-dimensional diagram. These are very difficult to calculate and 
even more difficult to construct. Approximations, therefore, are required, and it is 
convenient to consider the behaviour of an element in an alloy by plotting diagrams of 
a single element using a reduced activity. This approximation neglects the possibility of 
mixed or complex oxide formation, but it is now possible to carry out the necessary 
calculations to produce phase stability diagrams for mixed oxide systems (Barry and 
Dinsdale [1994]; Pelton et al. [1979]). If the gas mixture contains more than two 
active components a useful approach is to maintain one component at a constant level 
whilst varying the other two and then systematically alter components of interest to cover 
the range required. Clearly these are approximations and this needs to be borne in mind 
when reviewing the information available from these diagrams. 
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It must be remembered, however, that these diagrams will only indicate the presence 
of a phase by assuming equilibrium conditions, but kinetic factors are also very 
important. For example, in some conditions it has been observed that oxide phases only 
form exclusively at partial pressures of oxygen about 10 4 greater than the values for the 
thermodynamic boundary for transition from sulphide to oxide formation, giving rise to 
the so called “kinetic boundary” (Natesan [1983]). Figure 15 shows the position of the 
kinetic barrier for an austenitic stainless steel (AISI 3 10) at 650°C on the superimposed 
phase stability diagrams for iron, chromium and nickel, together with the region defining 
typical coal gasification atmospheres shown as a rectangle in the figure. Two points of 
interest arise from this diagram. Firstly, that the kinetic barrier for this alloy lies within 
the composition used in coal gasification so that in many plants this alloy would be 
unsuitable, and secondly, the position of the line is also very close to the FeS/FeO phase 
boundary. This latter observation may explain the origin of the kinetic barrier with 
chromia-forming iron-based alloys, since if iron diffusion occurs, sulphides would then 
form on the surface and promote sulphur migration through the chromia layer. 


4.3. Reaction path 


If one assumes that an oxide layer forms in a multi-component atmosphere and 
aggressive species such as carbon-, sulphur- or chlorine-containing gases are also present 
and are able to penetrate the oxide, then the nature of the corrosion product may depend 
upon the diffusion path in the oxide. If only bulk diffusion of the aggressive species 
occurs, then, since diffusion can only take place in a concentration gradient, the partial 
pressure of, for example, sulphur would be lower at the metal/oxide interface than at the 
oxide/gas interface. If, on the other hand, the oxide contained microcracks and the gas 
were able to penetrate the oxide, the gas equilibrium would no longer be controlled by 
the bulk gas composition because the partial pressure of oxygen would be set by the 
metal/metal oxide equilibrium and not by that in the gas phase (Stroosnijder and 
Quadakkers [1986]). In most practical cases where protection is conferred by formation 
of alumina or chromia, the partial pressure of oxygen at the metal/oxide interface is 
much lower than that in the gas phase; thus for a gas containing S0 2 , for example, this 
could result in a large increase in the partial pressure of sulphur at the metal/oxide 
interface compared with that in the bulk gas mixture. The effect of reducing the partial 
pressure of oxygen is clearly seen by considering the following equilibrium: 

so 2 0 2 +|S 2 

where the equilibrium constant 


K = 



= 1(T 14 at 800° C 


( 11 ) 


In the case of a nickel chromium alloy exposed to a combustion gas mixture containing 
400 ppm S0 2 at 800°C, the p(Sj) = 10“ 21 and p(Oj) = 10 3 Pa. However, at the alloy/oxide 
interface where Cr 2 0 3 forms, the partial pressure of oxygen would equal 10“ 22 Pa, and 
this in turn would cause p(S 2 ) resulting from 400 ppm S0 2 at the metal/oxide interface 
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Kinetic Barrier 



log p (0 2 )-in Pa 

Fig. 15. Phase stability diagrams for the Cr-O-S, Fe-O-S and Ni-O-S systems at 650°C, indicating the range 
of compositions found in coal gasification atmospheres and the kinetic barrier for exclusive formation of 
chromia observed for AISI 310 exposed in these conditions. 


to equal 10 13 Pa. In many ways this is a somewhat unrealistic calculation since in most 
cases there may not be complete access to the gas at the metal/oxide interface, and only 
a reduced concentration of S0 2 may be present. It is useful to consider the minimum 
concentration of S0 2 at that interface that is necessary for sulphide formation to occur. 
This can be shown to be defined by: 

^so 2 (min)) = exp AG MS + 2AG M0 - AG SC>2 /RT (12) 

Figure 16 is a schematic phase stability diagram in which three possible conditions are 
illustrated for penetration of the oxide by either the bulk diffusion or the microcrack 
mechanism, and the limiting sulphur dioxide pressure is marked by a dashed line. In 
region 1 the p(S0 2 ) is lower than the minimum value for sulphide formation so that 
sulphide cannot form by either mechanism. In region 2 transport of molecular sulphur 
dioxide through the scale would result in sulphide formation for p(S0 2 ) greater than the 
minimum, but not by bulk diffusion processes, whereas in region 3 bulk diffusion of ions 
would result in sulphide formation. Table 3 shows values for ^(SOjXmin) for various 
oxides as a function of temperature, where it can be seen that the presence of highly 
stable oxides (large negative A G°) results in very low values of p(S0 2 )(min). 
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Fig. 16. Schematic thermodynamic phase stability diagram for M-O-S systems, indicating the minimum p(S0 2 ) 
for sulphide formation (see text for details). 


It is important to realise that if the sulphidising gas does not contain oxygen (if it is 
H 2 S for example), then no mechanism exists for increasing the p( S 2 ) at the oxide/alloy 
interface, so that this rather dangerous condition which promotes rapid sulphidation 
would not apply. 

5. Hot-salt corrosion 

Hot-salt corrosion is a form of high-temperature attack on metals and alloys that 
occurs in the presence of a molten deposit; metalwork in power station boilers, waste 
incinerators, gas turbines and diesel engines is susceptible to this degradation process. 
Most work in this area has been related to the gas turbine but the underlying principles 
can be readily extended to other spheres. We will consider the environment, the 
phenomenology of the corrosion process, the mechanisms proposed to account for the 
corrosion behaviour and, finally, briefly consider the implications of new types of fuel 
for the gas turbine. 

5.1. The environment 

Molten deposits containing largely Na 2 S0 4 and/or V 2 0 5 form on turbine blades and 
nozzle guide vanes in the hot section of the engine. The deposits form as a consequence 
of combustion of sulphur- and vanadium-containing fuels which may also contain alkali 
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Table 3 


Minimum values of p(S0 2 ) in the atmosphere for sulphide formation at the metal/scale interface by 
molecular transport of S0 2 through the scale. 


Temperature, °C 


P(S0 2 ) (Pa) 


Ni 

Co 

Fe 

Cr 

400 

1.6 x 10“ 7 

1.0 x 10' 8 

2.0 x 10' 14 

1.3 x 10“ 33 

600 

2.5 x 10“ 2 

6.3 x 10^* 

1.6 x 10 -8 

6.3 x 10‘ 23 

800 

7.9 x 10’ 

6.3 x 10' 1 

4.0 x 10"’ 

3.2 x 10“ 16 

1000 

6.3 x 10 3 

7.9 x 10‘* 

7.9 x 10‘ 3 

4.0 x 10' 11 

1200 

- 

- 

4.0 x 10' 1 

2.5 x 10" 8 


metal impurities which are usually sodium-rich, either as NaCl or sodium naphthenate. 
Alternatively, the alkali metal may be ingested as an impurity in the air used for 
combustion, and again NaCl (seawater) is generally the main impurity, particularly for 
the marine gas turbine. Since the air:fuel ratio is about 50:1, it is clear that air quality is 
very important in determining the quantity of sodium sulphate formed in the combustion 
gases. The following equation indicates the process by which sodium sulphate forms: 

2 NaCl + H 2 0 + S0 2 +$0 2 + Nc^SO^ +2 HCl (13) 

The combustion gases containing these impurities are in the temperature range 1100-1400°C 
so that these compounds are usually in the vapour phase. However, when the combustion gas 
reaches the turbine blades and nozzle guide vanes where metal temperatures may be about 
850°C, condensation will occur if the vapour pressure of Na 2 S0 4 is greater than the equilib- 
rium value for that temperature, i.e., if the dewpoint is reached. It is generally accepted that 
there will be severe corrosion whilst the deposit is molten. If the temperature is lowered so 
that the deposit solidifies, the rates of attack that are observed are greatly reduced. 

A problem of particular importance in gas turbines is the rate at which the sodium 
sulphate-forming reaction proceeds. It should be realised that the gas velocity in a turbine 
is about 300 m s -1 and the distance between the combustion chamber and the blade can 
be as little as 0.3 m or for larger industrial engines 2-3 m, so that the sulphate forming 
reaction must be complete in about 1-10 ms. In the case of NaCl particles in the gas 
stream, clearly the size of the particle will affect the conversion process. It has been 
calculated, for example, that at 1300°C and 10 atmospheres (typical flame conditions) a 
10 /zm NaCl particle would only be 10% converted to sulphate in 5 ms (McCreath 
[1983]). In normal turbine operation particle sizes would be 10 /tm or less, and therefore 
when NaCl is the main contaminant some unreacted NaCl particles would be expected 
to hit the blades before evaporation or conversion processes are complete. 

Typically then, the environment in which gas turbine blades find themselves is one 
in which molten Na^C^ and/or V 2 0 5 can deposit from the vapour, and also, depending 
upon the operating conditions of the turbine, there is the possibility of impaction by NaCl 
(seawater) particles; in addition NaCl vapour and HCl gas may also be present. 
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5.2. Phenomenology of hot-salt corrosion 

Two types of attack can be observed. The more usual form occurs in the temperature 
range 800-950°C, known as Type I hot-corrosion and is characterised by the formation 
of a thick porous outer oxide layer, an intermediate layer of internal oxide particles 
mixed with depleted alloy and an inner region containing internal sulphide particles 
which leads the attack; fig. 17 is a micrograph illustrating this type of attack. It is 
generally observed that a maximum in rate of attack occurs at about 850-900°C; at 
higher temperatures, deposition of molten salts is reduced since the vapour pressure is 
insufficient for condensation. 

The other form of attack is known as low-temperature corrosion or Type II and is 
usually observed in marine gas turbines in the temperature range 650-750°C. In this type 
of attack large oxide and sulphide-filled pits form, but there are no internal sulphide 
particles, and fig. 18 is a micrograph illustrating this type of attack. Owing to the large 
temperature gradient that exists along the length of a turbine blade, it is possible for both 
types of hot-salt corrosion to occur on one component. 

5.3. Mechanism of attack 

We will consider in detail the more usual form of Na 2 S0 4 -induced hot-salt corrosion. 
Type I, and the principles applied will provide a basis for interpreting the special case of 
low temperature hot-corrosion or Type II. A brief description of vanadate-induced attack 
will also be given. 

5.3.1. Na 2 S0 4 -induced attack 

No single mechanism can be proposed to explain all observations but there are three 
main theories, viz., a) sulphidation, b) fluxing theory and c) pseudo-scale formation, and 
we will briefly consider the relevance of each. 



Fig. 17. Optical micrograph of Nimonic 105 blade after service in a gas turbine, showing conventional hot-salt 
corrosion attack. Type I hot-corrosion. 


References: p. 1357. 


1320 


S. R. J. Saunders and J. R. Nicholls 


Ch. 14, §5 



Fig. 18. Optical micrograph of 1N738LC blade after service in a gas turbine, showing low temperature pitting 
attack, Type 11 hot-corrosion. 


a) Sulphidation 

One of the first theories to explain Na 2 S0 4 induced corrosion was proposed by Simons 
et al. [1955] who suggested that sulphidation of the alloy occurred, as indicated by the 
following reactions: 


Triggering phase: 

Na 2 S0 4 + 3R — i Na 2 0 + 3RO + S 

(14) 


M + S -» MS 

(15) 

Autocatalytic phase: 

NajSO, + 3MS -» 4S + 3MO + Na 2 0 

(16) 


4M + 4S — » 4MS 

(17) 

Enhanced oxidation phase: 

2M + 2MS -» 2M.MS 

(18) 


2M. MS + 0 2 2MO + 2MS 

(19) 


The “R” in the triggering phase is a reducing agent and this is usually the alloy, but pyrolytic 
carbon resulting from poor combustion of the fuel may also act as a reducing agent. 

The sulphidation reaction proceeds in two phases, a triggering phase followed either 
by the autocatalytic phase or the enhanced oxidation phase dependent upon the thickness 
of the melt present. With thick Na 2 S0 4 layers, where there is limited oxygen access, the 
autocatalytic reaction is observed, but with thin layers we find the enhanced oxidation 
phase where the eutectic M.MS phase formed is oxidised to MO and more sulphide. The 
enhanced oxidation phase should be self-sustaining once the initial sulphide has formed 
with oxidation of the sulphide releasing sulphur for further sulphide formation. Pre- 
sulphidation of many alloys often leads to continuous sulphide formation by this 
mechanism when the material is subsequently oxidized in air or oxygen. 

(b) Fluxing theory 

The fluxing model was initially proposed by Bornstein and DeCrescente [1969], and 
developed by Goebel and Pettit [1970], and can best be illustrated by considering the 
decomposition of Na 2 S0 4 into its acidic and basic components. 
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NajSO,, — > Nap (base) + S0 3 (acid) (20) 

With basic fluxing of nickel oxide the following reaction then takes place: 

NiO + Nap Na 2 Ni0 2 (21) 

and for acidic fluxing: 

NiO + S0 3 NiS0 4 (22) 


The processes involved in basic fluxing are schematically represented for the case of 
nickel in fig. 19. It can be seen that basic fluxing is favoured by processes that increase 
the oxide ion activity in the Na2S0 4 melt. Figure 20 is a phase stability diagram for the 
Al-O-S system which illustrates acidic and basic fluxing of alumina. The formation of 
oxide under the molten salt would result in a decrease in the pPj) at the melt oxide/ 
interface and a consequent increase in the p(S 2 ) so that there is now sufficient driving 
force for diffusion of sulphur into the oxide, thereby decreasing the p(S0 3 ) and promot- 
ing basic fluxing. Acidic fluxing will be favoured by low temperatures because of the 
increasing stability of S0 3 , or by reactions of alloying constituents with Na 2 S0 4 that 
deplete the oxide ion concentration by, for example, complex oxide formation. 

Mo0 3 + Na 2 0 Na 2 Mo0 4 (23) 

Rapp and Goto [1981], pointed out that in order to sustain the dissolution reaction, a 
solubility gradient must be present in the molten salt layer. Thus oxide can dissolve at 
the oxide/melt interface, migrate down a concentration gradient to a site of lower 
solubility where precipitation occurs, and this is believed to be the origin of the external 
porous oxide layer found on samples exposed to this form of attack. Type I hot-corrosion 
is generally believed to be the result of basic fluxing or of direct attack on the substrate 
by the melt through a cracked oxide layer, and Type II is the result of acidic fluxing by 
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Fig. 19. Schematic illustration of the basic fluxing reactions of Na 2 S0 4 with pure Ni. 
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Fig. 20. Phase stability diagram of the Al-O-S system at 1000°C: indicates S 2 isobars, — > compositional 

charges in the NajSO., melt at the melt oxide interface with increasing time. 


a low-melting eutectic of, for example, 38% NiS0 4 .Na2S0 4 (m.p. 681°C) which are 
stabilised by a high p s0 (Luthra and Shores [1980]). 

The solubility of oxides relevant to the hot-salt corrosion of superalloys has been 
measured by Rapp and co-workers, and fig. 21 is taken from a review of their work, 
(Rapp [1987]). It is noteworthy that there is a difference of about six orders of mag- 
nitude in basicity (i.e., a Ns0 ) between the solubility minima for the most basic oxides 



Fig. 21. Oxide solubility in Na 2 S0 4 at 927°C as a function of oxide ion activity (Rapp [1987]). 



Ch. 14, § 5 


Oxidation, hot corrosion and protection 


1323 


and the protective alumina, chromia and silica which are all acidic oxides. Chromia 
scales are known to be more resistant than alumina to acid fluxing. It is suggested that 
chromia scales are able to support a buffering action and resist large changes in the 
basicity of the melt, and that the solubility minimum corresponds to the conditions in the 
gas turbine. The solubility data do not immediately suggest the cause of the superiority 
of chromia relative to alumina, but Rapp [1987], suggests that this might be the result of 
the faster growth of chromia. Another possibility is that the alumina scale is more 
vulnerable to mechanical damage, which would permit direct attack on the alloy. 

(c) Pseudo-scale 

In this theory (Reising and Krause [1974]) the Na 2 S0 4 acts merely to transport oxygen to 
the alloy surface and the Ni 2+ ions produced by the oxidation process are mobile in the liquid 
medium and move away to the liquid/gas interface where NiO particles form, producing a 
porous scale. Rapid oxidation ceases when the porous scale has absorbed all the Na 2 S0 4 
removing it from contact with the alloy surface and hence a protective oxide layer can be re- 
established. Experimental observations indicate that the NiO : Na 2 S0 4 ratio is usually constant 
and about 50:1, whereas the fluxing theory which has the following overall reaction: 

2Na2S0 4 + 9Ni 2Na20 + 6NiO + Ni 3 S 2 (24) 

suggests that the NiC^Na^O,, ratio should be 3:1. However, the effects of alloy 
chemistry and corrosion are probably better explained by the fluxing theory than by the 
“pseudo-scale” model. The fluxing model is now largely used in interpreting effects of 
hot-salt corrosion and has gained wide acceptance. 

5.3.2. Vanadate-induced attack 

Residual fuels contain relatively large amounts of sodium and especially vanadium. 
Figure 22 shows the phase diagram for the Na-V-O system which indicates formation 



Fig. 22. The quasi-binary Na 2 0-V 2 0 3 diagram (Kerby and Wilson [1973]). 
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of complex sodium vanadates with melting points as low as 535°C, although experi- 
mentally even lower temperatures have been found. The presence of these compounds in 
the combustion gas effectively limits the operation of diesel engines using these fuels to 
exhaust valve temperatures of about 450-500°C. Higher temperatures would promote 



Ch. 14, §5 


Oxidation, hot corrosion and protection 


1325 



Fig. 23. (a) Backscattered electron images of Stellite 6 samples after exposure in a burner rig at 600°C, as a 
function of sodium and vanadium concentrations in the fuel, (b) Backscattered electron images of Stellite 6 
coated exhaust valve, after exposure in a marine slow-speed diesel engine. Compare with the micrograph 
showing attack with 150 ppm V and 16 ppm Na in (a). 


acid fluxing by the molten vanadates. As with the previous case of sulphate corrosion, 
alloys and coatings with high chromium contents are used to resist attack when molten 
deposits form. 

A study of the effects of varying sodium and vanadium ratios on the corrosion of 
diesel exhaust valve alloys (Saunders et al. [1990]) showed that corrosion product 
morphology found in service could be simulated in the laboratory (fig. 23), and this was 
used to predict exhaust valve lives that agreed well with service experience. 

5.4. Coal-fired gas turbines 

In conventionally fired gas turbines, levels of 0.6 ppm Na in the fuel result in very 
severe corrosion of Nimonic 105, a material of moderate corrosion resistance. As 
mentioned previously with an air-to-fuel ratio of about 50:1, air purity is particularly 
important. If, however, the gas turbine were fired with a gas derived from coal where the 
calorific value of the gas is such that the air-to-fuel ratio would be about 3 : 1 then fuel 
quality is considerably more important for this application. Generally, the sulphur level 
in coal-derived fuels will be somewhat lower than in currently used oils. However, 
sodium and chlorine levels would be at least an order of magnitude greater than in 
conventionally fired gas turbines and also potassium will be present in significant 
quantities. The effect of potassium is not clearly defined but it might be expected that the 
lower melting point of sodium and potassium sulphate mixtures would encourage hot-salt 
corrosion at even lower temperatures. The presence of increased HC1, formed from 
combustion of the chlorine-containing compounds, would cause NaCl to form as a stable 
phase on the blades and, even in coals with relatively low chlorine contents, a 1 % NaCl 
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solution in NajSC^ may result which would be expected to increase corrosion rates. 
Clearly this is a complex situation and much work is required to understand the 
chemistry of these Na-K-Cl-S-0 molten salts. 

In addition to the greatly increased contaminant levels in the fuel, coal-fired gas 
turbines will be subjected to erosion from ash particles which despite filtration may 
remain entrained in the combustion gas. Some erosion tests have shown that coal ash 
particles 2-20 /rm in diameter are capable of removing large amounts of metal in very 
short periods. Some testing has been carried out, however, and although severe corrosion 
has been observed in some cases, the catastrophic rates that might have been predicted 
have not been observed. The blades quickly became coated with a deposit which was 
largely composed of coal ash and it is possible that the “pseudo-scale” theory may 
explain the somewhat better corrosion behaviour. The coal ash could be acting as a 
porous sponge, soaking up the sulphate melt and removing it from contact with the 
underlying alloy. 

6. Test and measurement methods 

The performance of materials at high temperatures is dominated by their resistance 
to mechanical deformation and attack by the environment. Thus, in attempting to select 
materials for service in these conditions it is important to have reliable means of 
predicting their behaviour, and whilst some information can be obtained by using the 
experience gained in the operation of existing plant, introduction of new materials or of 
different operating conditions requires data to be produced from laboratory testing 
procedures. High temperature corrosion can involve attack in the gas phase or the 
combined effects of hot gases and molten or solid deposits. As might be expected, if 
solid deposits can form, an increase in gas stream velocities would result in erosion 
rather than deposition but this is beyond the scope of this chapter. The mechanical 
behaviour of oxide layers is an extremely important factor determining the overall 
performance in corrosion and erosion processes, so that the available methods of 
ascertaining the mechanical properties of thin layers will also be considered. 

6.1. Monitoring oxidation processes 

The methods used to measure the rates of oxidation, i.e., processes forming only 
oxide layers, are considered in this section but, as will be evident, many of the tech- 
niques are applicable to the other reactions and processes discussed in this chapter. In the 
first place, consider the simplest process in which a test is carried out isothermally. 

6.1.1. Isothermal testing 

The purpose of tests of this type is to determine the rate at which the specimen is 
converted to oxide, and the most commonly used procedure is the gravimetric method in 
which the change in mass of the sample is recorded. Usually a mass gain corresponding 
to uptake of oxygen is observed, but if oxide spalling occurs this mass gain is reduced 
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by the amount of metallic oxide lost, and in extreme cases net mass losses can be 
recorded. The simplest form of the experiment is to place the sample directly into the 
furnace, withdrawing it periodically and measuring the change in mass. This experiment 
clearly imposes temperature cycles, which are discussed later, but in some cases, particu- 
larly for thin adherent oxide layers, similar results are obtained with experiments carried 
out continuously. Figure 24 shows schematically a typical arrangement for conducting 
such experiments with continuous weighing of the sample using a microbalance. 

Other methods involving measurement of changes in pressure or volume, or of the 
optical properties of the surface, such as ellipsometry are also used, but a complete 
description of these and other methods is unfortunately beyond the scope of the present 
chapter. The reader is referred to Kubaschewski and Hopkins [1967] for full details. 

Oxidation rates also have been determined by measurement of the natural frequency 
of a freely suspended specimen by Bruce and Hancock [1969]; Hancock [1970]; 
Hurst et al. [1975], and as will be shown later in the section on mechanical stability of 
oxides, cracking and spalling of the scale can also be detected. The characteristic 
frequency of the first mode of vibration, /, is related to the dimension and the elastic 
modulus, E, of the composite rod as follows: 


f = 


r f E tn c 4 + E 0 (b 4 - c 4 )l 

16irl 3 [m 0 + Aw) L ' 'J 


(25) 


where E m and E 0 are the elastic moduli of the metal and oxide, m 0 is initial mass of the 
rod, A m is the mass gain, c is the radius of metal and b is the radius of the composite. 


Balance beam Quartz or 
stainless alumina rod 



Fig. 24. Schematic drawing of an automatic recording thermobalance. 
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Thus, by solving the above equation, mass changes can be derived, together with the 
elastic properties of this composite, as follows: 


2 PM m M a — M m 
M„ -M m M a + M„ 


(26) 


where: A is defined by m = m 0 ( 1 + A) 

f B is frequency of the unoxidised rod 

M m , p m , E m are the atomic weight, density and elastic modulus of the metal 
M g , p 0 , E a refer to an oxide 
and ft = (E 0 p m M 0 )!(E m p 0 M 0 ) 


6.1.2. Cyclic oxidation 

The superimposition of thermal cycles in an oxidation experiment, cyclic oxidation, 
is used to encourage scale failure due to spallation (Lobb [1984]; Barrett and Lowell 
[1975]; Barrett et al. [1978]). Hence cyclic oxidation tests are used to monitor both 
scale adherence and the ability of the scale to successfully repair after repetitive scale 
failure. 

The performance of materials is generally monitored gravimetrically, either con- 
tinuously (Lobb [1984]; Barrett and Lowell [1975]) or in a discontinuous manner 
(Barrett and Lowell [1975]; Barrett et al. [1978]) although other monitoring 
methods have been used. For example, measurement of the resonant frequency of the 
specimen during thermal cycling has been successfully used to detect (Hancock [1970]) 
the onset of scale failure, before gross spallation, and hence mass loss was observed. 
Figure 25 illustrates typical data produced by gravimetric tests. Curve (a) represent good 
performance with little scale failure; curve (b) is intermediate behaviour, that is initially 
protective but after a limited test duration becomes non-protective, while curve (c) shows 
poor behaviour with scale spallation occurring from the onset of the test. Curve (b) is 
characteristic of materials that have a limited reserve of stable scale forming elements, 
such as chromium, aluminium and silicon, so that after repeated scale failure and repair 
the morphology and composition of the scale changes, possibly forming less protective 
spinels, for example. These less protective scales spall more readily and rapid mass 
losses are recorded (Barrett et al. [1978]). 

6.1.3. Mechanistic studies 

Various experimental procedures have been devised specifically to obtain information 
about the mechanism of oxidation and should be included in any discussion of test 
methods. When considering mechanisms of oxide growth it is important to know the transport 
path (lattice or grain-boundary diffusion), where oxide growth is occurring, i.e., anion or 
cation movement, the diffusion coefficients of the more mobile ions and the equilibrium 
defect concentration. Tracer and marker techniques can be used to determine the nature of 
mobile species and transport paths and the Rosenburg [1960] method, using interrupted 
kinetics experiments, gives information about diffusion coefficients and defect concentrations. 
The methods most commonly used are marker and tracer and these are discussed below. 
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Fig. 25. Schematic diagram of typical mass changes versus time during cyclic oxidation: (a) protective, 
(b) protective initially, (c) spalling. 

i) Marker methods 

If inert material, platinum or a stable oxide, is placed on the surface of unoxidised metal, 
its final position in the oxide scale will give information about the transport processes 
that have taken place, as is illustrated in fig. 26. If the scale is pore-free the final 
position of the marker ideally reflects the transport processes that have taken place. 
However, the presence of pores can result in the marker being displaced. For example, 
in cases where an outer dense layer forms over an inner porous region, the marker is 



Metol Metal Metal 

(a) (b) (c) 


Fig. 26. Ideal location of inert markers after lattice diffusion-controlled growth of oxide scales: (a) metal 
diffusion predominant; (b) oxygen diffusion predominant; (c) simultaneous metal and oxygen diffusion. 
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frequently found at the interface between the porous and dense layers even though cation 
lattice diffusion is faster than anion diffusion. In this case, the most likely mechanism is 
that molecular oxygen has penetrated the outer layer as the result of cracking of the scale 
or of the development of microchannels due to build-up of stress, so that the final 
position of the marker reflects movement of material but not solid-state diffusion 
processes. It is also suggested that with large markers, the marker itself can partially 
obstruct the diffusion process and some plastic flow can occur around the marker. 

ii) Tracer methods 

In this method, a radioactive form of one of the reactants in the oxidation process is 
used. This can be done by either depositing a thin layer of radioactive metal on the 
surface of the specimen or by carrying out the experiments with radioactive oxidant at 
some stage in the experiment; 18 0 is commonly used in oxidation experiments. Atkinson 
et al. [1979] have calculated idealised profiles for various limiting cases which are 
reproduced in fig. 27, where it can be seen that for the case of a scale, MO, growing by 
vacancy diffusion the profile depends on the effective charge, a, of that vacancy. Where 
grain-boundary diffusion takes place it has been assumed that there is no exchange of 
metal between the grain-boundary and the oxide lattice and thus the cations are im- 
mobilised. In fact, some exchange does occur and this would have the effect of broaden- 
ing the profile. In most practical cases, some combination of the limiting cases referred 
to above is usually found, so that the resultant profile will be an appropriate mixture of 
the simple cases. 

6.2. Mechanical failure of oxide scales 

By careful design of a cyclic oxidation experiment the susceptibility of a material to 
spalling can be assessed. Hence cyclic oxidation tests provide a qualitative method of 
assessing scale adherence. To quantify scale adherence, and the onset of spalling, control- 
led cooling experiments are necessary. Continuous monitoring permits the onset of first 
fracture to be detected such that from a knowledge of the temperature drop the fracture 
energy for scale spallation can be calculated. This approach has been used by Evans and 
Lobb [1984] to evaluate the interfacial fracture energy of chromia-rich scales formed on 
a 20Cr/25Ni/Nb-stabilised stainless steel after oxidation in C0 2 /2%C0 at 930°C. 

6.2.1. Internal stress measurements 

Two methods have been widely used to determine the internal or growth stresses 
present within oxide scales; these are X-ray stress analysis (e.g., Luthra and Briant 
[1986]) and methods based on the deflection of a thin metallic strip oxidising on one 
surface only. 

X-ray stress measurement techniques are usually evaluated at room temperature, 
although some facilities are available with a high temperature capability (Stout et al. 
[1989]). Hence the internal stress determinations include both growth stresses and 
thermally induced stresses during the cooling cycle. The X-ray determination of stresses 
is based on the measurement of lattice strains in a set of lattice planes of known 
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Fig. 27. (a) Metal tracer in a scale of MO growing by metal vacancy diffusion only, (b) Distribution of M and 
,8 0 tracers in a scale growing by short circuit diffusion, (c) Distribution of M and 18 0 tracers in a scale growing 
by short circuit diffusion and lattice diffusion (Atkinson el al. [1979]). 


orientation and then correlating these lattice strains with the strain calculated from elastic 
theory (Luthra and Briant [1986]). This approach is known as the sin 2 i // method. 

The determination of internal and growth stresses by direct deflection measurements 
on a thin metal beam at temperature (Roy and Burgess [1970]) has been widely used 
by Huntz and co-workers (Delaunay etal. [1980]; Huntz [1988]). Figure 28 schemat- 
ically illustrates this method of measurement. The thin metallic strip under test is only 
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To recording 
mechanism 


Fig. 28. Schematic diagram of the deflection study apparatus. 


permitted to oxidise on one side by coating the second, usually with silica. Oxidation 
causes the strip to bend with the degree of deflection, a measure of the stress generated 
within the oxide as defined by: 



E m X 2 m P 
3 L 2 X a 


(27) 


where D is the beam deflection, L is the beam length, X m and X„ are thickness of the 
metal and oxide, respectively and E m is the elastic modulus of the metal. This measured 
stress is only a part of the total stress due to oxidation, as a residual stress component 
remains after beam deflection due to the tensile elastic strains generated within the metal 
beam. This method does not take account of plastic deformation of the substrate by 
creep, for example, and recently a detailed analysis has been carried out Evans [1995] 
in which an analytical solution was developed from a relationship derived from the 
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Fig. 29. Stress distribution in an alumina scale formed on Fecralloy at 1100°C (Evans [1995]). 


balance of forces and the balance of momentum. Figure 29 is an example of the results 
obtained using this method for the case of a Fecralloy steel oxidised at 1100°C. 

6.2.2. Detection of scale failure 

Three methods have been used to date to detect scale cracking both during thermal 
cycling and under conditions of applied load. As discussed in the section on cyclic 
oxidation measurements, controlled cooling experiments can reveal the critical temper- 
ature drop necessary to cause through-scale cracking (increased oxidation rates are 
observed) or scale spallation (mass losses are recorded) (Lobb [1984]; Evans and Lobb 
[1984]; Baxter etal. [1984]; Rolls and Nematollahi [1983]). However, this method 
is restricted to metal/oxide systems where mass changes induced by scale cracking can 
be readily detected. The vibration technique (as described in section 6.1.1) can also be 
used to detect the onset of scale cracking. It is very sensitive to crack formation within 
the oxide which results in a reduction of the stiffness of the composite system and hence 
a discontinuity in resonant frequency is observed (see fig. 30) when the oxide scale 
cracks. Continued cracking would result in a continual decrease in the recorded resonant 
frequency. Should oxide repair occur then a frequency transition is observed with the 
frequency recorded after repair approaching that observed before the cracking event. 

More recently, acoustic emission has replaced the vibration technique as a route to 
detect in situ early scale failure, and its use in the determination of oxide fracture was 
recently reviewed by Saunders etal. [1994]. This technique is not limited to particular 
specimen geometries, has similar sensitivities to scale cracking and can be monitored 
automatically. Acoustic emission has found wide usage in monitoring cracking during 
thermal cycling (Khanna [1985]; Christl etal. [1987]), superimposed stress (SchOtze 
[1985]) and during isothermal oxide growth (Price and Bennett [1989]). Figure 31 shows 
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Fig. 30. Schematic response to breakaway oxidation, monitored using three methods. 

the typical coupling arrangement of an acoustic transducer to a specimen through the use 
of an acoustic waveguide. Figure 31 specifically relates to monitoring acoustic events 
during the cracking of oxides under applied tensile loads (Schutze [1985]), however, 
similar geometries can be used to monitor scale failure under cyclic conditions or during 
isothermal oxidation by combining acoustic emission with thermogravimetric studies 
(Bennett etal. [1989]). Acoustic microscopy in which an acoustic signal is focused onto 
a sample using water as a coupling medium can also be used to detect scale cracking. 

6.2.3. Measurement of the macro defects (cracks, voids and pores) present in an 
oxide scale 

As discussed earlier in this section, scale failure results when the in situ stress exceeds 
some critical value necessary to propagate a defect. Hence from a knowledge of the 
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Fig. 31. Schematic of an acoustic emission test arrangement. 


stress to cause fracture, the growth stress and the defect distribution in the scale, a 
measure of the scale fracture toughness and scale fracture energy can be determined. 
Hancock and Nicholls [1988] proposed a method for assessing the composite defect 
size in an oxide containing multiple voids and pores. The analysis follows guidelines 
proposed for the assessment of defects in welds, and requires that each defect is idealised 
and compared with its neighbours to assess if any interaction occurs. Interacting defects 
are considered as a single defect and the assessment procedure continues, until a single 
composite equivalent defect results. Using this defect size, values of fracture toughness 
can be evaluated. Evidently, this process requires detailed metallographic analysis. This 
approach, however, has been applied (Nicholls and Hancock [1989]) to a number of 
scales containing active elements and has clearly demonstrated that a major role of active 
element additions is to reduce the macro defects present within the oxide scales with 
consequent benefit for scale adherence and spalling resistance. 

6.3. Mixed oxidant tests 

Many industrial applications involve exposure of high-temperature components to 
complex gas mixtures, and it is necessary to devise relevant laboratory test procedures. 
In designing the test it is important to consider whether the atmosphere being simulated 
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is equilibrated or unequilibrated, and, also in the context of the laboratory environment, 
whether adequate safety precautions exist since frequently the gas mixtures to be used 
contain toxic or combustible components. In this section consideration will be given to 
factors controlling the composition of the atmosphere and the experimental arrangements 
required to monitor gas composition, including any modification to experimental 
arrangements necessary to allow safe operation and introduction of samples. 

6.3.1. Control of gas composition 

Normally, equilibrated gas mixtures are required, and therefore, it is common practice 
to assume that the main oxidising (in the chemical sense) species controls reaction rates, 
so that the partial pressures of oxygen ( p Q ), sulphur (p s ), halogenic gases (for example, 
p c| ) and the activity of carbon (a c ) are controlled by establishing relevant gas equilibria. 

It is relatively straightforward to carry out calculations to determine the partial 
pressure in these simple binary gas mixtures, but in multi-component atmospheres 
calculation of the equilibrated gas composition requires the use of iterative routines such 
as the MTDATA (Davies and Barry [1989]). The composition of the gas used in the 
test will, of course, control the type of scale that forms (see section 4.2). 

The choice of gas mixture is usually determined by a direct simulation (for industrial 
atmospheres or environments containing high partial pressures of reactive species) or by 
using a gas with an equivalent partial pressure of active species for low p Q , p s , etc. 
Kofstad [1989] has questioned this latter approach, since he considers that tfie specific 
components of a gas mixture can individually control scale defect concentrations or 
microstructure thereby altering rates of attack. For example, it is known that H 2 /H 2 0 or 
C0/C0 2 mixtures of equivalent p(0 2 ) yield different rates of attack. Representative 
testing, therefore, should be carried out with exact simulations of the relevant gas 
mixture where possible. 

In flowing gas streams, which are usually used to avoid depletion effects of active 
species (often present in only small quantities), it may be necessary to preheat the gas by 
passing it over a heat exchange system or in a serpentine path through the hot-zone of 
the furnace before contacting the samples. Some gas mixtures are slow to equilibrate so 
that use of catalysts is necessary, and a good example of this is the S0 2 /SO s equilibrium 
which is readily established over a platinum catalyst (Luthra and Worrell [1976]). In 


Table 4 
Gas equilibria 


Gas equilibrium 

Partial pressures and activities 

Hj/HjO 

P(0 2 ) 

CO/COj 

p(0 2 ) and a c 

Hj/H 2 S 

PtSJ 

SOj/Oj 

p(S 2 ) and p(0 2 ) 

ayH 2 

a c 

Hj/HCl 

P(cy 
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other cases the presence of the metal or oxide on the specimen surface is an effective 
catalyst, for example, iron, nickel and iron oxide. However, this catalytic activity may be 
poisoned by components of the gas mixture such as hydrogen sulphide (Grabke and 
Wolf [1987]). The nature of a hot corrosion experiment requires that the gas mixture 
flows in a thermal gradient, so that care is required to ensure that thermal diffusion 
effects do not cause unmixing of the gas by differential diffusion of the components in 
the temperature gradient (JoST [1952]). Generally, the heavier components will tend to 
concentrate in the cooler regions of the furnace for high convection velocities and vice 
versa for low velocities. 

6.3.2. Experimental procedures 

A paramount consideration is safety. Acceptable toxicity levels are given in the 
Registry of Toxic Effects of Chemical Substances, and the laboratory should be regularly 
checked to ensure that safe levels exist in the ambient atmosphere. 

Monitoring the gas composition is, of course, desirable and gas chromatography and 
mass spectrometry are commonly used to determine the composition at ambient temperat- 
ure, but it is not usually possible to measure gas compositions at the test temperature by 
these methods. Use of solid electrolyte zirconia probes to measure p(0 2 ) of the gas 
mixture within the hot-zone is being used increasingly (Kiukkola and Wagner [1957]). 
This method relies upon the establishment of the following electrolytic cell: 

Pt/0 2 (air)/Zr02/Pt (0 2 unknown) (28) 

Here the e.m.f. generated is directly proportional to the difference between the reference 
(for example, air) and unknown pfCy values. These probes are available commercially 
or can be relatively easily made in the laboratory. There have been attempts to develop 
probes for the measurement of sulphur and carbon activities but these are still in their 
infancy (Worrell [1975]; Taniguchi etal. [1988]). 

6.4. Attack by molten salts 

The case of gas turbine operation will be considered as an example. In designing a 
suitable test it should be borne in mind that in the turbine the salt deposit is constantly 
being replenished, and that owing to the action of centrifugal forces the deposit is also 
being removed to the blade tips and thus is usually quite thin. When a porous oxide layer 
or deposits form on a blade then the molten salt can also be trapped within pores. This 
is a complex situation which most researchers acknowledge cannot easily be realistically 
simulated in laboratory tests, so that various simplified test methods have been devised 
to allow some information about reactions kinetics and mechanisms to be obtained. 
Saunders and Nicholls [1984] reviewed the various test methods that have been used 
and have classified these into crucible tests, furnace tests and burner rig tests. 

There is general agreement that burner rig tests gives the best simulation of operating 
experience, and corrosion product morphologies obtained as a result of burner rig testing 
agree well with those found on blades taken from operating engines. However, furnace 
tests with careful control of gas and deposit chemistry and deposit replacement can give 
a close simulation of dynamic corrosion that can be expected in service, at a very 
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modest cost. Considerable disagreement exists about the quantitative comparison of 
corrosion rates from different rigs. A major factor in the confusion is that in the past 
operators of rigs have attempted to compare results taken from rigs where the con- 
centration of contaminants was controlled to that expected in the turbine. Since it has 
been established (Saunders and Nicholls [1984]) that deposition rate is the primary 
factor controlling corrosion rates, only rigs operating at the same velocity as the turbine 
would be expected to produce comparable rates of attack. It has been pointed out 
(Hancock [1982]; Saunders etal. [1982]) that contaminant flux rate (CFR) is a better 
parameter to control deposition and allows low velocity burner rigs to operate at 
comparable deposition rates to high velocity rigs and gas turbines. Recently, an inter- 
national round-robin was carried out which established the validity of this approach 
(Saunders and Nicholls [1995]). 

6.5. Attack by solid deposits 

In cases where solid deposits form it is not only important to ensure that the amount 
and composition of the deposit corresponds with that found in service, but, because of 
porosity effects in altering gas composition, the structure of the deposit must also be 
controlled. 

In recent years considerable interest has developed in fluidised-bed combustion of 
coal, where deposit-induced corrosion can occur. In this case the deposit consists of a 
mixture of CaS0 4 (derived from sulphur capture processes in the bed) and coal ash and 
it is believed that the active species in the corrosion process is CaS0 4 . Tests were carried 
out in which samples were buried in various CaS0 4 -containing mixtures (CaS0 4 + C, 
CaS0 4 + CaO etc.) with or without control of the gas atmosphere. While at high tempera- 
tures these tests generally reproduced corrosion found in service they did not reproduce 
attack observed at temperatures below about 800°C. This is believed to be due to 
inadequate simulation of the pore structure of the deposit. Slurries of CaS0 4 -containing 
material have been used by Natesan [1982], while Saunders and Spencer [1987], 
attempted to overcome the problem by using powder compaction techniques. In ash 
deposition conditions typical of those found within a fluidised bed, one cannot discount 
the r61e of particle impact in modifying deposit morphologies and hence corrosion 
(Stephenson and Nicholls [1992]). 

7. Life prediction modelling 

There is considerable interest in the ability to predict when oxide scales fail and their 
consequence on increased corrosion rates. This section will review “lifing” models that 
have been developed based on oxide failure, and hot-salt corrosion mechanisms. 

7.1. Oxidation models 

The development of spallation maps (see fig. 8) and their application to the prediction 
of scale failure was was discussed in section 2.4.2. From this work it was shown that 
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scale thickness in conjunction with temperature drop determines when the scale fails. 
This concept has been extended further to predict the lives of thermal barrier coatings 
(TBC) on hot-path components. The spallation of the Zr0 2 -8wt%Y 2 0 3 TBC can be 
modelled on the basis of the growth of the alumina at the bond coat/TBC interface, with 
the number of cycles to failure, N f , determined from a knowledge of the oxide thickness 
(h), the thickness of the oxide to fail in one cycle (h c ) through a Manson-Coffm type 
relationship: 


N f = 


Ae J 


f h' 
1 - — 

K, 


'h' 


(29) 


where Ae f is the strain required to cause failure in one cycle with no oxidation damage 
and Ae is the tensile component of the in-plane elastic strain range in the oxide at the 
interface, b and c are constants (Meier etal. [1991]; Meier etal. [1992]). 


7.2. A probabilistic model of corrosion loss 

Hot corrosion processes invariably proceed in two distinct stages; an initiation stage, 
where breakdown of the protective surface oxide occurs, albeit by chemical or mechan- 
ical means, followed by a propagation stage where deposited contaminants have direct 
access to the substrate and corrosion rates can be considered catastrophic. 

To model localised maximum corrosion attack (e.g., Type II, vanadic attack and other 
more generalised corrosion morphologies) measurement methods must be developed that 
not only account for the rate of growth of the pits, but also the dispersed nature of this 
form of attack. The basic concept behind these models was developed by one of the 
authors in the early 80s and has been used to predict corrosion rates within the marine 
gas turbine (Nicholls and Hancock [1983]; Nicholls and Hancock [1987]) and 
within diesel engines (Nicholls and Triner [1990]; Nicholls and Saunders [1990]). 
When modelling localised corrosion the common practice of measuring mean section loss 
and/or maximum observed corrosion penetration clearly does not apply, and hence 
statistical method for evaluating metal loss due to corrosion must be used (Nicholls and 
Hancock [1983]). 

The statistical evaluation of metal loss requires that many readings of section loss (a 
minimum of 24 readings is recommended) be taken randomly along a corroding surface 
and from one component section to another. These are then evaluated using statistical 
techniques, for example fig. 32. In this figure, remnant coating thickness was determined 
at 2.5 percentiles, i.e., 39 using data points per curve. The probability of exceedance is 
calculated by ordering the section loss data from highest loss to lowest. Then if i is the 
ranking of the corrosion loss reading x jy the probability of exceedence is given by 
i!(n + 1), where n is the total number of readings. The statistical model that is most 
appropriate (a normal or log normal distribution for general corrosion) can be determined 
from the probability plot. For example in fig. 32, the region of low corrosion loss approxi- 
mates to a normal distribution. However, a few localised deep pits are observed, with a 
morphology typified by fig. 18. A method for modelling this extreme corrosion is required. 
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Fig. 32. Probability plot of pack aluminide coating thickness following burner rig testing at 700 °C. 

7.3. Modelling extreme corrosion 

Here, the life-limiting rate of corrosion is associated with the growth of the deepest 
pits, that is to say the extreme corrosion data. In fig. 32, and from similar plots of 
section loss, extreme corrosion can be seen to occur for the upper 10% of metal loss 
readings. For the data included in fig. 32 where each point represents a 2.5 percentile, 
four data points can be considered extreme. This should be the minimum number of data 
points necessary to model extreme corrosion. However, by increasing the number of 
sections examined it is possible to increase the number of extreme data values available. 
A limited number of data points taken from multiple sections would provide better 
estimates of the extreme corrosion if multiple sections are available for analysis. 

This extreme corrosion data can be modelled using “extreme value statistics”; the best 
model being a Gumbel (Berry [1975]) extreme value model of maxima (Nicholls and 
Hancock [1983]; Nicholls and Hancock [1987]). 

A necessary condition for the use of this model is that “the upper tail of the original 
distribution decreases at least as rapidly as the exponential function”. This conditions is 
met for data that follow either a normal or log-normal distribution, and hence a Gumbel 
extreme value model can be used to model extreme rates of hot-salt corrosion. However, 
this condition should be validated (through the use of probability plots) when applying 
extreme value statistics to other localised corrosion phenomena. 

The probability density function f(x), of the Gumbel extreme value model of maxima 
is given by: 
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f(x) = (1 /<T e ) ■ exp[l -(x-fi e )/a e ] ■ exp[- exp(-x - )] (30) 

where and <r e are the distribution parameters 

The cumulative distribution function, F(x), is given by: 

F(x) = f{x)dx = exp(- exp(x - fi e )/<T e ) (31) 

The expected value, E(x), is given by: 

£(*) = L x = Ve+Ve-y (32) 

where y is Euler’s constant (=0.5772), and the variance of x, V(x), is given by: 

V(x) = ttV;/6 (33) 

by using the log-likelihood function, L(x) given by: 

L(x) = n ■ log(o- c ) - X ( x, • - jx e ) /(T e - £ exp[-(x, - n e )/<r e ] (34) 

;=i i=i 

the maximum likelihood estimates of /u. e and a t can be evaluated by solving the 
simultaneous equations: 

n 

n = X ex P[~( x < “ )/<r e ] (35a) 

1=1 

n = S x i l 1 - ex P (~( x < _ /*. )] (35b) 

i=l 

These equations have to be solved iteratively. 


Figure 33 shows an example of the relationship between the expected arithmetic 
mean (E(x)) and the expected standard deviation (Vv(x)) and those measured exper- 
imentally (Nicholls [1993]). In both cases the degree of fit is very good, confirming 
that these extreme corrosion data may be modelled using a Gumbel extreme model of 
maxima. 


7.4. Development of a life prediction model 

Since Type II corrosion is associated with the growth of pitting morphologies, only 
the extreme corrosion data from these burner rig tests is considered in developing the 
type II hot corrosion life model. 

The most likely “extreme” value of corrosion (xj can be calculated and its value, of 
x c that has only a 5% chance of exceedance, is given by: 

x e = /x e = (T e ln[ — ln(0.95)] (36a) 
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Fig. 33. Relationship between (a) expected (E x ) and 
deviation of the extreme values. 



(b) Actual std. dev (SJ pm 
actual mean, (b) expected (V V x ) and actual standard 


and, more generally, at any probability of exceedance (a): 

x e = fi e = <r e ln[- ln(l - a)] (36b) 

By modelling p, e and <r e as a function of exposure time a corrosion life model can be 
developed which has the form for any probability, 

*. ( i^ =A+ B J/ + Gln[(l-a)] (37) 

where A, B and C are constants dependant on the corrosion process, temperature etc., t 
is the exposure time in hours, x e is the maximum expected Type II hot-corrosion pit 
depth, and a is the risk that the pit depth will exceed x e within the exposure time (t). 

Clearly, the constants A, B and C depend on the rate of arrival of contaminants, i.e., 
the contaminant flux rate. For low contaminant flux rates it can be shown that the 
observed corrosion rate increases with contaminant flux and follows a power law. 
Incorporating contaminant flux dependence into equation 37, one obtains: 

=X ™{ A+B it + Ct ln(l -a)} (38) 

where A, B and C are redefined as flux independent corrosion rate constants, and take 
the values of 15.9, 2.2 and -0.12 respectively, for the model reproduced in fig. 34. 

Two alternative failure criterion exist. Figure 34 plots the predicted maximum corro- 
sion rates, calculated using equation (38), for two sodium contaminant flux rates (0.05 and 
0.13 mg/cm 2 , h) and for two confidence limits (95 and 99%). Also included in the figure 
are the curves for the most likely extreme value of Type II corrosion (i.e., the most likely 
Type II corrosion pit depth) at these two sodium fluxes. As can be seen, these models are 
capable of predicting the corrosion expected under engine service trials. It is also possible 
to predict the point at which corrosion penetration through the coating occurs, or when the 
corrosion pit is sufficient to cause fatigue failure (Nicholls et al. [1993]). 
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Exposure lime (h) 

Fig. 34. Prediction of Type n corrosion at 700°C for sodium contaminant fluxes of 0.05 and 0.13 mg cm -2 IT 1 . 


8. Developments in coating technology 

In discussing coating properties, performance will be assessed with reference to the 
desirable properties of a high-temperature coating system (table 5). In general, two types 
of coating process can be identified. The first requires that the composition of the surface 
of an alloy should be altered by diffusion, while the second involves the deposition of a 
metallic or ceramic overlay with improved oxidation resistance. Hence, these two 
processing routes are known as diffusion and overlay coating, respectively. 

Diffusion coatings provide a surface enrichment of aluminium, chromium and/or sili- 
con which results in the formation of protective surface oxides of A1 2 0 3 , Cr 2 0 3 or Si0 2 . 


References: p. 1357. 



1344 


S. R. J. Saunders and J. R. Nicliolh 


Ch. 14, § 8 


Table 5 


Requirements of a high-temperature system 


Coating property 

Requirement 


Location 




Coating Surface 

Bulk 

Coating/substrate 





interface 

Oxidation/corrosion 

Low rates of scale formation. 

X 



resistance 

Uniform surface attack. 

A thermodynamically stable sur- 

X 




face oxide. 

X 




Ductile surface scales. 

X 




Adherent surface scales. 

X 




High concentration of scale form- 
ing elements within the coating to 

X 




act as reserve for scale repair. 

X 

X 


Interface stability 

Low rate of diffusion across inter- 



X 


face at operating temperatures. 
Limited compositional changes 
across interface. 



X 


Absence of embrittling phase 
formation during service. 



X 

Good adhesion 

Matched coating and substrate 
properties to minimise coating 
mismatch and stress generation at 
coating/substrate interface. 


X 

X 


Optimum surface condition before 
coating. 

Growth stresses during coating 



X 


formation should be minimised. 


X 

X 

Mechanical strength 

Coating must withstand all stress 
(creep, fatigue, and impact load- 
ing) that is generated at com- 
ponent surface during service. 

Well matched thermal expansion 
coefficients between coating and 
substrate to minimise thermal 


X 



stressing and thermal fatigue. 


X 

X 


Metallic overlay coatings are a specifically designed corrosion resistant alloy that are 
deposited onto a component surface and are typified by the M-Cr-Al-X series where M 
can be Fe, Ni and/or Co and X is an oxygen active metal, i.e., Y, Ce, Hf, Si or a combi- 
nation of these. For this class of coating, interaction with the substrate is generally neg- 
ligible during deposition. In addition, thicker coatings may be produced which can give 
extended service lives. The reduced substrate interactions, choice of corrosion resistant 
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alloy and the ability to deposit thicker coatings results in greater design flexibility. 

Ceramic barrier coatings are a class of overlay coating, where corrosion protection 
(and thermal protection) are provided by an impervious ceramic barrier. For oxidising 
environments the ceramic barrier should be a stable oxide, and Si0 2 , A1 2 0 3 and Zr0 2 are 
the prime contenders, while nitride ceramics, TiN and Si 3 N 4 have been examined for 
reducing conditions. In addition to their role as a corrosion barrier, ceramic overlay 
coatings may be engineered to provide additional thermal protection, the thermal barrier 
concept, where a cooled component, for example a turbine blade is coated with a layer 
of ceramic (typically 0.3-0.4 mm thick) having low thermal conductivity, thereby 
retarding the rate of heat flux into the component surface. Several types of thermal 
barrier system have been developed with varying degrees of success. The preferred 
family of ceramics used for proprietary coatings is based on stabilised zirconias, and in 
particular yttria partially stabilised zirconia (Y-PSZ). 

8.1. Diffusion-coating processes 

Diffusion-coating processes have been applied for many years to improve the 
environmental resistance of a base alloy by enriching the surface in Cr, Al, or Si. Pack 
chromising has been in widespread use since the early 1950s (Samuel and Lockington 
[1951]; Sully and Brandes [1967]) to increase the oxidation/corrosion resistance of low 
alloy steels. In the late 1960s aluminising was first used for the protection of superalloy 
gas turbine aerofoils (Goward and Boone [1971]; Grisaffe [1972]). There was 
renewed interest in siliconising and silicon-modified diffusion coatings for high temper- 
ature service in the early 1970s (Felix and Erdos [1972]; Bauer et al. [1989]) when 
novel solutions to the low temperature corrosion problems, associated with contaminants 
in industrial turbine plant burning impure fuels, were required. Siliconising had previous- 
ly been dismissed as a major contender because of the strong embrittling effect on high- 
temperature alloys and the tendency to form many low melting point metallic and oxide 
eutectics (Felix and Erdos [1972]). 

Diffusion coatings can be applied to hot-gas components using several techniques, 
including pack cementation, slurry cementation, and metallising. A comprehensive review 
of the methods of deposition of diffusion coatings was in Mevret etal. [1986]. Fluidised 
bed techniques can also be used to deposit diffusion coatings on a laboratory scale 
(Duret etal. [1982]). This latter approach would permit the coating of large components 
with a close tolerance on coating thickness, as a result of the uniform temperatures and, 
therefore, chemical activities possible within a fluidised bed. 

In the pack cementation process, components to be coated are buried in a “pack”, 
contained in a sealed retort (fig. 35). The retort is heated to the desired processing 
temperature under either an inert gas or hydrogen atmosphere to prevent oxidation. The 
exact process cycle, time, and temperature are dependent on the required coating, coating 
thickness, and subsequent substrate heat treatment. The pack contains three elements; i) 
a donor alloy that releases solute material at a known rate and hence determines the pack 
activity, ii) a halide activator that dissociates during the process cycle and acts to 
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Schematic diagram of a pack cementation retort 
Fig. 35. Schematic diagram of pack cementation retort. 


transport solute material from the pack to the component to be coated, and iii) an inert 
oxide diluent to prevent pack-sintering. 

Typical pack compositions used to produce a range of metallic coatings are given in 
table 6. Of these the two diffusion-coating processes that are most widely used are 
“aluminising” and “chromising”. The pack-aluminising process will be used as an 
example to illustrate a typical process cycle. During the aluminising process material 
from the pack is transferred to the component surface through the formation of inter- 
mediate volatile aluminium monohalide gas and therefore the coating process is probably 
more accurately described as a chemical vapour deposition process. Interdiffusion 
between the depositing aluminium and the substrate alloy results in the formation of the 
intermetallic coating, primarily NiAl or CoAl, depending on the alloy base, but contain- 
ing to a degree most of the elements present in the base alloy either in solution or as 
dispersed phases. 

The deposition rate and morphology of the coating depend on pack activity, process 
time and temperature. Coatings are classified as either “low activity”, when outward 
diffusion of the base alloy occurs, or “high activity” when inward diffusion of aluminium 


Table 6 


Typical pack composilions and deposition temperatures for halide activated pack cementation 


Coating 

Pack composition, wt% 

Deposition temperature, °C 

A1 

1.2-2.7A1, 0.8-1 1.2NaF, bal. A1 2 0 3 

900 

Cr 

48Cr, 48A1 2 0 3 , 4NH 4 C1 

850-1050 

Ti 

77Ti, 20TiO 2 , 3NH 4 C1 


Si 

5Si, 3NH 4 CI, bal. A1 2 Q 3 
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occurs. In the latter case a surface layer of Ni 2 Al 3 forms and a further heat treatment is 
required to convert this brittle phase to NiAl. This step is usually combined with the heat 
treatment required to recover substrate properties. Figure 36 illustrates a typical alumin- 
ide coating deposited onto a nickel based superalloy using a high activity coating 
process. The coating was deposited using an aluminising pack containing 2%A1 at 900°C, 
and was heat-treated for 2 h at 1 120°C, then 24 h at 845°C. 

Clearly, the properties of the aluminide coating (or for that matter any diffusion 
coating) depend upon the process methodology, the substrate composition and the 
subsequent heat treatment. Typically, aluminide coatings contain in excess of 30 wt%Al 
and are deposited to thicknesses between 30-100 /xm depending on the type of aluminide 
formed. They offer satisfactory performance for many aviation, industrial and marine 
engine applications. Under severe hot corrosion conditions, or at temperatures above 
1 100°C, aluminide coatings offer limited protection. Hence modified aluminide coatings 
have been developed that offer improved corrosion resistance. 

8.2. Modified aluminide coatings 

Modified aluminides have been fabricated using one of the following techniques 
(Duret etal. [1982]): 

i) Co-deposition of elements from the pack or slurry; 

ii) pretreatments of the superalloy before pack aluminising, for example, chromising 
prior to pack aluminising; 

iii) deposition of a metallic layer using electroplating or PVD (physical vapour 
deposition) techniques. For example, a platinum aluminide coating is formed by 
depositing platinum onto the superalloy prior to aluminising. 

Alloying additions include Cr, Si, Ta, the rare earths and precious metals. Many of these 
coatings are now commercially available. Probably, the most significant advance in this 
area was the development of the platinum modified aluminide class of coating. Figure 37 



Fig. 36. Backscattered electron micrograph of a fully 
processed high activity pack aluminide coating on IN 


738LC. 


Fig. 37. Backscattered electron micrograph of a plati- 
num aluminide (RT22) coating on IN 738LC. 
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illustrates the microstructure of the RT22 variant, marketed by Chromalloy. The SEM 
micrograph is a backscattered electron image, the platinum-rich aluminide phase within 
the coating being shown in light contrast. 

8.3. Overlay coatings processes 

Diffusion coatings, by the nature of their formation, imply a strong interdependence 
on substrate composition in determining both their corrosion resistance and mechanical 
properties, hence, the possibility of depositing a more “ideal” coating, with a good 
balance between oxidation, corrosion and ductility has stimulated much research interest. 
Early coatings of this type were alloys based on cobalt (Co, Cr, Al, Y) containing 
chromium additions in the range 20-40%, aluminium additions between 12-20% and 
yttrium levels around 0.5%, with the most successful coating being Co25Crl4A10.5Y 
(Talboom etal. [1970]). Recent coatings are more complex and are based on the M-Cr- 
Al-X system, where M is Ni, Co, Fe or a combination of these and X is an active 
element, for example Y, Si, Ta, Hf etc. (Goward [1983]; Clarke [1979]; Prater etal. 
[1981]; Goebel etal. [1981]). The composition of the M-Cr-Al-X system is selected to 
give a good balance between corrosion resistance and coating ductility, while the active 
element addition(s) can enhance oxide scale adhesion and decrease oxidation rates. 
Current thinking suggests that a combination of active elements is beneficial in reducing 
coating degradation through their synergistic interaction. Overlay coatings have been 
deposited using a range of techniques including plasma spraying and physical vapour 
deposition (PVD). Table 7 summarises some typical commercial overlay coatings and 
their methods of deposition. 

The earliest production method was electron-beam PVD (Talboom [1970]). However, 
because of the high capital cost in setting up a commercial EB-PVD plant, plasma spray 
methods have found wide acceptance, particularly the argon-shrouded and vacuum- 


Table 7 


Some typical commercial overlay coatings 



Composition 

Specification 

Deposition Process 

Co, Cr, Al, Y 

Co-23Cr-12Al-0.5Y 

ATD2B 

EB PVD* 


Co-18Cr-l 1 A1-0.3Y 

ATD5B 

EB PVD 


Co-18Cr-8Al-0.5Y 

LC029 

Argon shrouded plasma spray 

Co, Ni, Cr, Al, Y 

Co-32Ni-21Cr-8Al-0.5Y 

LC022 

Argon shrouded 
plasma spray 


Co-23Ni-30Cr-3Al-0.5Y 

LC037 

Argon shrouded 
plasma spray 

Ni, Co, Cr, Al, Y 

Ni-23C0-18Cr-12.5Al-0.3Y 

ATD7 

PWA270 

EB PVD 

Ni, Cr, Al, Y 

Ni-20Cr-l 1 A1-0.3Y 

ATD10 

EB PVD 


* Electron-beam physical vapour deposition 
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plasma spray processes (Nicoll [1984]; Taylor et al. [1985]; Restall and Wood 
[1986]). More recently, composite electroplating (Kedward [1969]; Forster et al. 
[1985]; Honey et al. [1986]) and laser fusion (Powell et al. [1985]; Steen [1987]) 
methods have been used to deposit overlay coating systems. However, coatings produced 
by EB PVD processes are still considered the commercial standard against which other 
process routes are compared. 

8.3.1. Physical vapor deposition 

Physical vapour deposition processes encompass both electron beam (EB) evaporation 
and sputtering, either of which may be used in conjunction with ion-plating (see below). 

Commercial EB PVD coatings are produced in a vacuum environment, lO^-KT 4 Pa, 
with deposition rates often exceeding 25 /j,m/min. Components are generally preheated 
in vacuum to between 800-1 100°C, and are rotated within the evaporant cloud during the 
deposition process (Boone [1980]). Rotation attempts to ensure uniform coverage as the 
process is primarily “line of sight” at these pressures. Problems such as columnar grains 
with unbonded interfaces (known as “leaders”) due to shadowing effects can occur as the 
coating grows. High deposition temperatures result in increased surface diffusivity which 
reduces the density of these leaders and also permits some minimal interdiffusion 
between the coating and substrate during the coating process cycle, ensuring good 
adhesion. Hence coating spallation, a problem with many processing routes is not a 
problem provided the substrates are properly cleaned prior to coating. Post-coating 
processing includes glass bead peaning and heat treatment which results in complete 
closure of any remaining leaders. Figure 38 is a micrograph of a Co,Cr,Al,Y coating 
(ATD5B) produced by an EB-PVD processing route. 

In the sputtering process, positive gas ions (usually argon) produced in a glow 
discharge or plasma bombard a target of coating material dislodging groups of atoms. 
These then enter the vapour phase and deposit onto substrates to be coated (Bunshah 
[1980]). Deposition rates are much slower than the EB evaporation route, generally 





Fig. 38. Secondary electron micrograph of an EB PVD 
Co-Cr-Al-Y (ATD 5B) coating on Mar M002 - note the 
fine microstructure compared with that produced by vacuum 
plasma spraying, cf. fig. 39. 


Fig. 39. Backscattered electron micrograph of 
a vacuum plasma sprayed Co-Ni-Cr-Al-Y 
(ATD2B) coating on IN 738LC. 
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10-20 /xm/h for a diode system or up to 50 /xm/h for a planar magnetron system. 
Magnetron systems (having magnetic plasma confinement adjacent to the target) usually 
result in more uniform deposition and can virtually eliminate substrate heating during the 
deposition process. Working pressures are of the order of 1-10 _2 Pa depending on 
whether a DC glow discharge or RF plasma is used to generate ion bombardment and 
hence offers excellent throwing power and good overall coverage of components to be 
coated. Because of the low process temperature, coatings are invariably heat-treated 
following deposition to produce the desired properties. 

Both EB evaporation and sputtering may be combined with ion-plating. Ion-plating 
is essentially a PVD process in a soft vacuum (1-KT 1 Pa) with evaporant depositing onto 
substrates held at a high negative potential (often between 2-5 kV) (Teer [1980]). 
During ion-plating the components to be coated are initially bombarded with positive 
ions, formed in the discharge, which remove oxides and other contaminants from the 
surface. When the surfaces are sufficiently clean the vapour source is energised and 
metal evaporant enters the discharge and is deposited onto the sample. With EB 
evaporation and ion-plating, deposition rates are typically 10-20 /xm/min. 

An interesting combination of sputtering and ion-plating (sputter ion-plating, SIP), 
using a hollow cathode sputtering target and application of a bias to the substrates held 
within the hollow target, has been described (Coad and Dugdale [1979]) and used to 
deposit coatings on turbine blade materials (Coad and Restall [1982]). The system 
operates at soft vacuum (1-10 Pa). The geometry of the sputter targets and components 
to be coated offer very good throwing power and therefore uniform coatings. The system 
is heated to ~300°C during coating with a bias typically of 1 kV. Deposition rates are 
typical of diode sputtering systems, i.e., -10 /xm/h. Again, heat treatment of the as 
deposited coating is required to obtain the desired properties. 

8.3.2. Spraying processes 

Plasma- spraying has the advantage of being able to deposit metals, ceramics, or a 
combination of these, generating homogeneous coatings with microstructures consisting 
of fine equiaxed grains, i.e. no columnar/leader defects. High deposition rates are 
possible with little significant change in composition occurring from the powder 
feedstock through to the coating, even when the elements in the coating have widely dif- 
fering vapour pressures. The most obvious limitations of this process are that the coating 
process is ‘line of sight’, requiring complexed robotic manipulation for complete 
coverage, and that the more reactive elements may well oxidise during the spraying 
process if conducted in air. Porosity problems previously reported for plasma spray 
coatings can largely be overcome using post coating thermomechanical treatments. Figure 
39 shows a fully processed, argon-shrouded plasma-sprayed LC022 (Co-Ni-Cr-Al-Y) 
overlay coating. 

Carrying out the plasma-spraying process in a closed chamber under reduced pressure 
(5-7 x 10 3 Pa), known as low-pressure plasma-spraying (LPPS) or vacuum plasma- 
spraying (VPS), permits close control of the gaseous environment. This removes the 
possibility of unwanted gas-metal reactions and also permits high particle velocities 
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giving improved adhesion and higher densities, thus overcoming many of the limitations 
of plasma-spraying but at an increased capital plant cost. 

In the high-velocity oxy-fuel (HVOF) spraying process fuel is burnt with oxygen at 
a high pressure and generates a high-velocity exhaust jet. Fuel gases may include 
acetylene, kerosene, propane, propylene, hydrogen, methacetylene-propadiene (MAPP) 
mixture etc. The ratio of the gas flow rates determines the temperature of the flame. The 
temperature of oxygen-acetylene reaches a maximum at 3170°C if mixed 1.5 to 1 (by 
volume) and oxygen-propylene reaches the maximum temperature of 2900°C at the ratio 
of 4: 1 (Neiderberger and Schiffer [1990]). The velocity of the exhaust jet in the 
Jet-Kote torch is about 2000 m s -1 (Kreye [1991]). The main advantage of this process 
is the shorter residence time in the flame (lower powder temperature) and higher kinetic 
energy of the particles impacting. This produces a dense coating with less degradation of 
this powder during spraying. 

The most widely sprayed powders are carbides, although there is considerable interest 
in using the HVOF process to deposit M-Cr-Al-Y overlay coatings. The bond strength 
of HVOF-sprayed coatings can be as high as 90 MPa (Kreye [1991]). Porosity is low, 
less than 1%, and typical coating thicknesses are in the range 100-300 /im. 

8.3.3. Laser processes 

Lasers can be used to form coatings directly by (i) preplacing a powder onto the 
substrate, (ii) blowing the powder into a laser-generated melt pool, or (iii) applying the 
clad in wire or sheet form. Alternatively, coatings produced by other routes such as 
plasma-spraying or electroplating may be modified by surface treatment with the laser. 
By careful selection of laser conditions, dilution of the coating by substrate melting can 
be minimised and large areas can be surfaced by a series of overlapping tracks. Power, 
beam diameter, and beam velocity are the main process variables with typical values 
being 1.5 kW, 5 mm, and 10 mm s" 1 , respectively. As with plasma-spraying it is 
necessary to apply an inert gas shroud to reduce oxidation of the coating. Powell et al. 
[1985], suggest that adhesion is improved if some melting of the substrate occurs. 

Problems associated with reflectivity of the surface occur particularly with use of 
wire and sheet, and for the preplaced powder method there are difficulties in maintaining 
the powder on the surface. Thus the blown powder method is favoured by Steen [1987]. 
Porosity in coatings can occur from cavities between overlapping tracks, solidification 
cavities, or gas evolution, but problems can generally be overcome be careful choice of 
processing parameters. 

Residual stresses in a laser coating are generally tensile and occur from restrained 
contraction of the prior molten clad. Cracking may result, but can usually be overcome 
by preheating or pretensioning the substrate before coating (Steen [1987]). 

8.4. Oxidation and hot-salt corrosion resistance of diffusion and overlay coatings 

The corrosion performance of a range of diffusion and overlay coatings has been 
reviewed by Malik etal. [1982] and Nicholls etal. [1984], Over the temperature range 
700-950°C the platinum-modified aluminides performed exceptionally well. The other 
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diffusion coatings, chromised and chrome-aluminised coatings, generally fared worse 
than the conventional aluminide, although chromium-rich coatings have been shown to 
offer improved corrosion resistance in an industrial turbine environment (Malik et al. 
[1982]; Luthra and LeBlanc [1987]). 

Overlay coatings generally perform better at higher temperatures where oxidation is 
the dominant failure mode (above 900°C) reflecting the good adherence of the thin 
alumina scales which is promoted by the presence of active elements such as yttrium. 
However, at lower temperatures, 650-850°C, corrosion rates are relatively high. Methods 
have been investigated to improve the traditional M-Cr-Al-Y coatings by use of a 
platinum underlayer and overlayers (Prater etal. [1981]), other additions such as Ti, Zr, 
Hf, Si and Ta, (Duret et al. [1986]; Saunders and Nicholls [1984]; Bornstein and 
Smeggil [1982]). Surface modification by CVD (Restall and Hayman [1984]), PVD 
(Goebel etal. [1981]) or slurry cementation techniques, (Goebel etal. [1981]) and re- 
processing of the coating surface using laser beams (Duret et al. [1982]; Bornstein and 
Smeggil [1982]) have also been considered with varying degrees of success. 

Surface modification results in the formation of a duplex coating structure and this 
can result in improved performance, for example, a pulse-aluminised Co-Ni-Cr-Al-Y 
coating exhibits superior corrosion resistance at 750 and 850°C compared to its plasma- 
sprayed counterpart (Nicholls et al. [1984]). Similarly, silicon modifications to the 
surface of Co-Cr-Al-Y coatings (Goebel et al. 1981) have also been shown to improve 
resistance to low-temperature hot corrosion. 

In addition to corrosion resistance, surface coatings can have a major influence on the 
erosion/corrosion resistance of hot-end components. As mentioned earlier, environmental 
resistance depends on the formation of thin protective oxide scales and therefore the 
interaction between an erodent and the oxide scale that is formed on a component is of 
particular importance in determining the erosion/corrosion behaviour of a particular 
coating. The effect of particle impingement on degradation rate depends on whether the 
impact event produces failure within the protective oxide scale. The higher the impact 
energy (larger particles, higher velocities) the more likely it is that scale failure will 
occur and this will result in an increase in corrosion rate (Hancock et al. [1987]; 
Stephenson [1989]) and reduce incubation times to the onset of catastrophic corrosion. 

8.5. Thermal stability of diffusion and overlay coatings 

Structural stability of coatings is also an important factor if they are to maintain their 
protective qualities over extended periods of time at high temperatures. Coatings degrade 
not only by loss of scale-forming elements to the surface, but also by interdiffusion with 
the substrate. This can result in additional problems such as the formation of topologic- 
ally close-packed (TCP) phases, e.g., sigma, below the coating, which cause embrit- 
tlement of the substrate. 

In aluminide diffusion coatings, degradation results in the breakdown of the outer 
/3-NiAl phase by the formation of a y'-Ni 3 Al network which after extended exposure time 
can completely penetrate through the /8-phase. These y' leaders behave as short-circuit 
corrosion paths which results in rapid failure of the coating. The thermal stability of diffu- 
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sion coatings, including pack aluminide, chromium aluminide and platinum aluminide var- 
iants were studied in the early eighties (Cooper and Strang [1982]; Lang and Bullock 
[1982]). These studies have shown that the thermal stability of /3-NiAl can be improved 
by the addition of platinum. The two-layered PtAl 2 (3 structure degrades to single-phased (5 
with no ■/ leaders even after extended exposure time, e.g., 18,000 h at 850°C (Hancock and 
Nicholls [1984]). This is a major factor which contributes to the improved corrosion 
resistance of platinum-aluminide coatings compared to conventional aluminides. 

Few papers have been published on the metallurgical stability of overlay coatings, 
probably because diffusion is not a prime factor in the formation of these coatings. 
Nevertheless, substrate interaction effects are still observed e.g., hafnium from hafnium 
modified MM200 (Goward [1983]) has been found to improve the hot corrosion 
resistance of Co-Cr-Al-Y coatings. Movement of other elements, e.g., tungsten and 
molybdenum, from the substrate, may well be detrimental to corrosion resistance. The 
beneficial effect of a platinum underlayer under marine conditions (Clarke [1979]) may 
well be due to a diffusion barrier effect. 

One of the most comprehensive studies on the stability of overlays is that of Mazars 
et al. [1986], which have examined in detail the diffusion degradation of Ni-Cr-Al-Y, 
Co-Cr-Al-Y, and Fe-Cr-Al-Y systems. From their work they conclude that diffusion 
stability can be considered a life limiting factor if coatings are operated at temperatures 
> 1000°C for prolonged periods. However, limited interdiffusion was observed in the 
Ni-Cr-Al-Y/Ni based, Ni-Cr-Al-Y/Fe based, and Co-Cr-Al-Y/Fe based systems and 
hence these systems are of particular interest for high temperature applications. 

8.6. Mechanical properties of diffusion and overlay coatings 

Probably the most important property of a coating for service at elevated temperature 
is its resistance to cracking by thermally induced stresses. A summary of the ductile- 
brittle transition temperatures (DBTT) for a range of diffusion and overlay coatings is 
shown in fig. 40, in which it can been seen that the DBTT of the aluminides is higher 
than that observed for many of the overlay coatings. Since the peak tensile surface strains 
on the turbine blades are likely to occur at relatively low temperatures, (Glenny [1974]) 
then from a ductility point of view these diffusion coatings will be inadequate or only 
marginally adequate for relatively high strain applications. 

Unlike the aluminides, the ductile-to-brittle transition of overlay coatings, and hence 
their resistance to cracking, can be modified by varying the coating composition. It is 
evident from fig. 40 that, for a given Cr and Al level, the ductility of Ni-Cr-Al-Y 
coatings is significantly better than that of Co-Cr-Al-Y coatings. The ductile to brittle 
transition temperature is found to increase with Cr and Al level. This is not unexpected 
since coatings with a high volume fraction of Co-Al or Ni-Al shows ductile to brittle 
transition temperatures close to those for the diffusion aluminides. 

This transition from ductile behaviour to brittle behaviour, as the temperature drops, 
is of significance when considering the thermo-mechanical cycles seen in service. Should 
coatings crack on cooling, there is a danger of the crack propagating to the coating/ 
substrate interface. It may arrest there, may propagate along the interface or may run into 
the substrate component, thereby decreasing the component’s life. In an investigation of 
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Ductile/ brittle transition curves for diffusion coatings. 



Ductile/ brittle transition curves for overlay coatings. 


Fig. 40. Ductile/brittle transition curves for a) diffusion coatings and b) overlay coatings. 


a plasma-sprayed Ni-Co-Cr-Al-Y coating on a single crystal super alloy at 650°C, a 
marked reduction in the fatigue life of the coated component at low strain ranges was 
observed (Gayda et al. [1987]). The mechanical behaviour of coated components is 
therefore of major importance but is beyond the scope of this chapter. Readers are 
recommended to consult the papers by Wells and Sullivan [1968]; Strang and Lang 
[1982]; Gayda etal. [1987]; and Au et al. 1990 for further appreciation of the subject. 

8.7. Future trends in overlay coating design 

Although coatings are available that offer adequate oxidation/corrosion resistance for 
many applications, improvements in coating performance can still be made. The drive to 
operate plant hotter, to increase efficiency, coupled with the desire for increased component 
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lives, requires that new generations of coatings be developed. As operating temperatures are 
increased, it no longer becomes possible to achieve the desired service lives using diffusion 
coatings and overlay coatings therefore are the only possible development route. 

For metallic systems, much work is underway to develop overlay coatings containing 
multiple active elements. This acknowledges that although many of the active elements 
(Y, Hf, Ta, Si etc.) produce similar improvements in oxidation behaviour, they function 
by different mechanisms and hence show maximum benefit at different operating 
temperatures. Since high-temperature components encounter a range of temperatures, 
clear benefits are possible from the inclusion of multiple active elements in a coating. 

But even within the M-Cr-Al-X systems, diffusion of elements between the substrate 
and coating can have a major influence on coating performance. Therefore, to provide 
long-term stability it is necessary to develop diffusion barrier coatings to minimise the 
interdiffusion between the coating and the substrate. Some interdiffusion is of course 
necessary to give good adhesion, hence the diffusion barriers must be tailored to limit the 
movement of particular problematic elements. 

Figure 41 illustrates the deposition of an intermetallic diffusion barrier at the interface 
of a high Cr-containing overlay coating and a superalloy substrate (MarM002). The dif- 
fusion barrier was developed to limit chromium movement from the coating into the sub- 
strate (Nicholls etal. [1988]) and hence reduce the likelihood of forming sigma phase 
within the alloy below the coating. Chromium diffusion rates were reduced by one to two 
orders of magnitude over the temperature range 750-1 150°C. Diffusion barrier concept 
based on amorphous silica have also been examined to limit coating oxidation by 
restricting grain-boundary diffusion (Bennett etal. [1991]; Morssinkof etal. [1989]). 

Once the concept of a diffusion barrier is accepted as a method of providing good 
interface stability, one is no longer constrained in the design of the best overlay coating. 
No longer is substrate compatibility a requirement in specifying the overlay coating 
composition. By removing this constraint it is possible to design overlay coatings with 
optimised oxidation or corrosion resistance (Nicholls etal. [1989]; Nicholls etal. 



Fig. 41. Graded M-Cr-Al-Y overlay coating and diffusion barrier (located at the coating/substrate interface) 
system on Mar M002, deposited by EB-PVD. 
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[1993]). These studies have shown that optimum performance is achieved using different 
base alloy compositions. 

Matching the diffusion barrier coating compositions to those of the optimised 
corrosion resistant alloys leads to the concept of functionally graded (or multilayered) 
coatings. The graded coating concept is not new; it was first proposed by Boone in 1979 
(Boone etal. [1979]). However, processing technology was not capable of varying the 
composition of more than one element through the coating at that time. It is now possible 
to vary the composition of all three base elements, Ni, Cr and Al, and also add up to 
three active elements as necessary. Figure 41 illustrates this new generation of EB-PVD 
overlay coating. The coating incorporates a diffusion barrier layer, an interfacial zone 
low in chromium and a zone graded outward in Cr, with a high Cr, high Al corrosion 
resistant surface layer (Nicholls et al. [1993]). For low-temperature hot-corrosion 
resistance the surface composition should ideally be Ni-33Cr-33Al with the presence of 
active elements, possibly in the form of additions of yttrium, silicon and hafnium in 
combination. 

Plasma-sprayed thermal barrier coatings (e.g., fig. 42) have been used since the early 
1950s and have been effectively used to protect piston crowns and valve faces in diesel 
engines and flare head and primary zone sections of combustors against the effects of hot 
spots within turbines, giving considerably improved component lives. Because of this 
success much attention has been directed towards the use of thermal barriers on gas 
turbine blades and vanes (Duvall and Ruckel [1982]); but only within the last decade 
have they been used on highly stressed turbine components within commercial gas 
turbine engines (Fairbanks and Hecht [1987]; Rhys-Jones [1990]). Strain tolerant 
TBC are now being developed which have a columnar grain structure that are produced 
by EB-PVD processing, and fig. 43 is an example of an experimental coating now being 
produced at Cranfield. 

Extending these concepts further, one can envisage the “ideal” coating for hot gas 
paths. It would consist of a strain-tolerant thermal barrier coating, deposited onto a bond 
coat which exhibited good corrosion resistance but, more importantly, matched thermal 



Fig. 42. Air plasma-sprayed thermal barrier coating, A TBC, B bond coat and C substrate. 
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Fig. 43. Secondary electron micrograph of a EB-PVD thermal barrier coating, deposited at Cranfield. 


expansion coefficients. Within this bond coat, composition can be graded to provide the 
required corrosion resistance, coating mechanical properties and a diffusion barrier at the 
interface with the substrate. This conceptual coating is possibly only a few years away, 
for research is underway into many of the fundamentals necessary to produce is 
functionally graded high-temperature protection system. The studies aimed at developing 
strain tolerant ceramic layer have been discussed above. Work was carried out in 
Germany (Fritscher [1987]) to develop bond coats with matched thermal expansion 
coefficients and the possibilities for graded corrosion-resistant coatings and diffusion 
barrier coatings have already been mentioned. 
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1. General considerations 

1.1. Introduction 

Solid-state phase transformations are a central topic in physical metallurgy, since 
almost all industrial alloys are heat-treated after casting to improve their properties. The 
heat-treatment changes the microstructure of the alloy, either by deformation followed by 
a recovery and recrystallization process (ch. 25), or by some type of phase transfor- 
mation. Two main types of phase transformation are found: polymorphous changes and 
precipitation reactions. In a polymorphous change, in for example iron, cobalt or 
titanium, there is a change of the metal’s crystal structure. This affects all the atoms in 
the alloy and has a tremendous scope for changing the microstructure of the alloy. In a 
precipitation reaction, which is crucial in alloys based on aluminium, copper and nickel, 
which have the same crystal structure up to the melting temperature, the main method of 
modifying the microstructure is to alloy with elements that are soluble in the base metal 
at high temperature but which come out of solution at lower temperatures. The particles 
of the new phase are called precipitates. The name comes from the same reaction in 
liquid solutions where the resulting solid crystals, being heavier than the mother liquid, 
do fall out of solution. No such movement under gravity is expected in “precipitation” 
from solid solutions. In a precipitation reaction there is a change from a single phase, a, 
structure to a two phase, a +/ 8, structure. The growth of the {3 precipitate requires both 
a change of composition of the matrix and, in cases where the two phases have a 
different structure, an interface reaction in which all the atoms that are to form the (3 
phase undergo this structural change. In a minority of precipitation reactions such as 
those that precipitate Guinier-Preston (GP) zones, the two phases have the same structure 
and orientation. In such cases, no separate interface reaction is required to grow the 
precipitate, merely a change in composition. 

In all polymorphous and most precipitation reactions, there is a migration of an 
interface between two crystalline phases, and there are two possible modes of interface 
migration. In the first of these modes, atoms make thermally activated random jumps 
across the interface, a “diffusive” mechanism. In the second mode, the daughter crystal 
grows into the parent, by a coordinated shear-type (martensitic) motion of all the atoms 
at the interface. The description “military” suggested by Christian [1965] for this 
second type of transformation is rather appropriate and leads to his equivalent label of 
“civilian” for the uncoordinated interface motion of atoms in the first type of transfor- 
mation. In recent literature, for example Bhadeshia and Christian [1990], the alter- 
native terms of “reconstructive” and “displacive” are used for the two types of method 
for transforming one crystal structure to the new one. The present author prefers the label 
of civilian or reconstructive to diffusive since any reaction such as the formation of 
bainite in steel (a mixture of low carbon, bcc, a, iron plus iron carbide) from single 
phase, solid solution of carbon in fee, y, austenite requires both a change of composition, 
that takes place by a true diffusive motion of individual carbon atom jumps and the y to 
a interface reaction. The interface process could be either military or civilian. If it were 
military then bainite would have to be described as being diffusive and non-diffusive! 
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The civilian or reconstructive types of transformation are the subject of this chapter while 
the military types are the subject of ch. 15. Bainitic transformations, which appear to 
have both a diffusive character for change of carbon concentration and a martensitic 
character for the change of matrix structure, are discussed in § 2.6. The precipitation of 
close-packed hexagonal, silver-rich, Ag 2 Al from an Al-Ag solid solution, which is now 
universally agreed to take place by a combined diffusive long-range composition change 
and a displacive interface reactions is discussed in detail in § 2.2.3 and 2.2.4. 

The same distinction between a military/displacive and a civilian/reconstructive 
transformation is seen for the movement of a grain boundary in a single phase structure. 
Here the military transformation is that of mechanical twinning and the civilian one is 
the usual mechanism of thermally activated grain boundary migration seen in recrystal- 
lization and grain growth. 

1.2. Driving forces — free energy changes 

All structural transformations are driven, at constant temperature and pressure, by a 
reduction in Gibbs free energy, G, from the original to the final structure. The solution 
thermodynamics that are the intellectual base of the analysis of phase transformations are 
described in detail in ch. 5. The definition of G is: 

G = H - TS, (1) 

where T is the absolute temperature, S is the entropy and H, the enthalpy, is given by: 

H = U + PV, (2) 

U is the internal energy, P is the pressure, V the volume of the material. In transfor- 
mations between condensed phases, the second of these terms is negligable with respect 
to the first term. Under these circumstances the difference between G, the Gibbs, and F, 
the Helmholtz free-energies becomes insignificant at least as regards work done against 
external pressure and the symbol F will be used throughout this chapter for the function 
that is minimized during a structural transformation: 

F - U - TS, (la) 

For a polymorphous transformation from a parent a-phase to a daughter /3 -phase, the 
driving “force”, in Joules per unit volume, which has the equivalent dimensions as a 
pressure, in Newtons per unit area, is given in terms of the changes in internal energy, 
A U a p\ and entropy, A S ap \ by: 

= At/;, - T ASl p (3) 

At the equilibrium transformation temperature, T e , A F a p is of course zero. So if A U a p 
and A S a p are effectively constant, at a finite undercooling, AT=(T C - T), the driving 
force is given by: 

= AU:,(Ar/r c ) - Aff^ATA) (4) 

For small undercoolings the constancy of A U a £ and A S a ^ are reasonable assumptions 


References: p. 1497. 
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but for larger undercoolings, a correction should be made to eq. (4) whenever the 
specific heats of the two phases are different. This is readily carried out using standard 
thermodynamic methods, see for example Swalin [1972]. Table 1 gives some typical 
values of the latent-heat changes, A H ap y , from which it can be seen that the driving force 
for solid-state polymorphous phase changes is much smaller than for solidification at 
similar undercoolings. 

For a precipitation reaction in a binary alloy AB, such as the one described by the 
free energy/composition curves in fig. 1, the free-energy changes can be readily 
determined. The overall free-energy change, per mole of the alloy , is I — > J, where J lies 
on the common tangent to the two free-energy curves. However, the free energy change, 
per mole of precipitate, is K L where K lies on the tangent that touches the a-phase at 
I, the alloy composition. The intercepts M and N of the tangents at the atomic fraction 
of B, C B , of C B = 1, are the partial molar free energies, F n , also called the chemical poten- 
tials, /x B , of component B in the a-phase at atomic fractions C 0 and C a , respectively. 
From standard solution thermodynamics, we have; 

F B {C 0 ) = F°+RT\na B (C 0 ), (5a) 

F b (cJ = F b ° + *7 In a B (C a ), (5b) 

where F B ° is the free energy of pure B in its standard state, R is the gas constant and a B 
is the activity of B in the a-phase at the two compositions C 0 and C a . M-» N is then 
given by: 

MN = F B (C a ) - F b (C 0 ) = RTln^f\. (6a) 

If a is a dilute solution with C 0 and C a < 1, as in the phase diagram shown in fig. 2 and 
in most commercial alloys, then the activity coefficient, / B , of B in the a-phase is 
constant (Henry’s Law) so eq. (6a) becomes: 

MN = RT ln(C a /C 0 ). (6b) 



Fig. 1. Free-energy-composition curve of two phases a and /3 showing conditions for nucleation of /3 from 
supematurated a of composition C 0 . 
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Table 1 

Latent heat changes. 


Element 

Transformation 

Latent heat change 
(kJ/g-mol) 

U K) 

Fe 

liquid to solid 

-15.5 

1809 

Ti 

liquid to solid 

-18.9 

2133 

Fe 

austenite to ferrite 

-0.9 

1183 

Ti 

bcc to hep 

-3.5 

1155 


since 

Ob(Co) 

If, again as in fig. 2, /3 is a dilute terminal solid solution of A in B, then KL in fig. 1 is 
effectively MN and the driving force per mole of precipitate, A F a p m , or per unit volume 
of precipitate, A F a/J v , are given by: 

AF” = RT\n(CjC 0 ), (6c) 

AF^ = {/?r/^}ln(C a /C 0 ). (6d) 

Since C a <C 0 in a supersaturated solution, A F afi m and A F a0 v are both negative. The change 
from free energy per mole to free energy per unit volume is achieved through the use of V" 
the molar volume of the /3-phase. Very often the precipitating /3-phase is an intermetallic 
compound so that Cp is significantly < 1 so that a correction must then be made, giving: 

^K;=Y^~^ln{CjC 0 ). (6e) 

For the precipitation of an intermetallic phase such as MnAl 6 from an aluminium-rich 
solid solution, the effect of the modification to eq. (6e) is to decrease the driving force 
very significantly, by 1/7 for MnAl 6 . An extreme version of this effect is found for the 
growth of ferrite, a, from low-carbon austenite, y, in iron-carbon alloys. Since the 
precipitate is solute-depleted rather than solute-enriched, the form of the equation derived 
as here is slightly changed. We now have: 



Fig. 2. Simple phase diagram derived from fig. 1. 
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A F m 

ya 



RT In 


1 

1 -C 0 


(7) 


where the compositions are given in atomic fractions of the solute, which in this case is 
carbon. 

The low driving force for this reaction even when the alloy is cooled well into the 
two-phase, a plus y, region for example when the volume fraction of a can be 0.5 or 
more, makes this reaction very different from most precipitation reactions. The difference 
arises since the driving force for nucleation of ferrite, AF„p, is very small while the 
volume fraction of new phase, {(C y -C 0 )/(C y -C a )} which is also the supersaturation, ft, 
that drives the diffusional growth processes (§ 2.2.2) can be very large. That is, for the 
formation of ferrite from austenite and in other equivalent matrix phase changes, the 
nucleation of the new phase has a much lower driving pressure while growth is driven 
much faster than in most precipitation reactions. This idea is considered in more detail 
in §2.2. 6.2*. 

Finally it should be noted that if the matrix phase is not a dilute terminal solid 
solution the activity coefficient, / B , will not be constant. The correction for this effect 
derived by Purdy [1971] then needs to be included. This gives: 


A FI 


a p 


Cp - C a rt 

1 — e a 


ln(C a /C 0 ). 


( 8 ) 


Here e a is the non-ideality factor, e a = 1 +dln/ B /dlnC a , derived originally to Darken 
[1948] for diffusion studies. This term can be very significant, e a * 1 if the matrix phase 
from which the precipitate is forming is itself an intermediate phase such as /? brass in 
Cu-Zn or Cu-Al or an ordered intermetallic phase such as a 2 in the Ti-Al system. 


1.3. Stable and unstable free-energy curves 

The free-energy/composition curves shown in fig. 1 are both those of stable or at 
least metastable phases, since any alloy on a free energy composition curve where the 
curvature, d 2 F/dC 2 , is positive cannot spontaneously reduce its free energy, except by 
nucleating a distinctly different new phase. In a phase diagram such as that sketched in 
fig. 3, where an a-phase, below a critical temperature T c , decomposes into solute-rich, 
a", and solute-poor, a', phases, the free-energy/composition curve is unstable below T c , 


* Just above the eutectoid temperature (-1000 K) in Fe-C, where the carbon content of austenite is 0.8 wt%C 
(C ? =0.0017), C„ is almost zero. So, while the supersaturation for growth, for an alloy with 0.4 wt% C 
(C 0 = 0.0008), n is 0.5, the driving force for nucleation, A/^”, is only -4 J/g-mole. For precipitation of a 
B-rich phase, such as O’, CuA 1 2 , from an aluminium solid solution at say 190°C with an equilibrium solubility 
C a of 0.0002 and for an alloy composition C 0 of 0.02, then while Cl is only 0.06, the driving force for the 
nucleation of the copper rich precipitate, A F a p°, is 1500 times larger at -6,350 J/g-mole. As a result of these 
differences the polymorphous reaction in iron gives coarse grain-boundary nucleated precipitates of ferrite while 
the Al-Cu reaction gives a very different microstructure — that of very small copper-rich precipitates finely 
dispersed in the aluminium grains, §2.1, giving precipitation hardening. 
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Fig. 3. More complex phase diagram than in fig. 2, showing the development of solid-state immiscibility in the 
a-phase below a critical temperature T c . 

figs. 3 and 4. This instability arises in the part of the curve where the curvature, 
d 2 F/dC 2 , is negative. Within this “spinodaP’ region an alloy of composition i can split 
into j and k and so reduce its free energy (Cahn [1968]). This can occur not only if a' 
and a" are the stable phases but also when as in fig. 4, below T 2 , there is an even more 
stable /3-phase. The formation of the /3-phase requires the system to overcome a 
nucleation barrier. In this case, the solute enhanced region a", although metastable with 
respect to /3, will still form when the rate of its formation is faster than the kinetics of 
the nucleation and growth of the more stable /3-phase. Such regions of solute-enriched 
a", sharing the same crystal lattice as a', are known as Guinier-Preston, GP, zones. 

1.4. Gibbs’s two types of transformation 

From figs. 3 and 4, it is possible to see the distinction between the two types of 
transformation first described by the father of this subject, J. Willard Gibbs. In the first, 
type I, of these reactions, usually described as a nucleation and growth reaction, a small 
region of a new phase, such as /3, forms from within the matrix. The new region, the 
nucleus, has a completely different composition and/or structure from the parent phase, 
a. The nucleus is separated from the parent matrix by an interface which has an 



T < T c 


Fig. 4. Free-energy-composition curve for the phase diagram of fig. 3 at a temperature below the eutectoid 
temperature T 2 . The curvature, d 2 F/dC 2 , is negative in the part between the dashed lines, positive elsewhere. 
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interfacial energy, a, whose units are J/m 2 . This change is one that can be described as 
being large in the magnitude of the local change in structure and/or composition but one 
that is spatially localized to the small part of the matrix which has transformed. The 
majority of the matrix remains completely untransformed at least initially, though 
subsequent growth of the nucleus into the matrix occurs. Other names for such “nucleat- 
ion and growth” transformations are “heterogeneous” or “discontinuous” transformations. 
“Heterogeneous” nucleation and “discontinuous” precipitation, however, have different 
and quite specific meanings within the area of nucleation-and-growth reactions. As a 
result, it is better not to use these names for the whole class of Gibbs Type I transform- 
ations. Details of heterogeneous nucleation and discontinuous precipitation processes in 
various types of nucleation-and-growth reactions are discussed in § 2. 

The second mode of transformation is shown by a composition fluctuation in an alloy 
within an unstable region of its free-energy curve. The result of such a fluctuation is that 
a region of initially uniform composition develops a composition wave whose amplitude 
grows with time, fig. 5, but with, at least initially, a constant wavelength. This second 
type of reaction, a Gibbs Type II transformation, where the initial change in composition 
in any local region is small but where the composition fluctuation is delocalized in space, 
is usually called spinodal decomposition, at least when the wavelength is significantly 
longer than diameter of an atom. When the wavelength is that of an atom, leading 
towards an ordered solid solution the transformation is then usually called continuous 
ordering. Spinodal decomposition is known to occur in many precipitation reactions, for 
example in concentrated Al-Zn alloys (Rundman and Hilliard [1967]). The term 
continuous transformations is now a common name for the Gibbs type II transformation 
in both precipitation and in ordering and this terminology is used here in § 3.* 



Fig. 5. Variation in solute distribution with time during a continuous reaction, Gibbs type-2: spinodal 
decomposition. 


* There is again a possible confusion here since the term continuous precipitation is sometimes used for 
precipitation by a nucleation and growth reaction that does not require a migrating high angle grain boundary. 
Discontinuous precipitation is the nearly universal name for precipitation that does occur behind a migrating 
high angle grain boundary. This name discontinuous precipitation is so commonly used that it will he used 
here, §2.5, even though an alternative name such as a moving two-phase boundary (MTPB) might be less 
confusing. The context of the discussion will usually avoid any difficulty. However, in this chapter, we will not 
use the term “ continuous precipitation ” for a nucleation and growth reaction that takes place without the 
assistance of a moving grain boundary. 
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1.5. First order and higher order transformations 

In most structural transformations in physical metallurgy there is a discontinuous 
change in the slope of the free energy-temperature curve as the structure changes at the 
equilibrium temperature (fig. 6). This type of transformation is described as first order, 
since there is a discontinuity in the first derivative, dF/dT, and thus a discontinuous 
change in the internal energy, giving a finite latent heat, Ehrenfest [1933]. In second 
order transformations, there is a discontinuity in the second differential, d^F/df 1 , and 
therefore in the first differential of internal energy, which is the specific heat. Christian 
[1979] discussed the thermodynamic and mechanistic classifications of structural changes 
of interest to metallurgists and noted that some apparent second order transformations 
appear to show an apparent infinity in the specific heat rather than just a discontinuity. Unlike 
the first order transformations, these apparent second order transformations show an increase 
of specific heat (i.e., a latent heat) as the temperature approaches the critical temperature. 

1.6. Short-range and long-range diffusion 

In polymorphous changes, such as solidification and solid state structural transfor- 
mations in pure (single component) materials, and also during grain-boundary migration 
in recrystallization and grain growth, the only atomic process is the transfer of atoms 
across the interface: This is the interfacial process. For nucleation-and-growth reactions 
where the parent and product phases have different compositions, there are then two 
successive processes; firstly, long-range transport by diffusion over distances of many 
atomic spacings, commonly described as the diffusional process and secondly, atomic 
transport across the interface, normally a thermally activated short-range diffusional 
process which is again the interfacial process. The long-range diffusion only involves a 
fraction of the total number of atoms in the new phase, those required to change the 
composition of the matrix to that of the new phase. The fraction may be nearly 100%, 
for example in the precipitation of a B-rich phase from a supersaturated A-rich solid 
solution (fig. 2). In other cases, like the precipitation of bcc ferrite from dilute fee 
austenite in low-carbon steels where there is only a small change of composition, the 
fraction of atoms taking part in the diffusional process is very much less than one. By 



Fig. 6. Variation of free energy with temperature, showing a first-order reaction at an equilibrium transition 
temperature T e . 
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contrast all the atoms that are to be part of the precipitate must take part in the inter- 
facial reaction. 

Since the two reactions, long-range diffusion and the short-range interfacial step, are 
successive reactions, the slowest of the two processes will be rate-controlling (§ 2.3). This 
may be contrasted with the situation where there are two alternative reactions, for 
example the formation of either (i) the stable phase with the lowest free energy or (ii) a 
metastable phase with a higher free energy. Where there are two or more alternative 
reactions, then the fastest total reaction will determine which reaction occurs and this 
will, in consequence, determine the rate of the transformation and the resulting micro- 
structure. This situation occurs, for example, in precipitation-hardening reactions at low 
temperatures and high driving forces, where metastable precipitates with low barriers to 
nucleation form in preference to the stable precipitate when that is more difficult to 
nucleate. The initial metastable reaction removes much of the driving force for the 
formation of the stable phase, so that the metastable product may then remain for the 
useful lifetime of the sample. 

1.7. Techniques for studying phase transformations 

Since phase transformations produce a change of microstructure, almost any structural 
characterisation technique can be used to study a transformation. Edington [1979] 
reviewed this topic and detailed references to the tremendous range of published work 
can be found there and in the numerous conference reports on the various structural 
characterisation techniques. 

The techniques that have been used include optical, scanning and transmission 
electron, and field-ion microscopy. The increased resolution of the last two techniques 
involves a severe penalty in that the regions studied are highly localized and so the 
structures observed in great detail can be potentially unrepresentative. Diffraction 
techniques are also of vital importance, both those involving general diffraction and also 
those using selected-area diffraction in both transmission microscopy and at even higher 
spatial resolution, scanning transmission electron microscopy (STEM). At lower spatial 
resolution, X-ray Kossel methods, electron channelling patterns and more recently Back 
Scattered Kikuchi Diffraction (BKD) — also called Electron Back Scattered Patterns 
(EBSP) — have been of great value in investigations involving bulk samples. Of major 
importance, particularly for quantitative investigations, are chemical analytical studies of 
highly localized regions. Conventional electron-probe microanalysis allows chemical 
analysis down to scale of 1-2 fim, analytical electron microscopy by either X-ray 
emission or by electron energy loss has allowed routine analysis down to a resolution of 
10 nm and below. Ultimate or near-ultimate resolution has been achieved by the atom- 
probe field-ion microscopy technique, the determination of groups of individual atoms by 
time-of-flight measurements of field evaporated atomic layers (Haasen and Wagner 
[1986], Miller [1987]. Other successful analytical techniques include high-resolution 
Auger electron spectroscopy and ion probe mass spectrometry using focussed ion beam 
erosion. For a detailed comparison of the techniques, a conference on microanalysis at 
high spatial resolution provides a detailed source of information, Lorimer [1981]. The 
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older techniques of measuring average solute levels throughout the microstructure should 
not be neglected since these give additional kinetic information to that provided by the 
more detailed local measurements of composition. Any physical property that is modified 
by the solute level in the matrix can be used for general chemical analysis. For magnetic 
materials the ferromagnetic Curie temperature is often used, at least for studying slow 
reactions or ones occuring at temperatures well above the Curie temperature. Other 
methods include studies of lattice parameter by X-ray, electron or neutron diffraction, 
semi-quantitative measurements of minority phase amounts by Guinier X-ray diffraction 
in which the white X-ray background has been eliminated, thermoelectric power and 
some electrochemical methods. The most commonly used however is that of electrical 
conductivity since, for metals, the presence of solute has the largest effect on electrical 
resistivity, compared to almost all other structural defects. For ionic crystals, increased 
solute such as divalent magnesium ions in monovalent lithium fluoride increases the 
vacancy concentration, so leading to increased ionic conductivity at least in a temperature 
range where extrinsic conductivity is found. This technique allows very accurate 
chemical analysis of the solid solutions in ionic compounds. 

Many of these modem high-resolution techniques have only recently become 
available and are usually expensive in equipment costs, so much less use has been made 
of them than of the very widely used transmission electron microscopy which revolution- 
ized this subject after its introduction in the late 1950s. However, as Edington [1979] 
has shown, these latter techniques, introduced in the 1970s, have made extensive 
contributions to the subject and will continue to do so. Further use of these techniques 
may modify the present situation, which is that in many areas, the theories of phase 
transformations are well ahead of rigorous tests of those theories, at least, as far as 
quantitative experimental studies of the kinetrics of transformation are concerned. 

It is to be noted that there are two distinct modes of investigation of microstructural 
change which are used. In most cases the microstructure is investigated in samples where 
the reaction has been halted by rapid quenching to room temperature. The study of these 
static microstructures allows detailed insight of the microstructure at the highest 
resolution. However, the dynamic development of the reaction with time, in a particular 
region, cannot be followed by the static technique at least in one sample. A series of 
different samples quenched after different times must be used and averaged structural 
information obtained. The alternative technique, of dynamic or in-situ observation of 
structural change in a hot-stage microscope or equivalent equipment allows continuous 
investigation of the transformations in a selected region. This advantage is however, 
obtained at a severe cost of poorer resolution and with severe time limitations. The 
structure often changes before it can be fully characterized. An additional problem for the 
in-situ studies is the worry about modification of the transformation by the presence of 
near-by surfaces and by the effects of the illumination, usually a beam of high energy 
electrons. The use of higher voltage electron microscopy can reduce the first of these 
difficulties, by allowing thicker foils to be used, due to the greater penetration of the 
higher energy electrons. However, point defects especially vacancies can be induced by 
the higher-voltage beam and these vacancies if present at higher concentrations than the 
vacancies in thermal equilibrium can then modify the diffusional kinetics, see for 
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example (e.g., Weaver etal. [1978] and Doherty and Purdy [1983]). In most cases, 
however, a careful use of a combination of techniques allows the potential errors to be 
recognized and overcome and should enable valid conclusions to be drawn. In general, 
the static methods provide more accurate quantitative measurements while the dynamic 
or in-situ methods are better suited for giving qualitative insight into the operating 
mechanisms. 

Other techniques that can give useful dynamic insights into structural changes include 
calorimetry, if heat is being released or absorbed at a detectable rate, dilatometry where 
there are significant volume changes and for most reactions neutron diffraction is 
attractive since large samples can be studied both statically and, most conveniently, 
dynamically at temperature due to the highly penetrating nature of neutrons, (see ch. 12). 

2. Nucleation-and-growth transformations 

2.1. Theory of nucleation 

The basis of the theory of nucleation is that when a new phase forms within a parent 
phase, an interface is formed between the two phases. The formation of the interface 
creates a local increase of free energy when the first atoms assemble in the new 
structure. The theory of this process, which describes the interfacial energy barrier to 
nucleation, was originally developed for physical chemistry by Volmer and Weber 
[1928], Volmer and Flood [1934] and Becker and Doring [1935]. The model was 
applied by Turnbull and Fisher [1949] to solidification of metals and subsequently 
solid-to-solid metallic phase transformations, see Turnbull [1956], Following these 
studies the concept has become fundamental to the qualitative and quantitative under- 
standing of structural changes in physical metallurgy. The theory appears to be, qualitat- 
ively at least, highly successful both for solidification (ch. 9, § 3) and in successfully 
describing the nucleation step in most solid-state nucleation-and-growth reactions. The 
subject of solid-state nucleation has been reviewed on many occasions, for example by 
Kelly and Nicholson [1963], Russell [1970], Nicholson [1970], Christian [1975], 
Russell [1980] and Haasen et al. [1984], with similar conclusions to those advanced 
here. For almost all diffusional solid state reactions the theory is in good agreement, at 
least qualitatively and semi-qualitatively, with a very large amount of experimental data. 
There are, however, rather few fully quantitative experimental tests of the theory; these 
do, however, support the theory at least of homogeneous nucleation to a surprisingly high 
degree. The one exception where the theory has been found not to apply, even qualitatively, 
is for the nucleation of new grains in the recrystallization of deformed materials, see for 
example Doherty [1978], In this case, the nuclei of low dislocation density regions of 
the correct structure pre-exist in the deformed state and can develop by processes that do 
not require a local increase in free energy. The nucleation of recrystallization is discussed 
in ch. 28, § 3.3. For nucleation in martensitic reactions the theory, at least to date, 
appears to have been somewhat less useful, as is discussed in the following chapter. 

The model, for diffusional phase transformations, assumes that a new phase, differing 
from the parent phase in structure and/or composition, is built up, atom by atom, by 
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thermally activated atom transfer across the interface. There is, as a result, a local change 
in the free energy, which is given, in the usual continuum description, as a decrease in 
the volume free energy, due to transfer of atoms from a less stable to a more stable 
phase, and an increase in the interfacial free energy due to the increase of the area of the 
interface between the two phases. In the conventional model of nucleation, the macro- 
scopic values of these parameters are used even though it is unlikely that a cluster of 
only a few atoms would have the macroscopic properties that the new phase will show 
when it has grown to a size much larger than atomic dimensions. The total change of 
free energy, A F„ when a new phase of volume, V p , and interface area, A p , forms, often 
causing an additional increase in elastic strain energy per unit volume of precipitate, 
AF E , is given as: 

Af; = V^F; p + Aptr + V^AFg , (9) 

where a is the specific energy of the a-fi interface and A F a p is the free energy change 
per unit volume of the new (3 phase, given by the macroscopic molar free energy change, 
AF^, divided by the molar volume, V^, of the new phase. The first and last terms in eq. 
(9) are both dependent on the volume of the nucleus and are treated together. Only if the 
first term is larger (more negative) than the third term, which is always positive, can the 
reaction proceed. The volume of the nucleus increases as the third power of the nucleus 
size (47rrV3 if the nucleus is a sphere of radius r), while the area term, which is always 
positive and acts as the barrier to the process, increases only as the second power of the 
nucleus size {4m 2 for a sphere). Figure 7 plots eq. (9) for a spherical nucleus, and shows 
that at small radii the nucleus shows an increase of A F, and so is unstable and tends to 
redissolve in the matrix. Only when the new phase has reached the critical radius, r*, 
and has acquired the critical free energy increase, A F*, does the nucleus become 
potentially stable, so that, after the addition of one further atom, r>r*, it can grow with 
a continuous decrease of free energy. For a spherical nucleus the values of the critical 
parameters are given by: 

t* = -2<t/(af; + AF e v ), 

. 16 7 TO 2 


AF t \ 


Fig, 7. The nucleation problem; free energy of a sphere of new phase in a supersaturated matrix, as a function 
of the radius of the sphere. 
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The required values of A F a p v for use in these equations can be obtained from A F a ^ m , for 
example from eqs. (4), (6c), (6e) (7) or (8), by dividing the value of A F a0 m , by the 
volume of a gram-mole of the new phase, V m . For most metallic phases, the volume of 
a gram-mole, containing an Avogado’s number of total atoms, of any metallic phase is 
close to 10' 5 mVg.mole. The problem is to find the appropriate value for A F E V ; this is 
discussed in outline § 2.1.2, but for mathematical simplicity for the rest of § 2.1 it will be 
assumed that the strain term is negligible, as it will be for example in solidification from 
most metallic melts, at least above the glass transition temperatures In all cases the 
correct values of r* and AF* can be recovered by simple addition of A to A F a p. 

The rate of nucleation per unit volume, I a p, can be obtained from the simple 
application of statistical mechanics to show that if the critical formation energy of a 
nucleus is AF* then, at equilibrium, the concentration of critical-sized nuclei, n* in a unit 
volume is given by: 

if = AT exp(-AF*AT), (11) 

IV* is the number of atomic sites per unit volume on which the assembly of the nucleus 
could have started, for homogeneous nucleation this is all atom sites. The rate of 
nucleation is then the product of this concentration of critical nuclei and the rate of 
atomic addition to the nuclei to make them just supercritical: 

r ap = N v A*v exp(-A FjkT) exp(-AF*Ar), (12) 

A* is the number of atomic sites on the surface of the critical nucleus to which an atom 
can join after overcoming a growth barrier A F A (§2.2.1) and v is the atomic vibration 
frequency of an atom in the matrix at the interface, usually assumed to be of the order 
of 10 13 /s. In most cases A* is assumed equal to the total number of atomic sites on the 
interface but as discussed in § 2.2.3 this may not always be valid and atoms may be able 
to add to a nucleus only at ledges on the interface. 

The simple theory above needs to be modified to take into account the expected loss 
of critical-sized nuclei by their growth, during nucleation, into supercritical regions. This 
loss reduces the value of N* to less than the value expected at equilibrium, eq. (11), by a 
factor, Z, which is usually about 0.05 (Russell [1970]). The near constancy of this 
correction arises since the rate of formation of critical-sized nuclei and their growth to 
become supercritical both occur at a similar rate determined by atomic diffusion onto the 
growing cluster of atoms. Inclusion of a factor of exp {-r/t) to allow the achievement of 
the expected steady state rate of nucleation, F s , yields the result given by Russell [1970]: 


C = exp(-r//) ZA/ v A*v exp(-AFjkT) exp(-AF*AT) 

= exp(-r//) 7 S V , 

(12a) 

Z = {AF*/37rkTn* 2 )' n . 

(12b) 


There are many problems with the use of the theory to predict experimentally measured 
quantities such as the final density of the new regions of the product phase. These 
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difficulties include the usual lack of knowledge of the interfacial energy a which plays 
such a vital part in determining I a (I a f is proportional to the exponential of a raised to 
the power 3!), the difficulty in calculating the strain energy term in eq. (10b), and the 
difficulty in calculating how the driving force for the phase change, A F a f, falls as the 
reaction proceeds, so that the rate of nucleation can be integrated to give a total number 
of growing nuclei. Finally, as discussed in § 2.4.2, a significant number of nuclei will 
dissolve up, by a process of Ostwald ripening, during the precipitation reaction itself. 
These problems are discussed by Servi and Turnbull [1966], Langer and Schwartz 
[1980], Kampmann and Wagner [1984] and Haasen and Wagner [1992], However, 
despite these quantitative difficulties the theory is extremely useful in describing a large 
volume of experimental observations. A major success of this theory is its ability to 
account for the observation that nucleation increases from a rate that is almost undetec- 
table to rates that are too fast to measure over a narrow range of undercooling, AT in 
eq. 4 or supersaturation, C</C a in eq. 6d. This is readily shown by use of eq. (11) for 
a matrix phase change driven by a free-energy function of eq. (4). Under these circum- 
stances the concentration of critical nuclei is given by: 

log n* = 28 - K/AT 2 (13) 


This arises since the number of atomic sites in a cubic metre is about 10 28 and K is a 
constant given by: 


K _ 16 TTOr^X 
3 A H^kT ' 


(14) 


If the undercooling which gives a density of nuclei of 1 (10°) rrf 1 is AT(0) and that 
which gives a density 10 6 times as high is AT(6), then substitution into eqs. (13) shows 
that: 


AT(6)/AT(0) = (28/22) 1 ' 2 = 1.1. (14a) 

That is, a 10% increase in supercooling will increase the density of critical-sized nuclei, 
and therefore the rate of nucleation, by a million times. The constant K in eq. (14) can 
be easily evaluated if the undercooling that gives a measurable rate of nucleation is 
known, by substitution of a value of N* of about 10 3 irf 3 in eq. (13). This approximate 
value of K allows rather an an accurate estimate of the effective interfacial energy, a, to 
be made using eq. (14). In many cases, for example in solidification of pure metals, the 
determination of the interfacial energy by use of eq. (13) has given reasonable values of 
this energy even when the undercooling appears to have been significantly under- 
estimated (Cantor and Doherty [1979]). Since eq. 12 also contains the activation 
energy of diffusion, exp(-AF A /fc7), the initial increase of nucleation rate with fall of 
temperature ceases, giving a maximum rate. Below the maximum, the nucleation rate 
falls rapidly as the temperature falls. This variation of nucleation rate with temperature 
(together with a similar behaviour for growth) causes the so-called “C-curve” behaviour 
seen in time-temperature-transformation, TTT, diagrams, §2.5.1. 

Another major success of the theory of homogeneous nucleation is its ability to 
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understand a very large range of qualitative experience in which, in many solid state 
precipitation reactions, the phase that forms is not the equilibrium structure. It is 
frequently found that a metastable phase, characterized by good atomic fit with the 
matrix and therefore a low value of interfacial energy, c, is the one that forms. The 
theory is compatible with this phenomenon since the rate of nucleation is so sensitive to 
o-; the successful precipitate will be the one that nucleates fastest, and this is usually the 
precipitate that has the lowest interfacial energy, even though it may not yield the most 
stable phase. Similarly, nucleation theory accounts for the fact that when a new crystal- 
line phase forms in a matrix of a crystalline parent phase there is almost always a 
particular orientation-relationship between the phases, see for example Barrett [1952], 
This relationship appears to be the one that minimizes the value of the crucial parameter, 
the interfacial energy, in eq. 10b. This idea is discussed further in §2.1.2. 

Finally it should be recognized that additional qualitative support for this picture of 
nucleation is provided by an apparent failure of the model. The failure arises since in 
many cases nucleation occurs at much smaller undercoolings than expected. In these 
cases it is found that the new phase forms on some defect in the parent phase, for 
example a dislocation or a grain boundary. The second term in eq. (9) is significantly 
reduced by the fact that the new phase has consumed some of the existing energy of the 
defect, thereby reducing the barrier to nucleation. This topic of heterogeneous nucleation 
is discussed in §2.1.4. 

2.1.1. Interfacial structure and energy 

A major difference between the interfaces produced by nucleation during solidificat- 
ion from a liquid or glassy phase, and by nucleation in solid-state reactions, is in the pos- 
sibility of atomic matching across an interface between two crystals. For crystal-crystal 
interfaces the atomic matching ranges from perfect, in fully coherent interfaces, through 
less perfect matching in semicoherent interfaces, to random matching in fully incoherent 
interfaces. The atomic structure of such interfaces is discussed in ch. 9, §2.3.3. 

A simple example of a fully coherent interface is one between two phases that have 
the same crystal structure, lattice parameter and orientation, giving an orientation 
relationship often loosely described, for systems with cubic matrix and precipitate, as 
“cube-cube”. For fully coherent interfaces, in a binary AB alloy, the only contribution 
to the interfacial energy, a, comes from the higher energy of the unlike A-B bonds 
across the interface. Since the system is one of limited miscibility between the two 
phases, it is expected that the unlike bond energy h(AB) will be greater than the average 
energies of the bonds, |7z(AA) + /i(BB)]/2, as described in any discussion of the so-called 
quasi-chemical bond model (e.g., Swalin [1972]). 

Such completely perfect matching is unlikely to be found and deviations from 
coherency can develop as the extreme conditions of perfect matching described above are 
relaxed. In most reactions there will be a small difference in the lattice parameters of the 
two phases, a a and a^. This gives rise to a misfit, 8: 

8 = 2 (a p - a a )/(a a + a p ) (15) 

For small misfits and very small precipitates, the difference in atomic spacing can be 
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taken up by elastic strain in the two phases; but with larger precipitates the system can 
lower the resulting elastic strain energy by concentrating the strain in a cross-grid of 
edge dislocations spaced a distance d apart (fig. 8). Over the dislocation spacing, d, there 
are n atomic planes in the phase with the larger parameter and n + 1 planes in the other 
phase. When the misfit is much less than unity, then: 

n = 1/8, (16) 

and 

d = (a a + a fi )/28. (17) 

For such a semicoherent interface the interfacial energy has two components; a “chem- 
ical” term as discussed for the coherent interface and an additional “structural” term, due 
to the cross-grid of edge dislocations. For small precipitates, where the diameter of the 
nucleus may be less than the dislocation spacing, d, the nucleus will remain coherent, 
though elastically strained, and consequently will have a higher solubility because the 
elastic energy raises the volume free energy of the precipitate phase (Cahn [1968]). 

An incoherent phase boundary, or a phase boundary with poor atomic fit and high 
energy, will have an atomic structure rather like that of a high-angle grain boundary, 
whose misorientation is well away from any possible coincidence-site boundary 
relationships (ch. 9, §2.2. 1.1). Interfaces with poor atomic fit in two phase systems are 
likely to arise in one of two main ways: The first way is if, after nucleation of the 
precipitate, the orientation of the surrounding matrix phase is changed by the passage of 
a high-angle grain boundary in a recrystallization or grain-growth reaction. This 
phenomenon has been discussed by Doherty [1982] with examples of the resulting 
change in interfacial properties and consequent microstructural changes. The other main 
origin of incoherent interfaces arises when the two crystals, a and /3, do not share a 
common crystal structure. This is the usual situation in most precipitation reactions and 
is inevitable in all polymorphous changes. As is discussed in the next section, under 
these conditions part at least of the interface is almost certain to have very poor atomic 







© 

9 • * © 

• 

• • • 

• 

9 

© © © © 

© 

© © © 

© 

© 

© ^> © © 

© 

© ^ © 

© 

O 

o o o 

o 

o o 

o 

o 

099 

i*' a ot 

o 

o o 

o 

© B atoms Q A atoms 





Fig. 8. Semicoherent interface between two phases with the same structure but different atomic spacing and 
containing a set of edge dislocations a distance d apart. 
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matching giving an interface that can be regarded as incoherent or at least as having a 
poor atomic fit and a high specific interfacial energy. 


2.1.2. Equilibrium shape 

When the nucleus and the matrix have different crystal structures, it is almost 
invariably found that there is a definite and reproducable orientation relationship between 
the phases. Barrett [1952] gave a very complete listing of the orientation relationships 
for many reactions. The orientation relationship appears to allow a good atomic fit to 
develop between the two crystals either along a particular plane, Westmacott and 
Dahmen [1984], or occasionally along a particular direction, Dahmen and Westmacott 
[1986], Luo and Weatherly [1987] and Dahmen [1994], Typical examples of these 
orientation relationships are provided by fcc-hcp/matrix-precipitate pairs in systems such 
as Al-Ag 2 Al and in the Cu-Si system. These precipitate matrix pairs show a good fit 
between (111) in fee and (0001) in hep. These precipitates form with these planes 
parallel and they provide the habit plane of the flat plate-like precipitate crystals that 
form in these reactions (e.g., fig. 24 below). The atomic fit between the two structures 
is shown in fig. 9, where it can be seen that the two structures, the matrix, a, and the 



Fig. 9. Atomic structure of fcc-cph matrix-precipitate structures in the Al-Ag 2 Al system; (a) the fee matrix; 
(b) y' intermediate precipitate, fully coherent; (c) equilibrium Ag 2 Al, partially coherent; (d) the fee {111} 
stacking sequence. 
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metastable precipitate, y' , have identical atomic arrangements on the habit plane, so 
allowing full coherence in two dimensions. The two equilibrium phases have slightly 
different lattice parameters but the intermediate metastable precipitate, y\ has the same 
atomic spacing as the aluminium matrix, in the close-packed plane, and so shows full 
coherence, and consequently very low energy, for the habit plane (the structural 
component being zero). Precipitates are, however, three-dimensional and the interface 
must lie at all possible orientations around the crystal. Consequently, the non-habit plane 
parts of the interface, the rims of the plate-like crystals, have much poorer atomic 
matching and higher interfacial energy. In other words, the magnitude of cr will vary 
significantly around the precipitate. This can be represented in a section through the 
cr-plot as shown in fig. 10. The cr-plot is a radial plot of the interfacial energy as a vector 
whose length varies as a function of the orientation of the interface of the precipitate. 
The deep cusps in the plot occur at the orientations of good fit, corresponding to the 
habit plane. Aaronson etal. [1968] calculated that the ratio of energy between the good 
fit and the rest of the interface is 1 : 2.7 for the equilibrium Ag 2 Al phase in Al; similar 
calculations for the intermediate phase give a higher ratio of 1 : 10 (Ferrante [1978]). 
For anisotropic cr-plots such as the one seen in fig. 11, the Gibbs-Wulff theorem (see for 
example Mullins [1963] and Martin and Doherty [1976]) predicts the equilibrium 
shape of the precipitate, for the orientation relationship that gives the particular aniso- 
tropic cr-plot. This shape is the inner envelope of so-called Wulff planes that are planes 
drawn perpendicular to the vector from the origin at the intersection with the cr-plot, 
fig. 11. The Wulff plane has, of course, the orientation of the interfacial plane whose 
value of cr is OB. Figure 11 shows as an example the Wulff plane, LBM, for the cr 
vector, OB, and also shows the equilibrium shape of the precipitate, which provides for 
facet planes at the cusps, C lt C 2 , C 3 these facet planes are the good-fit interfaces. 

The aspect ratio. A, of plate like precipitates, defined as the length-to-thickness ratio, 
is equal to the energy ratio <J incohercn /cr coheri . m . The cusp gives a good-fit plane where there 
is good match of crystal planes as in the fcc-hcp system discussed and in many other 
precipitation reactions, for example Al-0' in the aluminium-copper system which has 


n 



Fig. 10. The cr plot, the variation of interface energy with the orientation of the boundary plane for the 
particular orientation that allows good fit at the cusp plane, cr 2 . 
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Fig. 11. The Wulff construction. The Wulff plane at B is LBM, that is normal to OB at B. The equilibrium 
shape is the inner envelope of Wulff phases, in this case, the cusps C„ C 2 and C 5 . 


habit planes on the matrix (100) planes (fig. 15, below). For systems like the fcc-bcc 
system in, for example, the iron-copper system, there is only a good-fit direction, which 
is the close-packed direction <110> in the fee phase and <11 1> in the bcc. This form of 
matching of directions provides the well-developed needle morphology, with an aspect 
ratio of 5 : 1, reported by Speich and Oriani [1965] for the Fe-Cu system. For good-fit 
directions, the tr-plot, schematically given in two dimensions by fig. 10, must be rotated 
about the vertical axis, o- t , to produce the appropriate three-dimensional needle or rod- 
like form: For plate-like crystals the three-dimensional plot is obtained by rotation about 
the horizontal axis, a 2 . In other examples the interface between fee and bcc, for example 
in steels, remains plate-like, see for example Aaronson et al. [1970]. 

Since the ends of needle crystals, and the rims of plate-like crystals, have poorer 
atomic fit than the coherent needle axis or habit planes of the plates, it is commonly 
assumed that these ends and rims are fully incoherent. This may be usually the case but 
it need not necessarily be true since in the fcc-hcp pair Al-Ag 2 Al it has been seen that 
the rims of the plates are also faceted, Ferrante and Doherty [1979]. This implies that 
the rim is also coherent, though with a poorer fit made up by the cores of Shockley 
partial dislocations, Howe etal. [1985], Christian [1994] shows that a precipitate in a 
matrix cannot be fully or partially coherent on one part of the interface and incoherent 
elsewhere so the rims of all plate-like precipitates should be considered coherent but to 
have a high density of dislocations, giving high energy and mobility. 

It is usually assumed during nucleation that the nucleus forms with the minimum 
interfacial energy, since any other assumption increases the ffee-energy barrier to 
nucleation and strongly reduces the rate of nucleation. The assumption of minimum inter- 
facial energy requires the nucleus to have the equilibrium shape, this will be plate-like 
or needle-like if the a plot shows the normal anisotropy for two crystals with different 
crystal lattices. If the plot is, however, fully isotropic, the expected shape of the nucleus 
will be a sphere. In two examples of “cube-cube” relative orientations between crystals 
with the same structure, nearly spherical equilibrium shapes have been reported. In the 
bcc-bcc beta-gamma brass system studied by Stephens and Purdy [1975] and in the 
all-fcc system of Ni-Ni 3 X (where X can denote a mixture of several atoms, usually Al 
or Ti) (see Ricks et al. [1983]), equilibrium shapes that were nearly spherical were re- 
ported. In the latter case, it seems that the precipitate develops the usually reported 
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cuboid morphology only with growth, and in systems when there is some small elastic 
misfit, 5. Both the brass and the nickel systems can evolve into a dendritic morphology 
during growth under condition of very low supersaturation but this is a growth form not 
an equilibrium shape! The shape instability leading to dendritic growth is discussed in 
§2.5. 

For plate-like precipitates with an equilibrium value of the aspect ratio A, that is, with 
A = 0 -/ 0 - 2 , the critical values of the plate radius, a(y)* and nucleation barrier, A F* t are 
found (see Aaronson etal. [1978]) to be given by: 


a(yf = -88,/{3(A^ + AF e v )}, 

(18) 

A F* = 256tto]/[21a(AF^ +AF e v )} 

(19) 


<7, is the interfacial energy of the poor fit rim and o 2 that of the good fit habit plane. 

2.1.3. Strain energy 

Strain energy plays a vital role in the nucleation of solid-state phase changes, as 
shown by eqs. (9)— ( 19). There appear to be two different types of strain that can be 
developed by the formation of a new phase in a matrix phase. The first type of strain 
occurs only when both phases are crystalline and is that caused by a misfit, due to 
different lattice parameters — eq. (15), between two coherent phases. The second type 
of strain is that caused when the new phase occupies a different volume from the region 
of the matrix that it has replaced. This second type of strain can arise in various ways. 
One of these is when the volume per atom is different in the two structures and the 
precipitate grows without a change in the number of atom sites. An example would be 
the nucleation of the less dense bcc ferrite from austenite in pure iron. A further method 
for the development of this volume strain is if there is a significant difference in the rates 
of diffusion of two components of the alloy. An example is when zinc-rich y-brass 
precipitates from /3-brass when there is a more rapid inward flux of the faster diffusing 
zinc atoms than the compensating outward movement of copper atoms. This increases the 
number of atomic sites in the region where y-brass is growing, giving a dilational strain. 
This effect has been directly observed on the surface of a brass alloy, by Clark and 
Wayman [1970] who showed that the y-phase stood above the surrounding surface. In 
the interior of the sample this displacement would produce an elastic strain unless 
relieved by plastic deformation. A similar increase in volume arises during growth of 
Fe 3 C in Fe by rapid interstitial diffusion of carbon upto the growing precipitate. 

A local strain around a precipitate will produce a large energy increase if the strain 
is taken up elastically since the elastic moduli of metals are large. For aluminium where 
Young’s modulus is 70 GPa, a 1% strain would give an elastic energy of 3.5 MJ/m 3 (35 
J/g-mole), a 5% strain would give an energy 25 times larger. In the case of a coherent 
interface with a finite S, the long range elastic distortions can be reduced plastically by 
the introduction of dislocations into the coherent interface, but at the cost of raising the 
interfacial free energy. For the second, dilational, type of strain the elastic energy can be 
relaxed either plastically, by dislocation motion — usually the glide of prismatic edge 
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dislocation loops, or by the diffusional motion of vacancies. Both of these release 
mechanisms, especially the diffusional motion of vacancies, occur more easily at high 
temperatures. 

Two limiting conditions are of interest to consider. The first condition would be for 
precipitation at low temperatures where there will usually be a high free-energy driving 
force (eq. 6d), since the equilibrium solubility C a is usually very small at low temperat- 
ures. The second condition occurs for high-temperature reactions, that occur with much 
lower driving forces. With small driving forces then the elastic strain energy opposing 
nucleation can easily be larger than the free-energy decrease driving the reaction. For 
example, precipitation of A1 Cu 2 from an aluminium-copper alloy at 800 K with a typical 
high-temperature supersaturation ratio C</C a = 1.1, gives a A F a p of only about 200 J/g- 
mole which would be offset by an elastic strain of only 2.5%. For high temperature 
precipitation it is likely that the volume strain must be accommodated by diffusion, or by 
plastic accommodation. Both of these processes become faster at higher temperatures. 
For low-temperature precipitation-hardening reactions, however, the supersaturation ratios 
are very much larger, so the elastic strain energy is more easily provided by the large 
free-energy change driving the precipitation. Electron microscopy has shown that such 
elastic distortion does indeed occur under these low-temperature, high-supersaturation 
conditions (see Kelly and Nicholson [1963]). The resulting elastic strains are a 
significant part of the strengthening response of such precipitation. 

Nemoto [1974] found, by direct observation using in-situ high-voltage electron 
microscopy, that during dissolution of cementite by decarburization of an Fe-C alloy at 
700°C, with high rates of dissolution, part of the change of volume of the reaction was 
provided by intense localized dislocation motion. For slower rates of dissolution such 
dislocation movement was not observed and the strain was assumed to be accommodated 
by movement of iron vacancies. The analysis of the coupled diffusion of substitutional 
iron atoms and rapid interstitial carbon atoms to relax localized dilational strain has been 
provided by Oriani [1966] for the case of precipitate coarsening in the same, iron- 
cementite, system. A further example was provided by Makenas and Birnbaum [1980] 
who reported that there was very significant plastic deformation during hydride precipit- 
ation in the niobium-hydrogen system. They charged hydrogen electrolytically into 
previously prepared thin foils of niobium and observed precipitation by TEM as the foils 
were continuously studied during cooling. Precipitation of hydride occurs by rapid 
interstitial diffusion of hydrogen causing a large expansion (AV/V of 18%) at temperat- 
ures too low for diffusional relaxation of the metallic atoms. Around each precipitate 
there was an intense tangle of dislocations with additional prismatic dislocation loops 
punched out in the <11 1> slip directions. During resolution of the hydrides, the plastic 
strain was not reversible and this led to a large temperature hysteresis between precipitat- 
ion and resolution. Similar examples of plastic deformation during other low-temperature 
hydride precipitation reactions were reviewed by Makenas and Birnbaum [1980]. It 
would be of interest to know if the interfacial change from the metal structure of 
niobium to the hydride structure during the low temperature hydride precipitation were 
civilian or military interface reactions, § 2.6. 

Lee etal. [1980, 1983] reviewed the current theoretical and experimental results on 
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the influence of elastic and plastic strains on precipitation in general, and nucleation in 
particular. For the elastic situation the detailed theoretical analysis shows that the elastic 
energy is minimized when the precipitates are formed with a thin disc- or plate-like 
shape. This is the morphology revealed by electron-microscopic studies in all cases 
where Guinier-Preston (GP) zones form when the solute in the zone has a different 
atomic size from the matrix atoms that the solute replaces. Disc-like zones are found for 
example in aluminium-copper and copper-beryllium alloys where there are significant 
size differences between the atoms. However in systems like aluminium-silver, where the 
atoms are almost the same size, spherical zones are found (Kelly and Nicholson 
[1963]). The shapes found for the intermediate and equilibrium precipitate in the Al-Ag 
system are plate-like — but this is due to the fact that the later precipitates have a dif- 
ferent, hexagonal, structure from the fee matrix. The observations on GP zones are there- 
fore much more relevant to understanding the influence of elastic strain on precipitate 
shape than are observations on shapes of metastable precipitates, where there is a 
complication due to differences in precipitate crystal structure. Finally it should be noted 
that, as expected, the habit plane of misfitting disc-like zones appears to be in the matrix 
direction with the lowest elastic modulus, <100> in most cubic metals (Wert [1976]). 

Lee et al. [1980] also discussed the expected plastic yielding that can occur around 
a spherical nucleus under the assumption of isotropic elasticity. They assumed that for 
precipitates larger than about 1 /zm the macroscopic yield stress will operate. For smaller 
precipitates, however, it is likely that the yield stress will be raised by the lack of 
dislocations in the sub-micron regions. By use of a model proposed by Ashby and 
Johnson [1969], in which dislocations can be nucleated with the help of a super- 
saturation of point defects, Lee etal. [1980] were able to derive an expected yield stress 
for flow, as a function of the precipitate radius. Their analysis gave results that agreed 
well with a range of experimental results, as regards the critical size of precipitate which 
went from coherent, elastically strained, to semi-coherent, plastically relaxed. 

2.1.4. Heterogeneous nucleation 

In many examples of nucleation in solids it is found that the nucleation sites are not 
distributed randomly, but are concentrated at particular sites in the matrix, usually at 
some type of crystal defect. The defects that act as the sites for heterogeneous nucleation 
include grain boundaries, dislocations, stacking faults or the interfaces of previously 
formed precipitates. These sites were described in some detail by Nicholson [1970]. 
Cahn [1956, 1957] provided the currently accepted analysis for nucleation of incoherent 
precipitates on grain boundaries and on dislocations. The analysis is based on the idea 
that the nucleus grows on the pre-existing defect so that the energy of formation of the 
nucleus is reduced by that proportion of the defect energy that is consumed by the 
nucleus as it forms. For nucleation on the boundary between two matrix grains (fig. 12) 
the grain boundary that has been destroyed is shown as the dashed line within the new 
crystal. The critical parameter is the semi-angle 6 determined by the ratio a aa /2a a /3 , 
where cr aa is the grain-boundary energy and <r a/j is the incoherent energy of the interface 
between /3 precipitate and the a matrix: 
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Fig. 12. (a) The double hemispherical cap model for the expected shape of an incoherent precipitate at an a-<x 
grain boundary showing the destroyed a-a grain boundary as the dashed line, (b) The Wulff construction for 
this situation. (After Lee and Aaronson [1975]). 


cost) = cr aa /2a afl . (20) 

When the new phase “wets” the grain boundary, which occurs when 2tr a p <(r aa , giving 
cos 6 = 1, then there is no barrier to nucleation. This often arises when the second phase 
is a liquid, as occurs for example in the hot shortness of steels in the presence of molten 
iron sulphide, and in other cases of liquid-metal embrittlement. In general, however, grain 
boundaries in metals are not “wetted” by crystalline second phases and so some fraction 
of the barrier to homogeneous nucleation remains in the case of heterogeneous nucleat- 
ion: The barrier is given by: 

A F* (het) = AF* (1 - cost)) 2 (2 + cos0)/2. (21) 

For nucleation at grain edges (the lines where three grain boundaries meet) and at grain 
comers (where four grains, and four grain edges, meet) there are an even larger 
reductions in the barrier to nucleation (Christian [1975]). Such reductions in the barrier 
to nucleation do not automatically lead to a large increase in the nucleation rate, since 
there is also a dramatic reduction in the number of suitable sites for nucleation to start 
[the term N v in eq. (11)]. If the mean grain diameter is d g , the thickness of the grain 
boundary is 5 b and the volume of an atomic site is V a then the number of nucleating sites 
per grain declines approximately as d g / V a : d 2 8y V a : d g 8 b 2 / F a : <5 b 3 / V a : as we consider 
homogeneous nucleation, grain-boundary nucleation, grain-edge and grain-comer 
nucleation, respectively. For a grain size of 50 /xm and a boundary thickness of 0.5 nm, 
the ratios are 10 15 : 10 10 : 10 s : 1. These are rather significant reductions in N v . The result 
of the two effects, the reductions in both A F* and in N v , is that for nucleation under a 
large driving force and with a small interfacial energy, homogeneous nucleation is 
expected to dominate, giving a high density of grain interior precipitates. With smaller 
driving forces or with higher values of the interfacial energy (especially if no good 
atomic matching between the two phases is possible) then the consequent large value of 
A/* in eq. (11) requires heterogeneous nucleation, at suitable sites for any significant rate 
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of nucleation to be possible. This conclusion is in very good qualitative agreement with 
a huge amount of experimental evidence for the sites of nucleation in many systems 
(Nicholson [1970]). It is always found, for example, that with a small supercooling into 
a two-phase field giving a low driving force, nucleation occurs solely at grain comers 
and edges and only at somewhat higher driving forces is nucleation found at grain- 
boundary sites and eventually at grain-interior sites. Moreover, it is very commonly 
found that the grain-interior nucleation is not homogeneous, since dislocations are also 
strong heterogeneous nucleation sites (Cahn [1957]). It appears that only for very large 
driving forces indeed, such as by diffusion of a third element into a binary alloy to give, 
for example, internal oxidation or nitriting (with a solute whose oxide or nitride has 
extremely low solubility) can homogeneous nucleation be expected to occur as the fastest 
process for largely incoherent interfaces. Even here it is likely that nucleation on 
dislocations will often dominate. Only for nucleation at densities that correspond to 
precipitate spacing significantly less that the typical dislocation spacing of about l/itm 
(10 18 precipitates/m 3 ) is homogeneous nucleation probable. Heterogeneous nucleation at 
closer spacings is still possible either by closely spaced precipitates strung out along 
dislocation lines or by some type of autocatalytic nucleation such as has been reported 
for 6' in Al-Cu (see next section) by Perovic et al. [1981] where the elastic strain of 
one precipitate plate facilitates the nucleation of a closely spaced neighbour. 

In most cases there is insufficient driving force for homogeneous nucleation except 
with largely coherent interfaces, as in GP zone formation. Most intermediate precipitates, 
even with a good fitting plane or direction, appear to nucleate either on dislocations or, 
in some two-stage heat treatments, on pre-existing GP zones (Lorimer and Nicholson 
[1969] and Jacobs and Pashley [1969]. Precipitation of a third phase, y, at the interface 
of a matrix, a, and an existing precipitate, /3, is very similar to that at an aa grain 
boundary, fig. 12, except that the interfacial energies, <r ay and a^ y , are not equal, so there 
are two angles, and 0 2 , given by eqs. 20a and 20b. 

ff u , = cos 6 l + (Tp y cos d 2 , (20a) 

cr y sin 0, + (Tp y sin d 2 . (20b) 

The classical theory of heterogeneous nucleation, outlined above, assumed that the energy 
of the interfacd between the two phases, tr Q(3 , has a constant value. This is a reasonable 
assumption for systems where there is no significant atomic matching but this is only 
likely if one of the two phases is non-crystalline, for example the internal oxidation of 
copper-silicon to precipitate amorphous silica. In most precipitation reactions, two 
crystalline phases are involved with a very high likelihood of some atomic matching. It 
is not known how great difference this can make to the theory of nucleation on dis- 
locations but for nucleation on grain boundaries it is known that a significant difference 
from the classical theory is produced. This arises since, if the precipitate has an 
orientation relationship giving good atomic fit with one crystal, then it is very unlikely 
to have any atomic matching with the other grain at a grain-boundary nucleation site. 
The assumption of a constant interfacial energy is then invalid. 

The situation of grain-boundary nucleation with lattice matching has been discussed 
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Fig. 13. The Wulff construction for a precipitate with a low-energy coherent interface BH, cr^c, between the 
grain-boundary precipitate and the upper grain at a particular angle <£ between the habit plane and the boundary 
plane (after Lee and Aaronson [1975]). 


by Lee and Aaronson [1975], who developed a modification to the Gibbs-Wulff 
theorem for grain-boundary nucleation (figs. 13 and 14). They showed that the expected 
shapes are very different from that of fig. 12 and consequently the effectiveness of grain- 
boundary nucleation sites is very different from that given by eq. (21). Lee and Aaronson 
then found that the barrier to nucleation is reduced still further, by factors of upto 100, 
depending on the ratio o r cohwm /o- illcotoreilt and the orientation of the facet plane with respect 
to the grain-boundary plane. The largest reduction occurs when the facet plane is nearly 
parallel to the grain boundary, with the angle <f> tending to 0° (fig. 13). 

For heterogeneous nucleation of a j8 phase on aa grain boundaries, where in almost 
every case there is no orientation relationship between the a grains, then the new j8 
phase will only have a low energy coherent or semicoherent interface with one of the a 
grains. With the other grain (or grains, if the nucleation site were a three grain edge or 
a four grain corner) the interface will be a high energy incoherent interface. This 
distinction between the two types of interface for grain boundary precipitate is not only 
important for nucleation but is also critical for the subsequent growth of the precipitates, 
§ 2.2, 2.2.3, 2.2.4 and 2.6. In some cases, e.g., the grain boundary “allotriomorphs” or the 
two-phase, pearlite, structure (§2.5.1]) in Fe-C alloys, growth is into the grain with 
which there is no orientation relationship. Here, the interface is high energy but fully 



Fig. 14. The equilibrium shape for a precipitate “cube-cube" oriented with the upper grain, with a small value 
of the coherent energy, <r coh , and a larger value of the incoherent energy, cr incoh (after Doherty [1982]). 
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mobile for a reconstructive interface reaction. In other cases the growth is into the grain 
with the good fit orientation relationship giving anisotropic energy and mobility and the 
possibility of some type of displacive interface reaction, §2.6. This type of growth 
usually produces Widmanstatten precipitate shapes: plates, laths or needles. 

2.1.5. Experimental studies of nucleation 

Despite the veiy extensive range of experimental investigations on precipitation 
processes in metals, as described, for example, by Russell and Aaronson [1978], there 
are only a few investigations that have attempted to test the various theories of nucleation 
quantitatively. As previously mentioned, however, there are numerous experimental 
observations that are in agreement, at least in outline, with the main predictions of the 
theories. Amongst the successful qualitative observations are the following: 

(i) There is almost invariably an orientation relationship between the a crystalline 
precipitate and the parent crystal lattice, and this orientation relationship is apparently 
always one that allows for good atomic matching between the two crystals, so that the 
interfacial energy will be minimized at least on certain planes. 

(ii) At high supersaturations it is commonly found that a metastable precipitate structure 
is formed rather than the stable phase. In every case where this has been studied, the 
metastable phase has better atomic matching with the matrix than the stable phase has. This 
has been discussed already for the aluminium-silver system but is perhaps better known for 
the aluminium-copper system (fig. 15). The phases that are precipitated before the equilib- 
rium 0-phase, CuA 1 2 , include the disc-like GP zones and the tetragonal 0'. The GP zones 
are fully coherent and the 0' is fully coherent with the aluminium matrix on the habit 
plane where the basal plane of the precipitate matches the {100} planes of the matrix. 
This type of pattern of precipitation is found in a wide range of metallic and nonmetallic 
solid-state precipitation reactions, as reviewed by Russell and Aaronson [1978], 

(iii) Owing to the strong barrier to homogeneous nucleation provided by the interfacial 
energy and the potency of many heterogeneous nucleation sites in solids, homogeneous 
nucleation is only found with either extremely high free-energy driving forces or with 
precipitates with very low values of the interfacial energy. Typical examples of homogen- 
eous nucleation are found for the precipitation of GP zones or intermediate phases which 
have the same crystal structure as the matrix. In addition to the many different GP zones 
formed in precipitation-hardened aluminium alloys (Lorimer [1978]), other examples are 
fee cobalt precipitated in copper-cobalt alloys (Servi and Turnbull [1966], LeGoues 
and Aaronson [1984], Wendt and Haasen [1985] Al-Kassab [1987]), ordered y'fcc 
precipitates of Ni 3 Al in nickel-aluminium alloys (Kirkwood [1970], Hirata and 
Kirkwood [1977] and Wendt and Haasen [1983]) and ordered Cu 4 Ti in copper- 
titanium alloys (von Alvensleben and Wagner [1984]). These three systems will be 
discussed in more detail here, since they provided quantitative tests of the kinetic theory 
of homogeneous nucleation. Further discussion of these reactions is given in §2.4.2, 
where the systems provide excellent tests of models of the coarsening reaction. 

(iv) As described in §2.1.4., the variation of heterogeneous nucleation sites with 
supersaturation seems also to be in good agreement with the predictions of the classical 
theory of heterogeneous nucleation, but with the exception of Aaronson’s investigation of 


References: p. 1497. 



1390 


R. D. Doherty 


Ch. 15, §2 


° Af 
• Cu 


( too ). 


( 00 /)) 
( 0 / 0 ) 
{ 100 ) 
( 00 /) 

,^^( 0 / 0 ) 


Incoherent 
with a 


( 00/ ) Coherent 

{010)\ Semicoherent 
( lOOminterfacial 

) dislocations) 

0,1821 


{001 ) Coherent 
{010)\coherent {large 


{100) ( elastic strain ) 



0.404 nm 



0.404 nm 



G P zone 


Fig. 15. The atomic fit between fee aluminium and the equilibrium phase 6 and the various intermetallic phases 
O’ and 8" and the GP zones in Al-Cu system (after Hornbogen [1967]). 


grain-boundary nucleation (Aaronson [1979]) there seems to have been almost no published 
quantitative tests of the theory for grain-boundary nucleation or for nucleation on dislocations. 

The classic test of homogeneous nucleation theory in metallic solids was provided by 
Servi and Turnbull [1966], They studied the precipitation of fee cobalt from copper 
by an in-situ technique: electrical resistivity measurements on supersaturated copper- 
cobalt alloys that had been directly quenched to the precipitation temperature after 
solution treatment. The overall kinetics of the reaction, observed by electrical resistivity 
changes, were close to those expected for a nucleation-and-growth reaction with nuclei 
forming only in the earliest stages of the reaction. The bulk of the transformation 
appeared to show diffusion-controlled growth of a near-constant number of nuclei. Servi 
and Turnbull reported however that the agreement with the predictions of this model was 
not exact. As described in various reviews (for example, Turnbull [1956]), the fraction 
J{x) of the supersaturated solute that remains in the solution during diffusion-controlled 
growth with a constant number of growth centres is given by*: 

f(x) = \- exp(-//r)\ (22) 


* It should be noted that the use of eq. 22 is only an approximation for a precipitation reaction. This is 
discussed further in § 2.4. 
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the exponent, n, is expected to be 3/2 and the time constant, r, is determined by the 
diffusion coefficient, phase diagram solubilities and the density of the growing nuclei. 

Over a range of cobalt contents, from 1.0wt% to 2.7wt% Co, the observed value of 
the exponent n fell from 1.95 to 0.97. The investigators ignored this discrepancy and 
calculated the density of nuclei in each case from the measured time constant, r. From 
these calculated precipitate densities, which were directly confirmed by electron 
microscopy (Tanner and Servi [1966]), Servi and Turnbull were able to determine 
the initial rates of nucleation. They were then able to show that these experimental 
nucleation rates agreed rather closely with the predictions of steady-state nucleation 
theory [eq. (12)]. The observed apparent interfacial energy was found to be 0.20 J/m“ 2 
which compared very closely to a calculated value, based on the quasi-chemical 
bonding model, of 0.23 J/m~ 2 . They reported a difference in the calculated and 
observed pre-exponential terms in eq. (12) of a factor of 100, the observed value being 
smaller than the calculated value. Two corrections to this have been found — a mis- 
placed v in Servi and Turnbull’s equation 2 and the incorrect insertion of the atomic 
fraction of cobalt in the value of N v , eq. (11), (Russell [1970]). The effect of these 
corrections is that the steady-state theory apparently underestimated the rate of 
nucleation by a factor of about 10 5 . LeGoues and Aaronson [1984] studied the 
nucleation rate in Cu-Co by conventional TEM observations. They showed that their 
data closely matched that of Servi and Turnbull but they claimed to have largely 
eliminated the significant error found by Servi and Turnbull by a recalculation of the 
solution thermodynamics and as a result reasonable agreement with classical nucleation 
theory, eq. (12a) was reported. Wendt and Haasen [1985] studied nucleation in the 
same system but at much higher supersaturations using the ultra high resolution 
technique of field ion microscopy with atom probe chemical analysis (FIM-AP). They 
reported much lower rates of nucleation (by about 10 s ) than predicted by the classic 
equations but they noted that at the supersaturation they used, the critical size of the 
nucleus was only about 0.2 nm and AF* was only about kT so that, in such circum- 
stances, the use of macroscopic parameters such as the interfacial energy is unlikely 
to be valid. Haasen and Wagner [1992] have described a new cluster dynamics 
model for simulating nucleation in Cu-Co in which even at the supersaturations of the 
early work, comparable to that of Servi and Turnbull, it was found that the standard 
homogeneous nucleation theory significantly overestimated the total nucleation density 
expected. However the discrepancy found was only a factor of 30, which for 
nucleation theory is not a very significant error! It will be of great interest to see the 
results from future applications of this cluster dynamics method to the modelling of 
nucleation, since a true atomic model for the formation of critical sized clusters will 
avoid the limitation of using macroscopic thermodynamics for a process occurring on 
an atomic scale. 

Several tests of nucleation theory have been carried out in the nickel-aluminium 
system. These include FTM at very high resolution but with low statistical accuracy by 
Wendt and Haasen [1983], small angle neutron scattering studies on the same samples 
by Beddoe etal. [1984] and high resolution TEM studies by Xiao and Haasen [1991]. 
These studies took into account the potential loss of precipitates that have nucleated, by 
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the process of Ostwald ripening (§ 2.4.2) and the data were reviewed in detail by Haasen 
and Wagner [1992]. The earlier studies, whose results are described in §2.4.2 as 
figs. 42 and 43, were essentially all obtained when nucleation was complete and the 
precipitate densities were falling due to coarsening. Kampmann and Wagner [1984] 
(see Haasen and Wagner [1992] for details) discussed these results using the model of 
Langer and Schwartz [1980] which combined nucleation, growth and coarsening and 
they also developed an new explicit numerical model for analysing the combined 
reaction, a model based on eq. (12a) for nucleation. A very reasonable match to the 
experimentally observed particle densities was obtained with fitted values of the 
interfacial energy and diffusion coefficient that were also very reasonable (a of 0.016 
J/nf 2 , and D of 5x 10~ 21 mV at 823 K). 

More recent studies by Xiao and Haasen [1991] gave insight into the earliest stages 
of precipitation in Ni-Al using high resolution TEM. Their study was for a lower alloy 
composition, 12at% Al, and at a lower temperature, 773 K. They observed very small 
(r~ 1 nm) ordered regions that appeared to remain essentially unchanged for a few 
hours until replaced by larger (r> 2 nm) steadily -growing precipitates. The latter 
achieving a maximum density of 10 24 m -3 after about 40 hours. The small particles were 
assumed to be sub-critical sized “embryos” that should be continuously forming and 
dissolving while the large particles were the stable, super-critical, precipitates. Xiao and 
Haasen [1991] thus appear to be the first to report direct observations on embryos 
whose predicted existance is essential for the nucleation model. So their direct sighting 
of these embryos was a rather important observation! The mean radius of the later and 
larger supercritical precipitates remained essentially constant while their density 
increased. At longer times, />40 hours, the density fell and the mean radius of the 
precipitates, raised to the power 3, increased linearly with time. These growth kinetics 
are fully characteristic of interfacial energy driven coarsening, §2.4.2. For the time 
period from 2 to 20 hours while the density of supercritical precipitates was increasing, 
Xiao and Haasen were able to measure directly a steady state nucleation rate, If of 
eq. (12a) of 8x 10 19 nuclei irfV, close to the value of 1 x 10 19 nfV predicted for the 
values of a (0.014 J/rr T 2 ) and D of (1.6x 10" 21 m 2 s _1 at 773 K) obtained from analysis 
of the coarsening data. The value of interfacial energy gives a predicted critical radius 
r* of 1 .7 nm matching quite closely the apparent critical radius that divided the ordered 
solute rich embryos from the growing precipitates. This agreement is very satisfactory 
— however the extreme sensitivity of nucleation rate to interfacial energy in eq. 12 
suggests that predicting a rate matching the theory to better than an order of magnitude 
error is not very likely. 

Haasen and Wagner [1992] discussed the application of the numerical modelling 
technique of Kampmann and Wagner [1984] to the data on nucleation in Cu-1.9at% Ti 
and as in the analysis of the earlier Ni-12at% Al data they managed to get a very good 
fit to the data on nucleation and coarsening. For Cu-Ti, the fitted values of the interfacial 
energy and the diffusion coefficient were 0.067-0.07 J/m -2 and 2.5 x 10~ 19 mV at 
623 K. 

An earlier successful test of nucleation theory was reported by Strutt [1974] for 
nucleation of a coherent metastable phase in the LiF-Mg/ 2 ionic system studied by ionic 
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conductivity. Russell [1970] described a similar successful test of nucleation theory, in 
a precipitation reaction in an oxide glass system. The theory of homogeneous nucleation 
has also been well tested for metallic solidification (chapter 8, §4.3; see also Cantor 
and Doherty [1979]). It would appear that despite early reports, for example by Heady 
and Cahn [1973], of an apparent marked failure of the theory in a case of liquid 
unmixing, the model of homogeneous nucleation is in essentially very good shape 
quantitatively as well as qualitatively as discussed earlier. Langer and Schwartz 
[1980] were able to show that the basic theory of nucleation remained valid for liquid 
unmixing. In the unmixing experiments in liquids close to the unmixing temperature the 
experimental problem appeared to be that of actually observing an initial nucleation 
rather than the onset of a significant amount of unmixing. Observable unmixing, the 
onset of “cloudiness”, involves diffusion close to the unmixing temperature and this is an 
inherently slow process. Optical scattering only becomes significant for droplets sized 
and/or spaced apart by distances of the order of the wavelength of light, 0.5 /xm. 

The only reports, known to the author of this chapter, on the quantitative testing of 
the theory of heterogeneous nucleation are those of Lange [1979] as presented by 
Aaronson [1979] and of Plichta etal. [1980], The first investigation was of the grain- 
boundary nucleation rate of ferrite from austenite in high-purity iron-carbon alloys. 
Considerable care was taken to distinguish grain-edge from grain-boundary nucleation: 
only data for nucleation on the boundaries between two austenite grains were considered. 
In order to make a comparison with the experimental results, an equilibrium shape for 
the ferrite nucleus on the grain boundary was needed. The only model that was at all 
successful was a pill-box nucleus shape for ferrite on the boundary between two austenite 
grains, where all the interfaces were partially or fully coherent. It was reported that all 
other nucleus shapes predicted nucleation rates orders of magnitude smaller than the rates 
observed. As Aaronson has pointed out, a pillbox model requires the habit plane of the 
precipitate to be accurately parallel to the appropriate plane in both the austenite grains, 
a rather unlikely situation. However, since austenite is unstable in the iron-carbon 
system, room-temperature crystallographic studies to test this hypothesis could not be 
carried out. 

The other study on heterogeneous nucleation kinetics was the observation by Plichta 
etal. [1980] of nucleation of the massive transformation in two Ti alloys. In the massive 
transformation , discussed further in § 2.2, the driving force for nucleation is determined 
from the under-cooling, A T, below the temperature, usually described as T a , where two 
phases with the same composition have the same free energy. As in the earlier study in 
iron, the spherical cap model underestimated the observed nucleation rates by, in this 
case, thousands of orders of magnitude! Agreement could again be achieved only by use 
of the pillbox model with a low-energy coherent interface, again with only a small 
fraction of potential grain-boundary nucleation sites appearing to be active. 

Two ideas seem to follow from these two sets of very interesting results. Firstly, 
there is the need for experiments on a system where the orientations of both phases can 
be studied after precipitation in a system with reliable thermodynamic data, for example 
precipitation in a dilute alloy. Secondly, there is the possibility that heterogeneous 
nucleation at grain boundaries may not depend on the general grain-boundary structure 
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but may occur mainly at grain-boundary defects such as the intrinsic and extrinsic 
dislocations known to exist in metallic grain boundaries. It has been reported, for 
example by Desalos etal. [1981], that after controlled rolling of low-alloy steels there 
is an enhanced density of ferrite grains nucleating on the austenite grain boundaries 
(fig. 16). This strongly indicates that nucleation is enhanced by dislocations absorbed on 
austenite grain boundaries. 

Plichta and Aaronson [1980] attempted to provide an answer to the first problem 
by observing the orientation relationships for boundary-nucleated massive precipitates in 
Ag-26at% A1 which has the same bcc-hcp pair of phases as in the Ti alloys studied. 
Plichta and Aaronson showed that 46 out of 47 boundary precipitates did indeed have the 
“Burgers” relationship, (0001) //{ 110], [1120] // <11 1>, with one of the two matrix 
grains, but rarely with both. They were able to provide evidence that for about half of 
the non-Burgers-related orientations some indication of good atomic fit was possible. 
Unfortunately no nucleation-rate experiments were carried out in the crystallographically 
investigated system. 

Additional evidence for the failure of the classical model of boundary nucleation has 
been described by Butler and Swann [1976]. They measured, by TEM, the density of 
precipitates heterogeneously nucleated on grain boundaries in Al-Mg-Zn alloys. They 
studied precipitates forming under higher driving forces, giving much higher precipitate 
densities than studied in the observations of Lange [1979] and Plichta etal. [1980], 
Butler and Swann showed direct evidence for preferential nucleation on defects at boun- 
daries, notably at grain-boundary steps (fig. 17) and at dislocations in low-angle 
boundaries. It is therefore rather likely, in experiments such as those described by Lange 
[1979] and Plichta etal. [1980]), that the precipitates seen may indeed have come from 
defects on the grain boundaries and that this could account for the high rates of nucleat- 
ion seen. Other aspects of Butler and Swann’s observations were in qualitative agreement 
with the ideas of Lee and Aaronson [1975]. Butler and Swann showed that the density 
of precipitates varied from boundary to boundary and along an individual boundary when 
the boundary plane varied. The latter observation appears to be due to the change of 
angle <f> between the boundary plane and the precipitate habit plane, fig. 13. In some 
cases, as the boundary plane varied a different precipitate orientation was found. In most 
TEM studies of grain-boundary precipitation, as for example that of Williams and 
Edington [1976], the precipitates at a boundary are always found to be one of the vari- 
ants of the precipitates nucleated inside one of the two grains that join at the boundary. 

In the absence of additional evidence, particularly on the role of grain-boundary 



Fig. 16. Observed rate of nucleation, after 5% transformation of y to a in a low-carbon steel, with no 
deformation and with two levels of prior hot deformation in the austenite (after DESALOS etal. [1981]). 
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Fig. 17. Precipitation on a grain boundary in Al-Zn-Mg: (a) Grain-boundary ledges, arrowed, and dislocations 
in the as-quenched alloy; (b) the same area after 75 s ageing at 240°C, with preferential nucleation on the ledge 
and on some dislocations. (From Butler and Swann [1976].) 


dislocations, it is probably safest at present to regard the existing models of heterogen- 
eous nucleation as being not yet fully established. However, the analysis of Lee and 
Aaronson seems likely to be valid for general boundary nucleation but to need revision 
to take into account the higher level of heterogeneities on the grain boundary itself. 
These heterogeneities probably will dominate the reaction at low driving forces where 
nucleation is studied by optical microscopy. 

In addition, there is a tremendous lack of reliable interfacial-energy information 
needed to make use of the theory (Hondros [1978]). Butler and Swann [1976], for 
example, report evidence that higher solution-treatment temperatures that will reduce 
segregation of impurities and so raise the boundary energy, caused an increase of 
nucleation rate. Such qualitative ideas, though valuable, do not of course substitute for 
quantitative evaluation of the interfacial energy under various conditions. 

2.2. Growth processes 

A critical distinction in solid-state, growth processes, where the two crystals have 
different crystal structures, arises from the presence or absence of a good atomic fit 
across the interface. If there is no good atomic fit, as is expected, for example, for one 
at least of the interfaces for /3 nucleated at aa grain boundaries, then rather easy atom 
transfer can be expected across the incoherent interface by the thermally activated jump 
process described in the following section. This type of interfacial growth process 
appears very similar to the movement of high-angle grain boundaries in recrystallization 
and grain growth and can be readily classified as a civilian or reconstructive process. In 
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solid-state phase transformations, examples include the “massive” reaction (a polymor- 
phous change of crystal structure at a constant composition), the growth of grain- 
boundary nucleated “allotriomorphs”, the growth of most eutectoidal two-phase structures 
such as “pearlite” in Fe-C alloys and the process of discontinuous precipitation, all of 
these reactions appear to nucleate at the parent grain boundaries. 

A good atomic fit across an interface of a growing precipitate (i.e., a coherent interface) 
is, however, strongly promoted by the need to minimize the interfacial energy barrier to 
nucleation, §2.1. The growth of such a coherent /3 precipitate into an a matrix with a 
different crystal structure faces a significant barrier discussed in detail in § 2.2.3 and 2.2.4. 
The growth barrier arises from the difficulty of finding atomic sites at the good-fit interface 
for atoms to leave one structure and join the other. Transfer apparently can only occur at 
growth steps or ledges that, in most cases, appear to have a dislocation character. The 
forward motion of the good-fit coherent interface appears to require the nucleation of these 
growth ledges and their lateral motion by a process that, at least in some cases, is equivalent 
to that of dislocation glide. In the case of the fcc/cph interface, particularly, as studied in 
Ag 2 Al precipitation from silver-supersaturated aluminium solid solutions (Howe et al. 
[1985]), this is known to be the operating mechanism. Such a mechanism is truly that of a 
military or displacive transformation, one of whose identifying characteristics is the 
production of a shape change seen as surface relief of an invariant plane strain type, as 
has been reported for Al-Ag by Liu and Aaronson [1970]. The mechanism of this type 
of growth reaction in transformation where there is a composition change and observable 
surface relief has become perhaps the most contentious area in diffusional phase 
transformations and was the subject of two recent conferences: International 
Conference on Bainite [1990] and Role of Shear and Diffusion in Plate Shaped 
Transformation Products [1994], The dispute will be described in §2.6. Here it is 
sufficient to note that the growth of coherent precipitates is very common and occurs in 
almost all precipitation reactions where the precipitate grow in the grains in which they 
were nucleated, forming plate-like, lath-like or needle-like Widmanstatten precipitates. A 
plate-like precipitate is one whose three dimensions, a, b and c, have a = b » c, for a lath 
a>b>c and for a needle a~»b = c. It appears to be generally agreed that the coherent 
plate and lath precipitates have growth ledges and often have shape changes, but it is not 
agreed if all, some or only a few, grow by a displacive mechanism. There are only a few 
reports, see § 2.6, of the presence or absence of surface relief effects for the majority of 
Widmanstatten plate-like precipitates for example those that form in precipitation- 
hardening aluminium and copper alloys. For needle-like precipitates, for example the fee 
Cu rods in bcc Fe whose coarsening was studied by Speich and Oriani [1965] there 
appears to have been no reports of either growth ledges or surface relief shape changes. 

2.2.1. Growth without change of composition 

When the growing precipitate phase and the matrix have the same composition, then 
the growth process merely requires atom transfer across the interface. The reconstructive 
mechanism for this is equivalent to that seen in the movement of a high-angle grain 
boundary in pure metals (ch. 28, § 3.4). One complication in polymoiphous phase 
changes, as with grain boundary migration, is the possibility of solute drag (Cahn [1963] 
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and LCcke and Stuwe [1963]). The solute drag phenomenon arises if a trace of an alloy 
addition is adsorbed onto the interface between the growing and shrinking phases. The 
rate of motion of the interface can then be greatly retarded unless the reaction is 
sufficiently fast for the interface to “escape” from the solute atmosphere. Solute drag on 
grain boundaries slows down, but does not stop, boundary migration. For grain-boundary 
movement it is difficult to acquire a sufficient driving force for escape from the solute, 
but in solid-state phase changes the driving force is likely to be sufficiently large for 
solute drag not to be a significant problem in many situations, for example with 
supercoolings of more than about 50 K [eq. (4)]. See § 2.6, however, for a discussion on 
the possible role of solute drag in halting bainite growth above a critical temperature in 
some alloy steels. 

For a phase transformation where the growing a crystal is a different phase from the 
/3 matrix, a new situation arises which is not present when the two crystals are the same 
phase, as in grain-boundary migration. The distinction is that any impurity will almost 
always prefer to concentrate in one or other of the two phases, that is, the partition 
coefficient, 

kap = CjCp, (23) 

will not be unity. This is illustrated by fig. 18 which shows a phase transformation in a 
dilute binary alloy where the solute segregates to the high-temperature /3-phase and 
Kp < 1 • 

The ffee-energy-composition curve at a temperature, T 0 , below the equilibrium 
transformation temperature for the pure matrix, T e , is seen in fig. 18b, and at this 
temperature a critical composition is indicated. For an alloy with less solute than 
this critical concentration, the /3-phase can transform directly, with a reduction of free 
energy, to the a-phase without any change of composition. This situation occurs through- 



Fig. 18. (a) Phase diagram showing equilibrium below an equilibrium transition temperature, T c ; (b) the free- 
energy-composition curve to show the possibility of a diffusionless transformation, /3 to a, in region II. In 
region I only a reaction to give solute-depleted a can lower the free energy. 
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out region II in fig. 18. It is clear that a diffusionless, or better a partitionless, transfor- 
mation to a of the same composition is not the most stable product in region II, but is 
a metastable product. The metastable reaction is only expected if it can occur significant- 
ly faster than the formation of the most stable product, solute-depleted a of composition 
C a . Since partitionless transformation does not require long-range solute transport, it will 
generally occur faster than any reaction requiring long-range diffusion (§ 2.2.2). In region 
I of the two-phase region, formation of solute-depleted a is the only possible reaction. 
At temperatures and at compositions in the a-phase below the two-phase field, the 
partitionless product of homogeneous a of the initial alloy composition is both the fastest 
forming and also the most stable product. Partitionless transformation is known to occur 
in the class of transformations described as massive, after the shapes of the product 
grains often found (Massalski [1970]). Since no material can ever be absolutely pure, 
any polymorphous reaction, if it is to occur without long-range diffusion of the impurity, 
must also be treated as an example of a massive transformation. In the following simple 
analysis it is assumed that the product and parent phases have the same composition and 
moreover that there is no solute build-up or depletion at the interface, in other words 
there is no solute drag effect considered. 

Figure 19 shows the expected variation of free energy per atom across an afi 
interface below a transition temperature, when the low-temperature a-phase is more 
stable. There is a difference in free energy per atom, A F^, between the two phases and 
an activation energy barrier, A F„, to atomic jumping across the interface. In the treatment 
given here, it is assumed that the interface is incoherent or at least has no atomic 
matching, as in most high angle grain boundaries. This assumption has two important 
consequences: Firstly that the barrier to atomic transport is likely to be of the order of 
the activation energy for grain-boundary diffusion rather than the higher activation energy 
for bulk diffusion. The second consequence is that there should be no difficulty for atoms 



Distance nor mot to the boundary 

Fig. 19. Thermally activated boundary motion; there is an activation barrier A F a for atoms to cross the 
boundary and a driving force, AFJ^ per atom, for growth of a at the expense of {}. 
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to find a site in either the a or /3 phase for them to “join” after crossing the interface. 
Situations where this is may not be the case are discussed in § 2.2.3. 

The fluxes of atoms, per atom site, from /3 to a, V and the reverse flux, J af3 , are 
given by: 

Jp a = Ac exp(-A FjkT). (24) 

ip = Ac exp{-(AF a + A Jj.J/ftr}, (25) 

where v is the atomic vibration frequency of atoms at the interface, typically about 
10 13 s' 1 , and A ac is the accommodation coefficient. is the probability that an atom, 
having left one of the crystals, can find a site in the other crystal across the interface. A ac 
for an incoherent interface is usually assumed to be a constant of the order of unity . The 
net flux of atoms is given by the difference of the two fluxes, and J a/3 : 

•^tt(nct) = — J a p 

= Ac exp(-A FjkT), [l - exp(-A /£/«’)] 

For small undercoolings, where A FJ)<zkT, the difference term in eq. (26) becomes 
A Ffa/kT. The velocity of the interface, v, is given as the product of the net flux per atom 
site and the displacement per atom, which is the atomic spacing, b, in the growing 
a -phase: 

v = A, (net) = M afl AJ£. (27) 

The interfacial mobility is M af3 and is determined from eqs. (26) and (27) as: 

Kp = ( bvAjkT ) exp(-Af£/W) . (28) 

There are alternative definitions of interfacial mobility based on the replacement of the 
free-energy difference, A F£, in eq. (27) by either a temperature difference, AT;, an 
interface supercooling, or a composition difference, AC,, in a phase change involving 
composition changes (§ 2.2.2). 

The simple linear relationship between the interface velocity and the driving force 
[eq. (27)] fails if the accommodation coefficient, A ac , is not constant. This may occur for 
certain facetted interfaces where there is a difficulty in starting a new atomic layer. This 
is found in solidification, for materials with high-entropy of freezing. For solid-state 
transformations the same situation may arise for coherent interfaces between phases of 
different structure or orientation (§ 2.2.3). In both cases, growth occurs only at growth 
ledges, §2.2.3, and since the density of these ledges will vary, for example with 
undercooling, then the linear relationship of eq. (27), which requires a constant mobility, 
is not expected. This problem of the general form of growth kinetics of facetted 
interfaces has been little addressed analytically in solid-state reactions. In solidification, 
A ac and thus the mobility are found to vary linearly with undercooling in systems 
growing by use of a screw dislocation spiral. The expected growth velocity then 
increases with the square of the undercooling. The mobility of solid-state ledged 
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interfaces then requires a knowledge of ledge spacings and thus models of ledge 
nucleation frequencies. The nucleation of growth ledges is currently, however, not 
understood at all, as described by Doherty and Rajab [1989], see §2.2.4. 

2.2.2. Transformations involving long-range diffusion 

In region I of the phase diagram in fig. 18, the formation of the low-temperature 
phase requires the diffusion of solute away from the low-solute a-phase. On the other 
hand, for the typical precipitation reaction, such as the growth of solute-rich (3 from 
supersaturated a (fig. 2), the reaction proceeds by the diffusion of solute towards the 
growing (3 -precipitates. The two situations are shown in fig. 20. The composition of the 
matrix phase at the interface is C ; , which is shown as lying somewhere between the 
equilibrium value, C p for the polymorphous reaction, or C a for the precipitation reaction, 
and the alloy composition, C 0 , which is usually the bulk matrix composition at some 
distance from the new phase. Growth of a spherical, product phase of radius r, is 
determined by the composition gradient outside the precipitate which is a function of the 
radial distance R and the time, t, since the nucleation of the precipitate. The total flux, 
J R , of solute atoms (in atoms/ s" 1 ) across the surface of a sphere of radius R ( R>r , the 
radius of the precipitate) is given by: 

J R = -4TTR 2 D(dC'/dR) K (29) 

The concentration, C, must be in the units of atoms of solute per unit volume. The 
relationship between C and the atomic fraction of solute C is: 



Fig. 20. Diffusion around a growing precipitate. In the matrix phase-change (a and d), solute is diffusion away 
from the low-solute a, while in (b) and (e) solute is diffusing towards the solute-rich /3. 
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C' = C/K (30) 

so 

J R = -4nR 2 D(dC/dR) R /V 3 (29a) 

V a is the volume of one atom of the alloy, which for simplicity is assumed independent 
of composition. The volume of the precipitate is jirP, and developing the model for the 
case of precipitation, the increase in the number of solute atoms, A n B , in the precipitate 
above that of the matrix from which it forms, is given by: 

A « B = (|w 3 )(q -Cf) = (firr 3 /V a )(C fi - Q) (31) 


dknjdr = 4w 2 (C fi - Q)/V a 

The total flux to the precipitate, d A« B /d/ is then given by: 

dA« B _ d« B dr = 47 tt 2 _ c ,dr 

dt dr d t V, l> dt 


(32) 


(33) 


This flux will be that provided by diffusion [eq. (29a)], with R = r. The supersaturation 
for diffusion is usefully defined as fl D , eq. (34). 

= (C 0 - C ; )/(^ - Q. (34) 

Flux-balance from eqs. (29a) and (33) then gives: 

4wR 2 D(dC/dR) R = 4t7t 2 (C (3 - Q) (dr/d/) (35) 


The change of sign in the flux equation (29) arises since the negative flux is one in the 
-R direction, which is a positive flux for precipitate growth. Provided that there is a low 
supersaturation, fi D <0.1, then during the time that it takes a solute atom to diffuse across 
the solute-depleted region around the precipitate, the solute distribution and the instant- 
aneous growth rate, dr/dt, can both be regarded as essentially constant. Rearranging 
eq. (35) for the condition that H D <0.1, yields: 


d R = DdC 

R 2 ” r 2 (c, -C,) [dr/dt ] ' 


(36) 


Equation (36) can then be integrated, from R = r, C-C iy to R = °°, C=C 0 , assuming that 
{ t^iCp - C ; ) (dr/dt ) } is essentially constant in the time it takes solute to diffuse from R > r 
upto R = r. Integration of eq. (36), under these conditions, yields: 


i DjC'-Q) 

r r 2 (c^ -C) (dr/dt ) ' 


(37) 


Equation (37) can now be rearranged to give the growth rate, dr/dt: 
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d r /d,.°i C “- C ‘>^. (38) 

'(C.-C,) r 

The growth rate, dr/d/, clearly does vary with the precipitate radius, r, (eq. 38) but at 
small supersaturations this can be ignored for the small change in radius that occurs 
during the time period in which a solute atom moves across the solute depleted region. 
This simplifying assumption is known as the constant size approximation. 

The growth rate can also given by a relationship derived from the local departure 
from solute equilibrium (Q-Q), by use of a slightly modified definition of mobility 
M a p rather than M a? in eq. (28)*. The velocity is then given by: 

<39) 

The two equations for the growth rate, eq. (38) and eq. (39), can be equated and 
rearranged to determine a dimensionless parameter, rM ap '/D, which determines the 
unknown composition, the interface composition in the a matrix, Q 


The dimensionless parameter, rM aP ' /D is the compositional equivalent to the Nusselt 
number in heat flow. When rM a ^D > 1, then C,—>C a , giving the condition of local 
equilibrium (Hillert [1975]). This condition also gives what is called diffusion- 
controlled growth. Diffusion-controlled growth arises when the interface reaction is so 
fast that any further increase of mobility does not significantly accelerate the overall 
reaction. The opposite extreme, when rM a ^/D<. 1, leads to C\ — > C 0 . This arises when the 
solute diffusion is so rapid and atom transport across the interface so slow, that the 
solute-depleted region around the precipitate is effectively eliminated by rapid diffusion. 
We then have the condition described as interface-controlled growth. In either extreme 
case, the composition difference driving growth, C B -C a , is applied fully to the slowest, 
most resistive step, either the diffusion in the matrix or the interface reaction. The 
intermediate case, where rM a p'/D ~ 1 is usually described as one of mixed control, where 
the composition difference driving the reaction is applied to both processes. Here, an 
increase of either the diffusion coefficient or the mobility will accelerate the reaction. 

Diffusion control. With local equilibrium, C i = C a , eq. (38) becomes: 


dr/d/ = 


D (C, ~ C„) 


= flD/r 


(41) 


* There is a straightforward relationship between the two mobilities, which can be obtained by relating the local 
departure from solute equilibrium at the interface, Q-Q, to the free energy per atom (KL in fig. 1), this is: 
kT {(^-Q/(C r CJi ln(C/CJ 
» kT (C r CyC a when (C-C a )/C a « 1. 
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where fl is the dimensionless supersaturation, (C 0 -C a )/(C fi -C a ). fl is also, by the lever 
rule of phase equilibria, the atomic fraction of the (3 precipitate phase at equilibrium. If, 
as was assumed above, that the volume per atom is constant then fl is also the volume 
fraction of f3. It is easy to use eq. (41) to demonstrate the conditions that allowed the 
simple integration of eq. (36), that is that r*( dr/dt ) is essentially constant over the time 
that solute diffuses across the solute depleted region. With Cj = C a , and using eq. (41), 
eq. (35) yields the concentration profile in the matrix for R > r as: 


C = C 0 - 


(Q - C a )r 
R 


Differentiation of this expression shows, at R-r, that the concentration gradient at the 
interface, (dC/d/?) R=r , is given by: 


(dC/d/?) R=r = (C 0 - C a )/r 


(35a) 


when R-2r, C - C a - {(C a - C a )/ 2}. This result means that half the difference in com- 
position in the profile occurs over a distance r from the precipitate interface. The time, 
At, that solute takes to diffuse over a distance r, when the diffusion coefficient is D, is 
given by: 

At * r 2 /D. (41b) 

With dr/dt = £)fl/r, during a time At, the radius will grow by: 

A r ~ At(dr/dt) = fir. 

So for fl < 0. 1 there is near constancy of r, and thus of dr/ dt, during the time that solute 
traverses the composition profile. This result validates the assumptions used in the 
integration of eq. 36. 

Equation (41) can be integrated from t = 0 when r = 0, to give the square-root of time 
relation expected for diffusion control: 

r - (2flDt)' /2 . (42) 

Interface control. From eq. (39) with C, - C 0 , we obtain the linear relationship: 

r = (43) 

It should be clear from the role of the parameter rM ae '/D in controlling the reaction, that 
with r~ 0 immediately after nucleation, the situation should always tend towards interface 
control. This may only be detectable, however, for very small values of the critical radius of 
nucleation, r* [eq. (10a)]. Diffusional growth cannot occur until a critical nucleus has formed 
by thermal fluctuation. Even after nucleation, precipitates close to the critical radius will 
have their growth conditions modified by interfacial effects, with the solubility Ca 
increased by the Gibbs-Thomson equation (§2.2.6). The only conditions for which 
interface control, eq. (43), or even mixed control can be realistically expected is for very 
low values of mobility. This can occur under conditions of faceted or ledged growth 
(§2.2.3). 
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Diffusion control — General. More general solutions for diffusion-controlled growth 
of both spheres and flat plates have been provided by Aaron etal. [1970], including the 
exact solution for the general case without the approximations involved in eq. (35). For 
a sphere the radius is given by: 

r = (aDtf 12 . (44) 

The dimensionless coefficient a is defined by: 

ft = (a/2j exp ajexp(-a) - (an) in erf (a 1/2 )]. (45) 

The approximate relationship of eq. (42) is valid with a low supersaturation fl <0.1, for 
which a ~ 2ft. This is the usual situation for many precipitation reactions. For the growth 
of plates, the half-thickness a K is given by: 

a K = 2 a'(Dt)' 12 (46) 

and 

ft = a'7r 1/2 exp(a' 2 /2) erfc (a'/2). (47) 

For oblate and prolate spheroids, that are reasonable approximations to the frequently 

found plate-like and needle-like precipitate shapes. Ham [1958] and Horvay and Cahn 
[1961] have shown that the half-thicknesses, a x , and half-lengths, a y , are given by: 

a x = 2{PDt) y \ (48) 

a y = 2 A(pDt) m , (49) 

The parameter, p, is obtained from the supersaturation, ft, and the aspect ratio A 
(A = fly/a,) by equations of a form similar to eqs. (45) and (47) which cannot be inverted 
to give p as explicit functions of A and ft (Horvay and Cahn [1961]). Horvay and 
Cahn [1961] provide figures showing how p increases with the supersaturation at 
different values of the aspect ratio, A. The equations for p for a particular values of A 
and ft are, of course, readily solved numerically. These equations are different for plate- 
like and needle-like precipitates. The assumptions that lead to eqs. (48) and (49) were 
that the precipitates were formed with a given shape and from then on, growth occurred 
with diffusion control and local equilibrium, all round the precipitate, that is, with 
C, = C a , at all positions on the interface. The analysis then predicts that the shape is 
preserved, that is, the precipitate grows with a constant aspect ratio. 

The prediction of a constant shape was challenged by the successful application of 
stability theory to the growth process (Mullins and Sekerka [1963]; § 2.2.6). However, 
as will be discussed in §2.2.6, the stability criticisms of the Ham, Horvay and Cahn 
analysis do not rule out its usefulness for the highly elongated “Widmanstatten” shapes. 
In fact, the Ham, Horvay and Cahn analysis appears to have very wide validity for the 
growth of Widmanstatten precipitates, except at high ft (ft > 0.5), when linear lengthen- 
ing kinetics, eqs. (59b and 64), as distinct from the square root of time lengthening 
kinetics eq. (49), are found. 
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2.2.3. Role of interface structure in growth processes 

Aaronson [1962] pointed out that for the plate-like Widmanstatten precipitates in 
systems like AI-Ag 2 Al (fig. 9, above), the broad faces of the precipitates have a very 
good atomic fit (§2.1.2). This is because the atomic positions on (0001) hexagonal 
planes are the same as on the {111} fee planes in the matrix. For growth of either phase 
by single-atom transfer to the other structure, a very considerable rise in energy results 
(fig. 21a, b). This continues with a second atom (fig. 21c) and only becomes a reasonable 
structure with either three, five or six atoms added (fig. 2 Id, e). The obvious conclusion 
of this idea is that it will be very difficult for individual atoms to join the precipitate on 
the habit plane: In other words the accommodation coefficient, A x , in eqs. (24) and (25) 
will be close to zero for precipitate thickening, giving very low values of the mobilities, 
M a/S or UJ. 

The solution to this problem appears to be the formation of a ledge or step on the 
broad face of the precipitate. Atoms can then add to the riser of the ledge which will be 
essentially incoherent or at least ill fitting, so the interface moves forward by sideways 
motion of the ledges. The structure of the riser of the ledge is that of the core of a dis- 
location, which for the fcc/cph interface is a Shockley partial dislocation: a/6<112> with 
respect to the fee crystal. When the model of growth by ledge migration was put forward 
by Aaronson [1962] there was little direct evidence to support his suggestion though a 
strong need, from the observed shapes of the Widmanstatten precipitate, for an explanat- 
ion of the very high aspect ratios, A, found; these are often of the order of 100 or more 
(see for example Rajab and Doherty [1989]). Such values of the aspect ratio are very 
much higher than the values of A expected for the equilibrium shapes of the precipitates, 
§2.1.2. Aaronson [1962] and Aaronson etal. [1970] argued that the high aspect ratio 
of the Widmanstatten plates and of needles was due to the need for ledge growth on the 
good fit parts of the precipitate interface. Following the work of Weatherly and 
Sargent [1970] and Weatherly [1971] it became clear that the growth ledges 
postulated by Aaronson do indeed actually occur on the broad faces of Widmanstatten 
precipitates. Furthermore, the ledges have dislocation character and that the good fit areas 
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of these interfaces do often show reduced mobility compared to ill-fitting parts of the 
interface (e.g. Purdy [1978], Rajab and Doherty [1989]). Figure 22 shows the 
dislocation-like contrast shown by the ledges on faceted Mg 2 Si precipitates in an 
aluminium alloy. Howe etal. [1985] directly imaged the dislocation ledges in an fcc/cph 
system by high resolution TEM and showed that the steps occurred in clusters of 1, 2 or 
3 Shockley partial dislocations with 2, 4 or 6 atomic layers per step. 

It was implicitly assumed by Aaronson that ledged interfaces would always show 
interface-controlled or mixed controlled growth, that is, ledged interfaces would be 
inhibited and grow more slowly than expected for diffusion control. This does not follow 
automatically however, as will be discussed in § 2.2.4. 




Fig. 22. Growth ledges on a Mg 2 Si plate-like precipitate seen by dark-field electron microscopy (from 
Weatherly [1971]). 
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The motion of growth ledges, on good-fit interfaces, need not be based on the ledge 
moving by individual random, thermally activated atomic jumps at the growing phase at 
the risers. As was originally suggested by Nicholson and Nutting [1961] and by Hren 
and Thomas [1963] and confirmed by Howe et al. [1985], growth of a hexagonal 
precipitate such as Ag 2 Al from a fee matrix can occur by glide of an a/6<112> dis- 
location on a {111} habit plane (fig. 23). The glide of a single such dislocation will 
cause two atomic layers of the matrix fee structure to transform to the hexagonal 
structure, and to cause shear of the matrix structure. This idea raises a fascinating 
problem, since this type of transformation when it occurs in cobalt and in cobalt-nickel 
alloys (Gaunt and Christian [1959]) is fully martensitic (see ch. 16). However, in 
dilute aluminium-silver alloys it is not possible, thermodynamically, for diffusionless 
(partitionless) precipitation of the hexagonal phase to occur, since the fee solid solution 
does not contain enough silver for a diffusionless transformation. There is, in addition, 
overwhelming experimental evidence that the precipitates are indeed silver-rich compared 
to the solid solution (e.g., Laird and Aaronson [1969], Ferrante and Doherty [1976] 
and Rajab and Doherty [1989]). Further evidence for the dual martensitic (displacive) 
and diffusional character of the fcc-to-hcp transformation in aluminium-silver alloys was 
provided by the observations of Aaronson etal. [1977] on the heterogeneous nucleation 
of the intermediate -/-phase on dislocations. Previously Liu and Aaronson [1970] had 
shown that these precipitates produced, plane-strain, surface relief during precipitation. 
The precipitates were observed to form by the glide of a partial Shockley dislocation, 
away from the other Shockley partial, which together with the first partial had made up 
a perfect lattice dislocation in the supersaturated solid solution before precipitation 
(fig. 24). By assuming that the transformation involved a change of composition, that is, 
that the precipitates were indeed the expected silver-rich metastable -/-phase, the authors 
were able to predict successfully the required undercooling for this type of nucleation to 
occur. It is clear that this reaction has the shear characteristics of a martensitic 
(displacive) reaction, while also involving a diffusive change of composition. This 
combination seems perfectly acceptable for a case where the reaction requires a 
diffusional change of composition but where the interface process can occur by a shear 
process. It is tempting to use the term “bainitic” to describe a transformation that 
requires long-range solute transport by diffusion to give the necessary change in 
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Fig. 24. Al-15 wt% Ag, solution-treated at 500°C, directly quenched to 360°C and held there for 60 s, showing 
the y' plates forming behind moving partial dislocations (from Aaronson etal. [1977]). 

composition ( C a to Cf) but where the change from a to fi crystal structures occurs by a 
displacive reaction. The problem with this name is that it is not yet fully agreed that the 
bainitic transformation in the fcc-to-bcc decomposition of austenitic iron occurs by these 
two processes. Shear by dislocation glide parallel to the habit plane certainly gives the 
invariant-plane strain characteristic of the martensitic transformations (Gaunt and 
Christian [1959]). So even if bainite itself has not been proved to form by the required 
combination of solute diffusion and martensitic like crystal structure change a “bainitic” 
type of reaction certainly does occur in the Al-Al 2 Ag precipitation reaction. 

Irrespective of how the mechanism of ledge movement across the coherent interface 
is described, the occurrence of ledged interfaces (see for example the review by 
Aaronson [1974]) indicates that that this is an important phenomenon and that the 
diffusional mobility of these interfaces will be controlled by the nucleation and growth 
of the ledges (§ 2.2.4). 

It is important to note, however, that growth ledges of any form are only expected for 
coherent interfaces between phases which have different structures and/or orientations. 
The difficulty identified in fig. 21 only arises since the atomic fit is restricted to isolated 
atomic planes. If the two phases have identical structures and orientations as in, for 
example, the so-called “cube-cube” relationship, then there is no difficulty in adding 
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atoms onto the growing phase — in fact there is no interface step at all, since the arrival 
of the required solute at the interface automatically changes the structure from that of the 
matrix to that of the precipitate. Examples of this type of cube-cube relationship, already 
cited in §2.1.2, are y precipitates in /3 phases, in Cu-Zn, Cu-Al or Ag-Zn, Ni,Al 
precipitates in Ni alloys, and Al 3 Sc and Al 3 Li precipitates in A1 alloys. In addition, the 
formation of Guinier-Preston zones, solute rich clusters in heavily solute supersaturated 
alloys, are further examples of the formation of precipitates with the same structure and 
orientation as the matrix from which they grow. As expected, on the basis of the ideas 
above, there are no unambiguous reports of ledge structures at the coherent interfaces 
between any cube-cube matrix-interface pairs. This failure to see the ledges in these 
systems must be considered significant, given the large quantity and the high quality of 
the experimental observations, particularly in the nickel-based alloys. 

2.2.4. Growth of ledged interfaces 

Jones and Trivedi [1971, 1975] analyzed the growth kinetics expected for a ledged 
interface. They considered ledges of height h and spacing A. If the ledges have an 
instantaneous velocity along the interface of v s then the overall normal velocity of the 
interface, Vj is: 

v, = v s h/\. (50) 

Jones and Trivedi analysed the case of an isolated ledge moving with a constant velocity 
and determined the steady state diffusion field that would describe the solute field 
feeding solute to the ledge. They produced a relationship between the dimensionless 
Peclet number, P, (P = v i h/2D) and the dimensionless supersaturation, Cl, for an isolated 
ledge. Their solution, for the case when there is diffusion-controlled atomic addition to 
the riser of the ledge, is: 

Cl = 2Pa(p) (51) 

The function a(P) can be evaluated numerically from data provided by Jones and 
Trivedi [1971], Typical values of P are about 0.03 at Cl =0.1, 0.08 at Cl = 0.2 and 0.24 
at fl = 0.4. It was reported by Jones and Trivedi [1971] that the solute field about a 
moving ledge has a finite extent (fig. 25), and that this field becomes more localized 
around the ledge as the supersaturation increases and the ledge velocity rises. An 
immediate consequence of their analysis, for multi-ledged interfaces, is that if ledges are 
more widely spaced than the extent of the diffusion field, then each ledge can move 
independently at the steady state velocity. As a result, the interface velocity V; will have 
a constant value, given by: 

Vj = 2DP/X. (52) 

Doherty and Cantor [1982] drew attention to the problem raised by this result when 
compared to the expected velocity for the diffusion-cox&roWzd growth of a plate, which 
is obtained, assuming a negligible barrier from the interface process, by differentiating 
eq. (46), to give the normal interface velocity as: 
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Fig. 25. Analytical model of ledge growth: Predicted contours of equal solute content about a ledge moving 
with a constant velocity. Contours are for different values of the normalized supersaturation, T g = 

With fl 0 = (C 0 -C a )/(C^-C a ) the initial supersaturation and fl = (C-C a )/ (C f -C a ) the remaining supersaturation, 
the value of T g is 0.0 for zero supersaturation (equilibrium solubility), and 1.0 for full supersaturation. (After 
Jones and Trivedi [1971].) 


Vi = dajdt = a'(D/t) m (53) 

The a' of eqs. (46 and 53) has no relationship with the a(P ) of eq. 51. The difficulty is 
that since eq. (53) contains time in the denominator of the velocity function, the velocity 
of the mobile, diffusion-controlled interface falls steadily with time, eventually becoming 
smaller than the velocity predicted for the ledged interface, of spacing. A, however small 
the ledged velocity it might be, provided only that A is finite. Moreover, the velocity 
advantage of the ledged interface continues to grow with longer and longer times, 
becoming, at infinite time, infinitely larger. Although fixed ledge spacings and infinite 
growth times are not experimentally realistic the “thought experiment” described above 
indicates a major difficulty with the Jones and Trivedi analysis. Doherty and Cantor 
[1982] were unable to find the error in the analysis but tested the results against their 
numerical simulations of the diffusion field around a pair of ledges. The simulation 
assumed the ledges to be nucleated in the fully supersaturated matrix and to grow by 
solute flux to the ledge riser. There was no barrier to solute addition at the risers of the 
ledges, which were assumed to show local equilibrium, C t =C a . The rest of the interface 
was, however, not allowed to accept solute; that is, its mobility was set to zero. There 
was, by the geometric “point effect of diffusion”, a higher flux to the top of the ledge 
than to the base. However, the ledge shape was held stable by distributing the arriving 
flux uniformly along the riser: This model made the implicit assumption of rapid 
interface diffusion along the riser interface. The riser interface will have, in the solid 
state case, fig. 23, the structure of the core of a partial dislocation and so would be 
expected to show rapid solute transport down the riser. 

For a ledged precipitate growing from a liquid in solidification, or from a vapor, then 
such rapid solute transport will not be expected and so an alternative model is needed. 
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A suitable model for growth from a liquid will be the one initially used by Jones and 
Trivedi and subsequently by Enomoto [1987] in which the ledge or step velocity is 
determined by the velocity of the base of the step , where local equilibrium was assumed. 
With this boundary condition, higher parts of the riser retain some supersaturation which 
feeds, by matrix diffusion, to the base of the step. 

The results of the Doherty-Cantor finite-difference computer simulation were 
significantly different from the analytical results of Jones and Trivedi. As expected, the 
high local flux from a fully supersaturated matrix gave a high initial ledge velocity but 
this velocity fell steadily with time. For widely separated ledges, the total ledge 
displacement, X s , showed an initial time exponent of about 0.76. This increased to 0.88 
at longer times for supersaturations up to 0.1 (fig. 26). The origin of this ever-decreasing 
ledge velocity was the steady movement of isoconcentrates away from the ledge as time 
increased (fig. 27). The only condition for which a constant velocity, v s , of the ledge was 
found by Doherty and Cantor [1982], was for high supersaturations, f l =0.5, and quite 
long times. Here the solute distribution was highly asymmetric (fig. 27). The asymmetry 
appears to be physically reasonable since the ledge is growing rapidly and therefore 
“catching up” with the isoconcentrates in front of the ledge but leaving a huge “trail” of 
depleted matrix behind it. Subsequent simulations by Enomoto [1987] that were run for 
longer times than those of Doherty and Cantor [1982] found that isolated ledges 
eventually achieved constant (steady-state) ledge velocities at all supersaturations but 
again with asymmetric solute distributions particularly at high supersaturations. In all 
cases there was an initial transient time, that decreased with higher supersaturations, 
during which, as seen by Doherty and Cantor, the growth velocity fell. After the transient 
period the isolated ledge reached a steady velocity that was faster the higher the 
supersaturation. Atkinson [1981, 1982] presented an improved analytical model for the 
steady state growth of an isolated ledge at a constant ledge velocity which avoided the 
error in the original Jones and Trivedi treatment. Atkinson’s analytical results matched 
closely the steady state velocities for isolated ledges produced by Enomoto’s simulations. 



Fig. 26. Computer simulation of ledge growth. The total ledge displacement X s in units of ledge height ft, for 
dimensionless time of growth, t = h 2 /D, for various supersaturations. Only for ft = 0.5 was a constant velocity 
found at long times. Isolated ledge. (After Doherty and Cantor [1983].) 
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0.034 o 409.0 0.995 

Fig. 27. Computer simulation of ledge growth. Predicted contours of equal solu‘e supersaturation at growing 
ledge for three values of t in units of ft 2 //), with the corresponding values of the ledge velocity, V s , in units of 
D/ft, for two different supersaturations, 0.05 and 0.5. S' a normalized supersaturation, defined by S' = {C-C a )/ 
(C 0 -C„); for the full initial supersaturations S' = l. Isolated ledges. (After Doherty and Cantor [1983].) 


Both Atkinson and Enomoto assumed the solidification boundary condition for the 
velocity of ledge, described above, rather than the solid state condition used by Doherty 
and Cantor. The result of the different assumed boundary conditions was that the 
velocities found by Enomoto were slightly smaller than those in the simulations of 
Doherty and Cantor (see Doherty and Cantor [1988, 1989]). Apart from the minor 
difference in the boundary conditions at the riser, the simulations run by the two studies 
appeared to match rather well — despite some rather fierce attacks by Enomoto and 
Aaronson [1989] on the initial simulation of Doherty and Cantor.* 

The critical difference between the computer models and the earlier analytical result 
of Jones and Trivedi was found when the interledge spacing, A, was reduced to allow 
diffusional interaction between adjacent ledges (see fig. 28a). When the growth times 
reached and exceeded the values of A 2 /4 D, where the diffusion fields overlapped, the 
time exponent of the ledge displacement fell to 0.5, which, by eq. (52), produced the 



* Since the author of this review was involved in this and an earlier dispute on ledge growth kinetics, Doherty 
et at. [1977, 1978b], Aaronson [1977], Sankaran and Laird [1977, 1979], readers may wish to read the 
original papers before deciding whether or not to accept the opinion presented here. 
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Fig. 28. Computer simulation of ledge growth, (a) Total ledge displacement X, against lime for closely spaced 
ledges, A -h, A =5h, A =20h and A =68ft. For t>t* (r*=A 2 /D) the diffusion fields overlap and an X s /; 05 
relationship is found. (After Doherty and Cantor [1983]). (b) Precipitate growth (thickening rate) as a 
function of time for the ledged interface, G,, (thick line) and the volume diffusion limited rate, G d , (thin line) 
for two supersaturations, fl o = 0.5 and 0.05, and the ledge spacings B (A/h) = 2, 5 and 10. The arrows indicate 
the values of r*=A 2 /4D. (c) The isoconcentrates, here (l-G/ll 0 ) = 1-5”, at a supersaturation, fl 0 , of 0.5 are 
shown for various dimensionless times T (Dt/h 1 ), with multiple ledges, at a spacing A of 20h, after being 
displaced by the growth distances in units of h and with the dimensionless step velocities V s (vh/D) indicated. 
t* is 100. (After EnOMOTO [1987]). 


References: p. 1497. 





1414 


R. D. Doherty 


Ch. 15, §2 


same time exponent for the interface displacement, a x . This is, of course, the time 
exponent for full diffusion-controlled growth of the interface [eq. (46)]. The simulation 
results, both those of Doherty and Cantor and of Enomoto, showed, after the overlap of 
the diffusion fields from adjacent ledges, that the interface displacement not only had the 
diffusion limited time exponent but also, to within a few percent, the correct value 
[2a'D I/2 , eq. (46)] for the diffusion-controlled slope of a plot of the half thickness, a x 
versus t l/2 . Such plots from the simulations, had time delays corresponding to the initial 
periods of slow growth, where solute close to the interface could not join the precipitate 
except at the ledges. At times somewhat after the diffusion fields overlapped, the kinetic 
behaviour was as though the interface could accept solute everywhere. This occurs since 
the diffusion fields around interacting ledges, apart from the perturbation close to the 
step, are similar to the fields about a flat interface that is accepting solute at all points 
along the interface. Enomoto’s simulations found very similar results. Figure 28b shows 
his computer predicted growth rate for the ledged interface, G„ and that expected for 
diffusion limited thickening by eqs. (46 and 47), G d , at two supersaturations and for two 
dimensionless spacings B (B=k/h). The critical time for diffusional interaction, /*, is 
indicated by the vertical arrows. For times upto t* the growth rate was clearly inhibited 
by the ledged structure but after times of the order of At* the ledged interface moved at 
the rate limited by diffusion. The solute isoconcentrates are shown in fig. 28c, the 
contours are given as (1 -Cl/Cl 0 ). For times less than t* (t = 400) much of the interface 
is almost fully supersaturated, (1 -0/fl 0 )-> 0. But when t= 1000 the isoconcentrates, 
(1 -0/0 0 ), from 1 to 0.4 are parallel to the interface. Under these last conditions, any 
solute atom that enters the solute-depleted region is almost certain to be captured by the 
interface by one of the moving ledges. 

Both sets of computer simulations clearly support the physically reasonable assumpt- 
ion that once the reaction time is significantly larger than the interaction time, A 2 / 4 D, 
between ledges the ledged interface should show diffusion limited growth kinetics. This 
was the assumption used implicitly in the successful application of the Ham, Horvay and 
Cahn growth kinetics, eqs. (48 and 49) to match the behaviour of Widmanstatten plate- 
like precipitates in aluminium alloys discussed in 2.2.5.2.* 

There is no analytical theory that describes the situation for diffusionally interacting 
ledges — this is still a situation that can only be handled by numerical simulation. 
However a rather clear pattern for ledge growth now appears established by Doherty 
and Cantor [1982] and Enomoto [1987], at least for constant ledge spacings. A set of 
ledges, at a given fixed distance apart, grows initially in an isolated manner with a 
steadily falling ledge velocity that, if the ledge spacing is large enough, will achieve a 
constant step velocity and thus a constant interface velocity. The interface thickening rate 
for such isolated, non-diffusionally interacting ledges is slower than expected for volume 
diffusion control. Interaction between the ledges, however, starts to occur at times of the 


* The assumption was also used explicitly in an early dispute about this analysis, Doherty et at. [1977, 
1978b], Aaronson [1977], Sankaran and Laird [1977, 1979], and the computer simulations of Doherty 
and Cantor [1982] were run, in part, to test this assumption. 



Ch. 15, §2 


Diffusive phase transformations 


1415 


order of A 2 /4 D, and, at times somewhat larger than this, there is a transition to volume 
controlled thickening. Once this situation is reached then the analytical equations of Ham 
and Horvay and Cahn become applicable and the precipitates should continue to grow, 
maintaining the aspect ratios that they had acquired at the time of the transition to 
diffusion controlled thickening. This does indeed appear to be the experimental situation, 
see Ferrante and Doherty [1979] and Rajab and Doherty [1989]. 

In contrast to the situation for the growth of ledges, there is almost no understanding 
at all for ledge formation or nucleation. Weatherly [1971] applied conventional 
nucleation theory to the formation of a “pillbox” disc on the surface of a precipitate, but 
the predicted rates of nucleation were too low for any reasonable rate of ledge formation. 
This happened even during initial growth when the interface has full solute super- 
saturation and therefore a high driving force for nucleation. At later stages of growth, 
and especially during Ostwald ripening (§2.4), there is so little supersaturation that 
“homogeneous” ledge nucleation appears impossible (see Doherty and Rajab [1989]). 
Various possible heterogeneous nucleation sites have been described such as those at 
emerging screw dislocations and at the intersections of precipitates on different habit 
planes (Aaronson [1974]). The beautiful experimental observations of Howe et al. 
[1985] show direct evidence for heterogeneous nucleation of ledges on the broad faces 
of a precipitate caused by the intersection of other precipitates forming on other variants 
of the habit plane. Rajab and Doherty [1989] provided indirect evidence from growth 
kinetics, §2.2.5.2, for ledge formation by this method of precipitate intersection. 
However Doherty and Rajab [1989] showed that this effect could not be accounted for 
using conventional heterogeneous nucleation theory. At the point of intersection, where 
heterogeneous nucleation of a ledge using the interfacial energy of the high energy rim 
of the intersecting plate might have been expected, and appears to actually occur, there 
is no supersaturation at all since the rim of the other plate is, from its growth kinetics, 
known to be a perfect sink for solute. The critical problem of understanding the 
mechanism and thus the kinetics of ledge nucleation in the thickening of Widmanstatten 
precipitates is therefore a major unsolved problem in the field of diffusional phase 
transformations. 

2.2.5. Quantitative experimental observations of growth rates 

As with nucleation studies there is a considerable lack of quantitative data on growth 
rates, despite the large number of qualitative observations on growth of product phases 
by diffusional processes. There are a few such quantitative measurements however and 
these will be briefly described. 

2.2.5.I. Interface-controlled growth rates, without change of composition. The best 
data for this type of reaction are probably those for grain-boundary migration in high- 
purity alloys (ch. 28, §3.4.1) It appears from the current state of knowledge, from the 
work of Aust and Rutter [1963] and the review by Haessner and Hofmann [1978], 
that the single-atom model of eq. (27) does appear to describe the process very well, 
with activation energies of the order of those expected for grain boundary diffusion. 
However, grain-boundary migration experiments are often complicated by solute-drag 
effects and also by the problem of knowing what is the magnitude of the free energy 
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driving force. Solute drag effects (Chapter 28) arise from the usual desire of solute to 
adsorb preferentially on a grain boundary particularly when the solute and solvent atoms 
have different sizes. When the grain boundary tries to migrate the solute can be partially 
left behind and thereby exert a drag on the boundary which therefore moves more slowly 
than in the case of very high purity materials when the boundary can quite easily escape 
the solute field. Despite some uncertainties about the details of solute drag model, it 
seems safe to conclude from the present state of knowledge that the basic model for 
mobility of an incoherent boundary is in quite good shape. Low-angle grain boundaries 
and coherent twin boundaries show very low mobility (ch. 28, §§ 2.4.2 and 3.4.2). The 
low mobility of the twin boundaries appears to arise from the need for a ledge mechan- 
isms for migration combined with a low density of the required growth ledges. 

For polymorphous phase transformations there are considerable difficulties in 
determining the growth rates, since in many cases the kinetics of the process are 
apparently controlled by nucleation. A clear example of this is provided by the numerous 
studies of the fcc-bcc phase transition in iron with a range of trace alloy additions. 
Wilson etal. [1979] briefly reviewed the field and presented additional data. They report 
a series of thermal arrests at different cooling rates for high-purity iron and mild steel; 
similar results have been previously found. Two transformations occurring at the highest 
temperatures, approximately 840 and 735°C, are both described by the authors as 
occurring by the massive transformation mechanism, the product in each case being 
apparently ferrite of the same composition as the austenite. The distinction is due 
apparently to different nucleation mechanisms. The high-temperature transformation 
product is described as equiaxed, having a polygonal shape, and was treated by Wilson 
et al. as occurring with the volume stress relaxed, while the lower-temperature product 
is described as massive, and consists of small grains with an irregular, presumably 
facetted, outline. Hillert [1995] reports four plateaux in very rapid cooling of pure iron: 
the first seen at the highest temperature is attributed to equiaxed (massive) growth of 
iron, the second at faster cooling and lower temperatures is attributed to plate-like 
(Widmanstatten) growth, the next is lath (dislocated) martensite and the one at the fastest 
cooling rates and at the lowest temperature is plate (twinned) martensite. The distinction 
between the first two is apparently that the massive reaction takes place by migration of 
the incoherent interface of the bcc ferrite into the fee austenite grain with which it does 
not have a good fit orientation relationship. The plate-like growth of Widmanstatten 
ferrite would then be the growth of grain boundary nucleated ferrite into the fee austenite 
with which it does have the good fit orientation relationship. It would seem that the 
plateau growth, where the growth rate becomes faster at essentially constant supercooling, 
should in both these cases be partitionless, certainly with respect to substitutional 
impurities if not the fast diffusing interstitial impurities. The two martensitic reactions 
will be displacive and partitionless with respect to all impurities. The fascinating 
distinction is that between plateaux #2 and #3 Widmanstatten ferrite and lath martensite 
in nominally pure iron. Since there are two plateaux there has to be a distinction between 
them — this could be either that the Widmanstatten ferrite is not displacive (§2.6) or 
that it is not partitionless with respect to the rapidly diffusing interstitial impurities. (It 
could, of course, also be different in both respects.) It was reported by Hillert [1995] 
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that when trace quantities of carbon, upto 0.02wt%, were added to the nominally pure 
iron, there was a more rapid fall in the plateau temperatures for reactions #1 and #2 than 
for the lath martensite reaction #3. This result was analysed by Hillert as showing growth 
of the first two reaction under local equilibrium, diffusion control, for carbon. That is, 
there was a “spike” of carbon ahead of the growing interface, giving a carbon con- 
centration of the ferrite equal to that of the alloy, with the austenite being carbon 
enriched, but having the same carbon activity, as the ferrite across the interface. The 
width of the carbon enrichment spike in the austenite would be of the order of D y (C)/v. 
D y (C) is the carbon diffusion coefficient in austenite and v is the interface growth 
velocity. Since the diffusion coefficients of the substitutional elements are orders of 
magnitude smaller than that of interstitial carbon their spikes will be of negligible widths 
— indicating true partitionless growth of ferrite for the substitution impurities. However 
additions of nickel had been previously shown by Swanson and Parr [1964] to have 
equivalent effects in lowering the plateau temperatures of reactions #1 and #2. Hillert 
[1995] takes this result as confirmation of a solute drag effect in the massive and 
Widmanstatten ferrite growth reactions, see § 2.6. It is not clear however whether the 
effect of nickel was not merely the slope of the T 0 line, fig. 18, for Fe-Ni. 

A detailed general review of the topic of massive transformations was provided by 
Massalski [1970]: Details of the various structural features in this type of transformation 
can be obtained from this source. One early investigation that did provide quantitative 
data on the diffusionless type of transformation is the study of the massive transfor- 
mation in copper-zinc alloys by Karlyn et al. [1969]. The authors quenched bcc brass 
containing 38at% Zn to room temperature and observed the transformation products on 
briefly up-quenching samples by pulse-heating. They only obtained the diffusionless fee 
product when the sample was held in the single-phase a region of the phase diagram and 
it was not found when samples were held in region II of the phase diagram (fig. 18, 
above) where it is expected that the diffusionless transformation should occur. The 
explanation offered for this failure to see the transformation in region II of the two-phase 
field, when it is thermodynamically possible was that of a nucleation difficulty. It was 
hypothesized that only pre-existing nuclei would have had time to develop, and since 
these pre-existing nuclei had been held in the two-phase region, they were all surrounded 
by rejected solute. The effect of this rejected solute would be to cause the local 
concentration at the interface to be outside region II of the phase diagram. When the 
samples were heated into the single-phase field, however, the nuclei would, after a brief 
time delay to reabsorb the excess zinc solute, be able to grow diffusionlessly. This was 
what actually was seen to happen and the authors were able to describe the growth rate 
of the a grains by eq. (27) with the low diffusional activation energy (for copper alloys) 
of only about 60 kJ/mole. This value is well below that expected for bulk diffusion of 
copper (200 kJ/mole) and lower even than that expected for grain-boundary diffusion 
(80-100 kJ/mole). 

Plichta et al. [1978] compared the growth rates found in a series of massive 
reactions in titanium binary alloys with previous studies of the massive transformation in 
other systems. They showed, for all the systems, that the activation energy was sig- 
nificantly less than expected for bulk diffusion. This result was obtained by use of 


References: p. 1497. 



1418 


R. D. Doherty 


Ch. 15, §2 


eq. (27) and again provides evidence that this equation does indeed adequately describe 
interface-controlled partitionless transformations, 

2.2.S.2. Reactions involving long-range solute diffusion. The range of quantitative 
experimental data available prior to 1968 was reviewed by Aaronson etal. [1970], They 
reported that in most cases the experimental studies showed good agreement with 
diffusion-controlled growth models, such as were described in § 2.2.2. These authors also 
considered additional mechanisms such as the growth of grain-boundary precipitates, the 
theories for which were not included in § 2.2.2. In this modification a precipitate on a 
grain boundary is fed solute by diffusion over a short distance in the matrix to the grain 
boundary followed by rapid grain-boundary diffusion to the precipitate. This “collector 
plate” mechanism gives faster growth than expected for solute transport solely by volume 
diffusion. For several important examples of thickening of Widmanstatten plates, for 
example in Fe-C, Cu-Si and Al-Cu, Aaronson et al. [1970] reported evidence for 
ledged growth with the growth data showed signs of growth slower than expected for 
diffusion control. Later investigations of precipitation of the hexagonal phase from the 
fee matrix phase in Cu-Si by Kinsman etal. [1971, 1973] and by Tien etal. [1971, 
1973] again showed in this case apparent control not by silicon diffusion to the growing 
hexagonal phase but control by interface processes that allowed the ledge mechanism to 
operate. Tien et al. used an electrical-resistivity technique for measuring solute content 
in the matrix and at small driving forces suggested a dislocation pole mechanism similar 
to that used to interpret twinning in fee metals (e.g., Kelly and Groves [1970]). The 
twinning dislocation was an a / 6 <112> partial. The Cu-Si phase diagram is an interest- 
ing example of the type of system which has a very low driving force for nucleation 
since the two phases, cubic a and hexagonal k, are very similar in composition 
[eq. (6d)]. The microstructure of the precipitated two-phase structure shows the hexa- 
gonal plates running completely across the matrix grains. As a result this is a rather 
unusual precipitation reaction, though somewhat similar to Widmanstatten ferrite in the 
iron-carbon system in having a low driving force for nucleation but a large 
supersaturation for growth. The precipitates in Cu-Si nucleate apparently at stacking 
faults while the iron system requires grain-boundary nucleation. The formation of 
precipitates on a single variant of the <11 1> habit planes means that there will be no 
intersections with other precipitates. Such intersections, as noted in §2.2.4, appear to 
allow the formation of thickening ledges. Lack of facility in forming the growth ledges 
will account for growth at interface-limited rates slower than expected for full diffusion 
control. Recent observation on the growth of face-centered tetragonal y-phase from 
hexagonal a in Ti-Al by Sadler-Holm [1994] has shown that in this system there is 
also a lack of precipitate intersections. The hexagonal phase has only the single close 
packed (00.1) habit plane so no intersections are possible for fee precipitates that form 
on this single variant eph plane. In the resulting so-called “lamellar” structure the 
inhibited thickening of the y precipitate phase lead to retained aluminum supersaturation 
in the hexagonal phase particularly after it has transformed to the ordered a 2 structure. 

In the more usual precipitation reaction where a high density of intersecting plate-like 
precipitates form, for example the 0'-phase in Al-Cu or y' Ag 2 Al in Al-Ag, there has 
been a considerable amount of work and resulting controversy. Sankaran and Laird 
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[1974] extended the observations of Aaronson and Laird [1968] and claimed that the 
results showed inhibited thickening compared to the rates predicted by the diffusion- 
control model. They also reported that the lengthening reaction was faster than that 
expected for linear unstable lengthening (§ 2.2.6). Chen and Doherty [1977] reanalyzed 
the earlier data on the growth kinetics of d' in Al-Cu and suggested that the results 
appeared to fit quite well to the Ham [1958, 1959] and Horvay and Cahn [1961] 
diffusion-controlled model for growth with a constant aspect ratio [eqs. (48 and 49)]. 
This led, not unexpectedly, to a considerable controversy between the various authors 
(Sankaran and Laird [1977, 1978], Aaronson [1977] and Doherty et al. [1977, 
1978b]). It is probably impossible for someone deeply involved in a scientific argument 
to give a really objective account of the situation but it appears that it is reasonable to 
say that the data presented agree surprisingly well with the diffusion-controlled model, 
that is, with no significant inhibition of the growth rate, but with very large aspect ratios. 
These high length-to-width shapes then require some significant initial inhibition of 
thickening at the early stages of precipitate growth. 

The situation is perhaps easier to understand in the studies of precipitate growth in 
the Al-Ag system (Laird and Aaronson [1969], Ferrante and Doherty [1976, 1979], 
and Rajab and Doherty [1982, 1989]). Figure 29 shows the observed growth kinetics 
in a particular experiment and indicates that the results show the square root of time 
relationship expected for diffusion control. Moreover, the slopes of the size// 172 plots are 
close to those predicted by Ham [1958, 1959], and by Horvay and Cahn [1961] for 
diffusion control [eqs. (48) and (49)]. However, in this and other experiments, the 
observed value of the aspect ratio, A, varied widely with the precipitate growth con- 
ditions and, in every case, A was much greater than the value expected for precipitates 
with the equilibrium shape determined by interfacial energies. 

Ferrante and Doherty postulated that the aspect ratio was increased by immobility in 
the thickening reaction at the earliest stages of the reaction, before the measurements 
shown by fig. 29 could be made. Under these conditions the aspect ratio would increase 
rapidly, as had been previously argued by Aaronson. 



Fig. 29. Half-lengths, a y , and half-thickness, a„ of plate-like precipitates of Ag 2 Al growing in aluminium, as 
a function of time' /2 (after Ferrante and Doherty [1979]). 
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The details of this hypothesis are easily given: The diffusion-controlled thickening 
rates are given by: 

dajdt = ( Dp/if \ (54) 

da y /dt = A(Dp/t) m , (55) 

These are the derived equations for growth rates that when integrated, give the growth 
equations (48) and (49), quoted above. 

The rate of change in aspect ratio, dA/dt is easily derived from the definition of A: 
A = aja x , (56) 


dA/dt 


1 A 


V 


dt 


dr 


(56a) 


Substitution of eqs. (54) and (55) into eq. (56a) shows, as had been pointed out by Ham 
[1958], that with diffusion-controlled lengthening and thickening starting from an initial 
aspect ratio A, that shape will be preserved (dA/dr = 0). However, if the thickening 
reaction is inhibited by some factor/ with/<l, eq. (54) becomes: 

dajdt = f{DfS/ij' 2 , (54a) 


So, under this condition of inhibited thickening there will be an increase of the aspect 
ratio, given by: 


dA/dt = -(^- 1 (1-/). 


(57) 


Ferrante and Doherty [1979] showed direct evidence for this effect by TEM 
observations at the earliest stages of precipitation, at shorter times than appear on fig. 29. 
The observations were of thickening slower than predicted, F < 1, and a steadily rising 
value of the aspect ratio, A [eq. (57)]. Rajab and Doherty [1982, 1989] extended the 
observations in the Al-Ag system to a wider range of temperatures and supersaturations. 
They also found, for most of the precipitation reaction, diffusion-controlled growth rates. 
In addition, they found that the experimental value of the aspect ratio, A, though constant 
for a particular reaction condition, varied systematically with the density of precipitates. 
The aspect ratio fell as the density of precipitates increased. The authors argued that this 
variation in A was due to variation in the duration of the period of initial thickening 
inhibition, with/< 1, caused by a shortage of growth ledges. However, when precipitates 
on different habit planes intersected each other, which occurred earlier in the process, 
and so at smaller values of A, for higher densities of precipitates, then inhibition ceased 
and the precipitates retained the value of A at the time of intersection. The microstructural 
origin of this effect was assumed to be easier ledge nucleation at the intersection of two 
precipitates. Microstructural studies by Rajab [1982], Rajab and Doherty [1989] and 
particularly those of Howe et al. [1985] gave experimental support for this hypothesis. 



Ch. 15, §2 


Diffusive phase transformations 


1421 


The current position on observation of precipitate growth kinetics suggests therefore 
that the theoretical ideas described in §2.2 appear to be well founded. That is, most 
reactions are diffusion-controlled except those with a coherent interface, between 
different structures, when a ledge mechanism is required. Even with a ledged growth, 
however, diffusion-controlled growth is possible provided that there is a sufficiently 
active source of ledges. The major problem that remains is the lack of understanding 
about the mechanisms and kinetics of ledge formation. A similar conclusion appears to 
apply also for Ostwald ripening of plate-like precipitates (§ 2.4). 

Finally, mention should be made again of the studies on nucleation and growth in the 
copper-cobalt system by Servi and Turnbull [1966], Their observations showed clear 
evidence for diffusion-controlled growth of metastable fee cobalt precipitates in the fee 
matrix. This is a “cube-cube” orientation relationship between fully coherent precipitates 
which should not need growth ledges. The agreement with diffusion control in this 
system is very satisfactory confirmation of the mobility of interfaces between coherently 
connected structures when they have the same orientation and structure. The precipitate 
growth was studied in this system from the earliest stages of growth, at the smallest 
values of radius r, by virtue of the electrical resistivity technique used. Later studies on 
other systems with a very high density of homogeneously nucleated precipitates, such as 
those of Xiao and Haasen [1992], while not indicating any significant interface control 
do not find the r/ 1 growth kinetics of diffusion control. This result seems reasonable, 
since the r/t kinetics are only predicted for independent growth of diffusionally isolated 
precipitates. If the spacing A 3 between precipitates, after nucleation at a time t is less 
than (Dt)' /2 , then such isolated growth kinetics is not to be expected. The successful 
numerical analysis of nucleation, growth and coarsening by Kampmann and Wagner 
[1984] and Haasen and Wagner [1992] in Cu-Ti and Ni-Al alloys were based on 
diffusion-controlled growth and the success of their models supports the expected 
(§2.2.3) lack of any interface immobility between the coherent (cube-cube) nuclei in 
both alloys. 

2.2.6. Growth instabilities 

This section is divided into two parts; firstly is a consideration of the onset of 
instability, and secondly is a discussion of the linear growth expected for well-developed 
instabilities, the lengthening of dendrites and Widmanstatten plates and needles occurring 
with a constant tip or rim radius. 

2.2.6.I. Initial instability. For a spherical precipitate growing in a solute field 
(fig. 20, above) with either full diffusion control or even with mixed control, C a < C, < C a , 
the solute gradient sets up a potentially unstable situation, first analyzed by Mullins and 
Sekerka [1963]. If the sphere is perturbed by any oscillation, described mathematically 
as a spherical harmonic of amplitude S s , then those parts of the sphere with a locally 
larger value of the “radius”, the distance from the center of the original sphere, (but a 
smaller radius of curvature, p) will penetrate into more supersaturated matrix and grow 
faster, while parts of the sphere with a smaller value of the radius but with a larger 
radius of curvature will grow more slowly, see fig. 30. As a result, the instability will 
grow forming a cellular or dendritic interface. This situation had been long recognized 
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(a) 



R 


(b) 


Fig. 30. (a) A sphere of unperturbed radius r (full line) perturbed (dotted line) by a spherical harmonic of 1 = 6 
in six “<100>” directions, four of which are shown. At the maxima the local radius of curvature has decreased 
to p+ while at the minima it has increased (and changed sign) to p-. The matrix solubility has increased to 
C a (p+) at the maxima and decreased to C (1 (p-) at the minima, (b) Solute C as a function of the radial distance 
R from the center of the /3 precipitate to the a matrix through one of the maxima with solubility (C „ (p +). 
C a (p— ), the solubility at the minima, is also shown at its radial distance from the origin. 
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in solidification and had been given a classical analysis by Tiller et al. [1963] as 
constitutional supercooling (ch. 8, §6.4.1). In solidification, a temperature gradient 
provides some stability against growth of perturbations into the commonly observed 
growth form, a highly branched dendrite. For an isothermal precipitation reaction no 
such stabilizing temperature gradient exists. Mullins and Sekerka showed, however, how 
the interfacial energy will partially stabilize the growing sphere. 

In the following discussion, only the case of the growth of a solute-rich precipitate by 
flux towards the growing precipitate will be considered. The opposite case, of growth of 
a solute-poor phase, for example, the growth of alpha-iron in austenite in Fe-C alloys, 
merely requires an appropriate change of notation. 

To describe this stabilization, the Gibbs-Thomson equation for the increase of 
solubility of a solute-rich spherical precipitate of radius r is needed. This is given in the 
general case by Purdy [1971] (see also Martin and Doherty [1976, 1995])* as: 


Cjr) = C„ 


1 + 


(1-CJ 2crV m 
(C /3 ~ C a ) RTre a j 


(58) 


where e a is the Darken non-ideality factor (§ 1.2). For dilute terminal solid solutions this 
reduces to the usual form: 


= C a 1 + 


2o-V a ) 

RTr j 


(58a) 


The effect of surface energy on solubility is such that when the precipitate nucleates at 
the critical radius r*, C i = C a (r) = C 0 , and so, by eq. 38, there is zero net supersaturation, 
and the growth rate, dr/d/, is zero. Eq. 38 becomes: 


dr/d/ = 


D Co-Q(r) 
r C p (r) - C a (r ) ' 


(59) 


Substitution of eq. (58a) into eq. (59) gives: 


dr/df 


2DaV m C a lj 

f-- 


{Cp ~ C a) RT r 


r j 


(60) 


Here C^(r)-C a (r)~ Cp -C a since both compositions shift in the same direction with the 
Gibbs-Thomson effect. 

This growth rate is plotted out in fig. 31, which shows that the growth rate rises to 
a maximum value at r-2r*, and then declines, becoming that given by eq. (41) only for 
r »r*. 

Mullins and Sekerka showed that for this type of diffusion-controlled growth, the 


* Figure 1 used to describe the driving force for precipitation is readily converted to be used to derive the 
Gibbs-Thomson equation. For the Gibbs-Thomson equation, C 0 becomes Cjr) and L—>K becomes TxrVffr. the 
increase in free energy per mole (dG/dn = dG/dr. dr/d/t) due to the curvature of spherical precipitates of radius r. 
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Fig. 31. Variation of growth rate of a spherical precipitate radius at a fixed supersaturation. At r=r*, the 
critical radius for nucleation, the growth rate is zero. 


spherical shape was stable against all perturbations until it reached a limiting radius for 
absolute stability, r l = lr*. The physical origin of the stability is that while an unper- 
turbed sphere has a constant radius and thus solubility, at the maximum of the perturba- 
tion in fig. 30, the radius of curvature p, which is the radius to be used in eq. (58) for 
non spherical precipitates, decreases, below the radius, r, of the unperturbed spherical 
precipitate while at the minimum in the perturbation the radius of curvature increases. As 
a result of the Gibbs-Thomson effect, the solubility rises at the maxima and decreases 
at the minima. This creates a concentration difference that opposes the development of 
the perturbation. (The radius of curvature of a surface is the mean of any two radii that 
are perpendicular to each other — 1/p = 1/r, + l/r 2 .) 

The condition of absolute instability is that a perturbation of amplitude S s will grow, 
that is dS s /df>0. A spherical harmonic with two maxima is the smallest to show this 
type of instability. A spherical harmonic with two maxima, / = 2, is a sphere perturbed to 
an oblate spheroid - an ellipsoid of revolutionf. This has a shape similar to that of 
either a rugby or an american football. Higher spherical harmonics are spheres distorted 
with increasing numbers of maxima — given by the parameter l such that with (=6 a 
distortion is capable of giving maxima in 6 directions equivalent to the cubic crystallo- 
graphic directions <100>, fig. 30. Growth of this will give the shape seen in cubic 
solidification dendrites. Coriell and Parker [1967] extended Mullins and Serkerka’s 
analysis and showed that absolute stability was less important than relative stability. The 
criterion for a sphere to show relative instability is that 


1 dS s 1 dr 
S s d t r dr 


( 61 ) 


This criterion arises since if the relative increase, (l/<5 s )(d<5 s /df) is >0 but <(l/r)dr/dt, 
then although the sphere is unstable, the observed shape will become apparently more 
spherical, rather than less, with time. The smallest radius that shows relative instability 
is the 1 = 3 spherical harmonic with three maxima; this occurs when r> 36 r*. When 1 = 6, 


t / = 1 merely displaces the sphere. 
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giving the 6 <100> orthogonal dendritic arms, r> 66 r*.* 

The analysis above is a limiting case since, as discussed by Shewmon [1965], there 
are other stabilizing factors that must also be considered, particularly for solid-state 
reactions. Shewmon developed his analysis since he could find no examples of solid-state 
dendritic growth, and wished to understand why. However, dendritic growth shapes are 
now known, and have been studied, in a few metallic solids by Malcolm and Purdy 
[1967], Bainbridge and Doherty [1969], Bainbridge [1972], Fairs [1977], Ricks et 
al. [1983] and Baumann and Williams [1985]. Despite these examples of sold state 
dendritic growth, the problem was only changed to — why were there so few examples 
of solid-state dendrites? 

The additional stabilizing factors that inhibit growth perturbations include the 
following: 

(i) strong anisotropy in the interfacial energy — for a facet plane; 

(ii) interface immobility; 

(iii) interfacial diffusion; 

(iv) diffusion within the precipitate; 

(v) highly anisotropic precipitate shape — high initial aspect ratios; 

(vi) closely spaced precipitates; 

For metallic solidification, where dendrites are ubiquitous, none of the first five 
stabilizing influences are significant, while most solid-state diffusional reactions do have 
some or all of these stabilizing factors. 

The first four of these effects were discussed by Shewmon [1965] and each was 
shown to be strongly stabilizing. No quantitative analysis was made for (i) the stability 
of an interface at an energy cusp orientation giving a facetted plane (figs. 10 and 11). 
The physical basis of the enhanced stability due to anisotropy is easily seen, however. 
Any shape perturbation of a facetted plane would have to increase not only the interfacial 
area, the usual problem for a perturbation, but a perturbation at a cusp also requires the 
interface to rotate away from the cusp plane and thereby greatly increase the specific 
interfacial energy. Partial interface immobility (ii), giving mixed control, will also 
stabilize the precipitate since it reduces the concentration gradients of solute away from 
the interface, these gradients cause the instability. High diffusivity at the interface (iii), 
or within the precipitate (iv), relative to the diffusivity in the matrix phase is stabilizing 


* Mullins and Sekerka [1963] had suggested that their analysis indicated that the growth equations of Ham, 
and Horvay and Cahn, eqs. 48 and 49, would be unlikely to apply since the spheroid shapes described should 
be absolutely unstable. As a result the length/ t U2 kinetics should transform into the linear kinetics discussed in 
§ 2.2.6.2. This is clearly experimentally incorrect, § 2.2.5.2. The origin of the mistake by Mullins and Sekerka 
is not obvious but appears to arise for at least two reasons. The first is the relative stability analysis of Coriell 
and Parker [1967]. The plate like precipitates of Ham, Horvay and Cahn are approximately highly distorted 
spheres with 1 = 2 — such shapes though absolutely unstable are relatively stable. As a result, even if capillarity 
is added to Ham, Horvay and Cahn analysis, the predicted shape stability may not change. In addition, as 
discussed below, there is no good analysis of shape stability in the presence of high anisotropy of the interfacial 
energy. The approximate analysis offered by Doherty [1982], see §2.2.6.2., suggests that high aspect ratio 
precipitates should be stable against linear lengthening except under high supersaturations, A > 0.5, much higher 
than the typical values of A S 0.1, found in precipitation reactions. 


References: p. 1497. 



1426 


R. D. Doherty 


Ch. 15, §2 


since solute flux along these pathways can reduce the interfacial area but unlike diffusion 
in the matrix it does not contribute to precipitate growth. An analysis of surface 
smoothing, in the absence of any perturbing gradients, was given by Mullins [1959] for 
the various diffusion pathways. On any surface, variations in curvature are removed by 
diffusion in both phases and along the interface unless the interfacial energy stabilization 
is overcome by the gradient instability. 

The effect of an anisotropic precipitate shape (v) on shape stability has not been for- 
mally analyzed with a perturbation technique. Doherty [1982], however, indicated why 
a shape with a high aspect ratio should be stable by use of the equations developed in 
§ 2.?.2 for stable growth by comparison with the expected linear growth rate of the end 
of a Widmanstatten plate. The analysis is given in the discussion of linear growth models 
(§2.2.6.2). Linear growth of Widmanstatten plates (Purdy [1971]) as in the linear 
growth of dendrites, Malcolm and Purdy [1967], Doherty etal. [1978] is the usual 
growth consequence for the tips of the instability. No full analysis of the transition from 
developing instability to linear dendritic growth appears yet been made. The current 
models of dendritic growth, due to Doherty et al. [1978], Langer and Muller- 
Krumbhaar [1978], Huang and Glicksman [1981], Kurz and Fisher [1989], find that 
the tips of dendrites have a radius of curvature that is determined by an instability 
condition. 

The final stabilizing influence of closely spaced precipitates (vi) is easily understood, 
though there is apparently no proper analysis of the stability of closely spaced precipit- 
ates. The effect arises since closely spaced precipitates show an overlap of the diffusion 
fields of adjacent particles (the soft impingement effect). This effect, for two precipitates 
spaced less than ( Dt) os apart, reduces the concentration gradients about each precipitate, 
see fig. 20, as C 0 = C M — > C a . Under these circumstances the composition gradient driving 
the instability falls, and the closely spaced spherical precipitates will be stabilized. The 
result of this is seen for solidification either whenever there is a high rate of primary 
nucleation (Kattamis et al. [1967]) or by secondary nucleation in stir-casting (see 
Vogel [1977], and Doherty et al. [1984]). Under these conditions of closely spaced 
growth centres, the solidification growth form is found to be spherical, not dendritic. This 
was nicely illustrated for the Al-Al 3 Li solid state case studied by Baumann and 
Williams [1985]. With high supercooling below the solvus, the coherent ordered Al 3 Li 
precipitates, with a high spatial density; were found to be spherical. With lower 
supercoolings, giving more widely spaced precipitates, these showed a clear dendritic 
growth instability. 

The combination of all these effects appears to explain the lack of shape instabilities, 
at least those that might give true dendritic shapes, in all but a few solid-state reactions. 
The few dendritic shapes that have been found occur with cube-cube oriented precipitates 
that have low and isotropic interfacial energies and low interfacial diffusivities ; in 
addition, the precipitates all have ordered structures, which have low rates of diffusion 
within the precipitates and the examples giving dendritic shapes all occur with low 
densities of precipitates. The clearest examples were of y-brass in /3-brass (Malcolm 
and Purdy [1967] and Bainbridge and Doherty [1969]) and y' in y, Ni 3 Al in Ni 
(Ricks et al. [1983]) and the equivalent structure of 8' in a, Al 3 Li in Al, Baumann and 
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Williams [1985] the last two at small undercoolings. These examples all have, coherent 
low diffusivity interfaces, ordered low diffusivity precipitates and low precipitate 
densities. Deliberate attempts to produce shape instabilities in other cube-cube systems, 
for example for silver precipitates in copper (Doherty [1976]), failed, presumably owing 
firstly to the high rates of interfacial diffusion expected in the semicoherent interface 
containing a high density of edge dislocations between silver and copper, where there is 
a large misfit between the two phases, and secondly to the significant rate of diffusion 
within the disordered silver rich precipitates. 

22 . 6 . 2 . Linear growth models. For both dendrites and plate-like and needle-like 
Widmanstatten growth forms, there are well developed models for the linear growth of 
the tips based on the assumption that the tip radius does not change. For the tip of a 
needle, the radius is the same in all directions, while for the rim of the plate the constant 
radius is the small one between the two parallel habit planes and not the large radius of 
the growing cylindrical plate. The earliest model for lengthening of a plate was given by 
Zener [1946] and modified by Hillert [1957], The essential features of the model are 
the assumption of a constant tip radius which will give, for a needle crystal, a growth 
rate given by a slightly modified form of eq. (41) 


dr/d/ = 2 


D C 0 -C a 

r C ,- C « 


2 D(1 

r 


(41a) 


The extra factor of two in the eq. 41a, compared to eq. 41, comes from the difference in 
growth geometry. For the growth of a sphere (eq. 41) the flux of solute, D(dC/dK) R=r , is 
applied over the whole surface area, 4 77T 2 , and all of the surface is growing normal to the 
surface. For the linear lengthening of a circular shaped needle crystal the flux is 
delivered to one side, so the solute flux is fed onto an area of 2tti 2 , but the area that is 
fed by the flux is only the cross sectional area of the needle, m 2 , fig. 32, so the growth rate 
is thereby doubled. 

In the solidification literature, see for example, Doherty [1980] or Kurz and Fisher 
[1989], the velocity, v = dr/ dt is made dimensionless as the Peclet number P in eq. 41b. 


P = 


vr _ C 0 - C a 
2D Cp - C a 




(41b) 


V 


a 

Fig. 32. Linear lengthening at a constant growth velocity v. and constant tip radius of a cylindrical needle 
crystal of fi with a hemispherical tip of radius r growing in and a supersaturated a matrix. 
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However, the effect of the surface energy has also to be included as in eq. 59 and so we obtain: 


dr/dr = 


2D C 0 - C a (r) 
r C p (r)-C a (r) 


(59b) 


This gives the same plot as that of fig. 31 apart from the factor of two. For plate-like 
crystals, the equations are slightly different, but the physical reasoning and the resultant 
form of the relationship, between velocity and plate-tip radius are very similar to that 
seen for the simple needle crystal. 

Having produced such relationships as those of eq. 59b etc., it is necessary for the system, 
and thus any theory of the process, to select the particular value of the radius for the plate- 
like or needle-like crystal. Any value of r is an equally valid solution to a diffusion equation, 
such as eq. (59b). For convenience and for lack of any other simple choice, the initial theory 
selected the maximum-velocity criterion (Hillert [1957]). As explained in reviews for plate- 
lengthening, for example by Hillert [1975], and for needle and dendritic growth, for 
example by Doherty [1980], the simple theory has proved inadequate and has had to be 
modified. Two major modifications have been required: The first is for a more accurate 
diffusion model and the second is the adoption of a more successful determining 
principle for tip radius. Both of these modifications are described below. 

(1) A better diffusional model. For the needle-like crystal the required diffusional 
solution, neglecting interfacial energy considerations, is that due to Ivantsov [1947]. 
The Peclet number P is given by: 

ft - P &\p(P)E l (P). (62) 


£[ (P) is the integral exponential function, see for example Kurz and Fisher [1989]. For 
a plate-like crystal the equivalent expression, Ivantsov [1947], is: 

ft = (rrP) l/2 exp(P)erfc(P 1/2 ). (63) 


This gives a unique value of P (P - vr/2D) for a given supersaturation ft but still leaves 
the choice of the radius r and thus the growth velocity v undetermined. Trivedi [1970a] 
solved the problem after introducing the surface energy and thus the Gibbs-Thomson 
modification of solubilities and selecting a tip radius that yielded the maximum growth 
rate. Trivedi’s rather complex equation can be reduced at low-to-medium values of the 
supersaturation f l by the the approximation due to Bosze and Trivedi [1974]: 


27 ftft 2 T 
256m* ' 


(64) 


The Bosze and Trivedi supersaturation, ft BT , is related to the supersaturation, ft, by: 
2ft ft 2 ^ 


ft BT — ft/ 1 


7T 27T 


(64a) 


r* is the critical radius of nucleation, eq. 10a, where C 0 = C a (r) and is often related to the 
supersaturation by: 
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i* = r/o. 

The capillarity constant, T, is given for precipitation by: 
r _ 2*V m C a {\-C a ) 

RT(c t 3 - c a ) 2 c, ■ 


(64b) 


(64c) 


Hillert [1975 and 1994] has shown that eq. (64) gives good agreement with the observed 
rates of linear lengthening of Widmanstatten a-iron plates in Fe-C alloys and Purdy [1971] 
showed equivalent agreement for the needle-like a -brass precipitates forming from ^ -brass in 
Cu-Zn alloys. In both these cases the precipitates grow from grain-boundary nucleation 
sites since, the composition differences between the two phases being small, there is 
insufficient driving force for nucleation within grains [eqs. (6d) and (7)]. 

(2) Determining principle for tip radius. Despite the obvious attraction of analytical 
simplicity of choosing the radius that gives the maximum growth rate, there is no reason 
why nature should share this outlook. Various other determining principles have been 
suggested in the past, mainly for the linear growth of two-phase products, eutectics, 
eutectoids and cellular precipitation reactions, see § 2.5 (Puls and Kirkaldy [1972] and 
Livingston and Cahn [1974]). In her studies, on solid state dendritic growth of ordered 
y dendrites in /3- brass, Fairs [1977] measured both the linear growth rate and tip radius 
for a range of precipitation conditions. She observed that the tip radius was larger than 
that expected to give the maximum growth rate. As a result the growth rate was, of 
course, less than the expected maximum velocity. These experimental solid state 
observations were matched by the detailed studies by Glicksman et al. [1976] on the 
solidification of thermal dendrites in undercooled succinonitrile, a low entropy of 
melting, transparent, organic analogue for metals. In their succinonitrile study, the 
authors reported that the growth velocities were also significantly less than expected for 
the maximum velocity. The data on succinonitrile were reanalyzed by Doherty et al. 
[1978a] who showed that the effect observed by Fairs [1977] was occurring in the 
organic solidification dendrites: The tip radius was significantly larger than that which 
would give the maximum velocity but the data lay on the plot predicted by the Trivedi 
equation (Trivedi [1970b]) for dendritic or needle growth. 

Fairs showed both in her solid-state dendrites and in the organic dendrites that the 
operating tip radius seems to be determined by a stability criterion. This was originally 
thought to be relative stability but, as argued by Huang and Glicksman [1981], 
absolute stability is more likely to be appropriate for a perturbation on a tip with a 
constant tip radius. It is, however, still not intuitively obvious why stability rather than 
the maximum-velocity criterion controls the tip radius and therefore the growth velocity. 
Langer and Muller-Krumbhaar [1978] developed a detailed theory for dendritic 
growth with the tip radius determined by a stability criterion and their general theory 
appears to be very successful. According to this description, the dendritic or needle 
growth rate is determined from the Ivantsov equation for a needle crystal, eq. (62). Peclet 
number, P, determined by eq. (62) is determined by: 

P = vr/2D = (vT/2 Da'). (65) 
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Here a' is a stability constant that is found to be 0.025. A similar analysis should be 
feasible for determining the operating rim radius for the growth of plate-like precipitates 
— however no such analysis appears to have been published — nor have any detailed 
experimental studies been published aimed at determining the operating conditions for the 
linear growth of plate-like precipitates. However Olson etal. [1990] have described, see 
§ 2.6, an interesting new approach to this problem for bainite growth. 

In the current analysis of dendritic growth, there is still no adequate treatment 
showing how an initial instability in a spherical precipitate develops into a branched 
dendrite, whose linear growth can be described by stationary diffusion fields around the 
constant tip shape [eqs. (64) and (65)]. Since the tip radius in both cases is determined 
by stability — relative stability for the initial perturbation and absolute stability for the 
linear growing needle — it is possible that a nearly constant tip radius applies through . 
out the transition. However there is no theoretical or experimental support for this 
speculation. An even more striking problem is the lack of a stability theory to describe 
how a plate-like Widmanstatten precipitate initially growing, like a sphere with a size- 
root-time relationship, can first evolve a perturbation at the rim, and then subsequently 
adopt the constant growth rate and tip radius experimentally found (Hillert [1975]). 
The present author (Doherty [1982]) suggested a possibility for what it might be that 
determines the change from growth with a stable shape, with a constant A in eqs. (49) 
and (55), to an unstable linear growth, given for example by eq. (63), with an increasing 
A. Doherty suggested that the change might be expected to occur when the linear growth 
rate becomes faster then the instantaneous lengthening of the stable shape. Eq. (55) 
shows that the instantaneous growth rate of a stable plate-like precipitate falls with 
increased time and thus precipitate size. If the hypothesis is correct, then the value of the 
aspect ratio, A, will be crucial since large values of A, that promote stable growth, will 
greatly delay the transition to linear lengthening. The other factor of crucial importance 
is the supersaturation, H. Table 2 gives the half-length a* for which the Bosze and 
Trivedi relationship [eq. (63)] predicts the same lengthening rate (v = dciy/dt) as the stable 
Horvay and Cahn result. The results in table 2 were obtained for a typical value of the 
aspect ratio seen for precipitates nucleated within a grain, A =100, and a capillary 
constant, T, of 0.6 nm, needed to determine the critical radius of nucleation r*. For 
grain-boundary nucleated precipitates a smaller value of supersaturation will arise if 


Table 2 

Plate half-length for which eqs. (49) and (63) predict the same lengtening rate. 


Supersaturation fl 

Supersaturation fl* 

Precipitate tip 
radius r, nm 

Growth 
parameter f} 

Plate half-length 
a*, 

0.01 

0.0 1 

1060r c =6.0x 10“ 

7x 10' 5 

2.3 x 10" 6 

0.03 

0.0306 

306r c =7x 10 3 

3x 10 -4 

0.12x 10" 6 

0.1 

0.107 


4x 10' 3 

i.2x io -4 

0.3 

0.38 

28r c = 60 

5x 10' 2 

1.2x 10“ 3 

0.6 

1.07 

10r c = 10 

1 x lO -1 

50 


For a^>a* linear lengthening would be expected. 
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smaller initial values of A were used. 

The huge values of a* at the low supersaturations typical of age-hardening precipitat- 
ions of solute-rich precipitates from solution-treated alloys. Cl < 0.3, suggest why linear 
lengthening is not seen in such precipitation reactions as Ag 2 Ag from A1 and 6' (Al 2 Cu) 
in Al-Cu (§2.2.5). However, matrix phase transformations, such as the formation of 
ferrite from austenite, usually do occur at large supersaturations, Cl > 0.6, but with low 
nucleation densities (§ 1.3). In such a case, the values of a* become smaller than the 
matrix grain size so the observation of linear unstable growth of Widmanstatten plates in 
this system is then understandable. This can only be a tentative description, since the 
analysis is approximate and the operating value of A in the stable growth regimes of 
Widmanstatten ferrite is not known. Experimentally, however, observation on the early 
stages of the formation of Widmanstatten plates at grain boundaries should be relatively 
straightforward. In addition, although the Trivedi analysis for linear growth rates have 
been corrected for interfacial energy effects, the growth of the stable precipitates in 
eq. 49 has not been corrected for capillarity. In the case of the most common examples 
of linear growth of Widmanstatten precipitates, in bcc iron precipitating from fee iron in 
Fe-C alloys and in fee a -brass forming from bcc /3-brass, the plates develop from grain- 
boundary nucleated films. 

Townsend and Kjrkaldy [1968] observed the development of this instability and 
suggested that it could arise as seen in fig. 33. It is assumed that the bcc ferrite forms in 
(a) on a grain boundary with the good fit plane of the precipitate nearly parallel to the 
matching plane in the upper grain. Growth continues along the grain boundary in (b) but 
when the curved grain boundary moves away from the good fit interface, further growth, 
(c to e), allows instabilities to develop allowing the plates to form with the low energy 
habit planes shown. This model nicely matched the microstructures seen. Development 
of instabilities in this form should allow linear growth to occur at smaller super- 
saturations and sizes and in the analysis of growth from a sphere (table 2). Townsend 
and Kirkaldy [1968] and Hillert [1975] both find linear lengthening at smaller values 
of fl than the minimum of about 0.6 suggested by the simple analysis of table 2, using 
the value of A of 100. 

2.3. Precipitate dissolution 

There has been much less interest in the dissolution of precipitates than in their 
growth for the obvious reason that, provided an alloy is heated into a single-phase 
region, all the precipitates will dissolve, given sufficient time. As a result, the microstruc- 
ture of the final heat treated material does not carry with it the microstructural conse- 
quences of the kinetics of dissolution. This is not the case for precipitation where the 
microstructure is determined by the kinetics of the reaction. However, for precipitates 
that are not completely soluble after an increase of temperature, there will be a period of 
dissolution that will affect the microstructure because of the different shrinkage rates of 
different precipitates. In addition, observation of precipitate dissolution can provide 
insight into the amount of diffusion control. This idea was exploited by Hall and 
Haworth [1970] and Abbott and Haworth [1973] in studies of the dissolution of 
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(a) 

\ 

\ 



Fig. 33. Development of grain-boundaiy-nucleated Widmanstatten plates: (a) Grain-boundary allotriomorph, 
with good fit plane, with the upper grain, almost parallel to the grain boundary; (b) Precipitate growth around 
a curved grain boundary; (c) to (e) Growth of instabilities evolving into linear lengthening Widmanstatten 
plates. After Townsend and Kirkaldy [1968], 


plate-like precipitates in the Al-Cu and Al-Ag systems by microprobe analysis of the 
solute fields in the matrix normal to the habit plane of the precipitates. In the first 
example, apparent diffusion-controlled dissolution was found, while in the Al-Ag alloy 
some evidence for mixed control was reported with the solute content in the matrix at the 
interface, C„ being less than the equilibrium value, C a . For such systems, where a ledge 
mechanism would be expected, it should be remembered that for dissolution of isolated 
plates the incoherent rims of the plates are good sources of ledges (Martin and 
Doherty [1976]). 

A detailed analysis of precipitate dissolution for spherical precipitates and for 
diffusion away from the broad faces of plate-like precipitates was published by Aaron 
and Kotler [1971], Their review provides a very detailed summary of the theory and 
need not be reproduced here. Only some of their major conclusions will be described. 
Aaron and Kotler compared the processes of growth and dissolution and showed that 
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latter is not the simple opposite of the former, particularly for spherical precipitates. The 
distinction arises since the rate of dissolution, for a diffusion-controlled reaction, is 
initially infinitely fast and then falls as the concentration gradient in front of the interface 
becomes shallower. However as the process continues, the radius of the sphere shrinks 
and the volume releasing the solute falls so the dissolution rate rises again. The effect is 
due to the spherical symmetry of the matrix absorbing the solute. Plate-like dissolution 
by thinning however does not show this acceleration and the rate of thinning continues 
to decline (fig. 34). Aaron and Kotler also point out that the details of the diffusional 
analysis are more difficult for dissolution than for growth since, in the dissolution case, 
the source of solute, the interface, is retreating away from the surrounding matrix so that 
the solute distribution close to the interface undergoes a more complex behaviour during 
dissolution than during growth (fig. 35). This problem becomes very severe for the case 
of dissolution of a plate-like or needle-like precipitate growing back from the tip. In this 
case the tip will move into a region of matrix already enriched with solute due to the 
thinning process. No analytical solution of the problem appears to be available. 

In the analysis provided by Aaron and Kotler [1971], it was shown that, although 
capillarity effects are expected to increase the rate of dissolution of spherical particles 
owing to the rise of solubility at small radii, these modifications appear to be small for 
the 1 /zm-sized and larger particles of interest in dissolution reactions above the 
equilibrium solvus. Although this was not considered by Aaron and Kotler, the acceler- 
ation due to capillarity will be of much greater significance for the dissolution of the 
very much smaller precipitates of concern in precipitation reversion. Reversion is the 
dissolution of very fine metastable precipitates when the ageing temperature is raised 
during a precipitation-hardening reaction. Sargent and Purdy [1974] pointed out the 
possibility that a fine dispersion of precipitates may redissolve completely even when the 
raised temperature remains within the two-phase field, so long as the radii of the 
precipitates are less than the critical radius for nucleation of the alloy at the higher 
temperature. The critical radius is the value of r in eq. (58) at which C a (r) is equal to the 
alloy content, C 0 . A clear example of this effect is seen in Al-Mn alloys in which fine 
precipitates of Al 6 Mn formed at lower temperature are seen dissolve up on heating, even 
though the alloy is still supersaturated with the solute. The transformation is readily 
observed using electrical conductivity on samples quenched during processing. On 



Fig. 34. Dissolution of an isolated sphere (heavy solid line) and a plate (dashed line) by diffusion-controlled 
dissolution. The influence of the Gibbs-Thomson effect on the dissolution rate of small spheres is shown by 
the thin solid line. (After Aaron and Kotler [1971].) 
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Fig. 35. Distinction between growth and dissolution of a solute-rich /3 precipitate: the composition is C 0 at the 
interface and C 0 in the matrix a long way from the precipitate. (After Aaron and Kotler [1971].) 


heating, the conductivity rises as Mn is initially precipitated, however above about 800 K 
the conductivity starts to fall as heating continues — indicating net Mn dissolution. On 
halting the heating, and holding the sample at a constant temperature, the dissolution 
quickly ceases and precipitation resumes — confirming that the previous dissolution was 
taking place in a supersaturated solid solution. The low solute content of the precipitate, 
Cp = 1/7, greatly promotes the increase of Gibbs-Thomson solubility in this system — 
see eq. (58). This type of precipitate dissolution is exactly the effect predicted by Sargent 
and Purdy. An alternative type of reversion also occurs. Here the temperature is required 
to exceed the solvus of the metastable precipitate even though the temperature is below 
the solvus temperature of more stable precipitates. Reversion of copper-rich GP zones, 
formed by room temperature “natural” ageing of Al-Cu alloys, on heating to a temperat- 
ure, 450 K, at which the 9' phase subsequent precipitates during artificial age hardening 
is a well known example of this second type of reversion. The sample initially softens, 
as the zones dissolve, before rehardening with subsequent precipitation of O'. Reversion 
of Al-Li was used by Vasudevan and Doherty [1987], 

2.4. Competitive growth 

There are two subsequent stages to the free growth of isolated precipitates described 
in § 2.2. The first of these occurs when the diffusion fields around adjacent precipitates 
start to overlap, but while the matrix around the precipitate remains supersaturated with 
solute. This is briefly discussed in §2.4.1. Its essential feature is that, for almost all of 
the particles, the composition in the matrix exceeds the Gibbs-Thomson solubility, C a (r) 
given by (eq. 58). As a result almost all of the particles survive so that the density of 
precipitates (the number per unit volume — (V v ) either continues to increase if nucleation 
is occurring or remains essentially constant if nucleation is essentially complete. The 
second stage of competitive growth — usually called Ostwald ripening — occurs when 
a significant fraction of the nucleated precipitates have a Gibbs-Thomson solubility, 
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C a (r) greater than the average matrix composition, so that they then dissolve and the 
microstructure shows a decrease in the precipitate density, N v . Ostwald ripening is 
discussed in § 2.4.2. 

2.4.1. Growth from a supersaturated matrix after “soft” impingement 

The situation when the diffusion fields around adjacent precipitates start to overlap 
is a process usually described as soft impingement. With soft impingement, the con- 
centration away from a solute-rich precipitate falls below the initial alloy composition, 
C 0 , and the effective supersaturation, Cl, falls with time causing the rate of growth given 
by (eq. 43) to fall. For growth of solute depleted precipitates, fig. 20d, the solute in the 
matrix rises, but the value of the supersaturation falls. After soft impingement, not only 
will the growth rate fall but also the basic > 2 /t relationship will change; giving a larger 
time exponent than the value of 2 derived for independent precipitate growth. Care is 
needed in experimental studies of precipitation to distinguish a fall of the exponent 
caused by soft impingement, from an apparent fall in the exponent caused by measuring 
the mean radius, as l 2 / 1 . With true soft impingement, there is a failure in r max 2 /f where 
the size measured is that of the largest precipitates — ones that nucleated at the start of 
the reaction. Continued nucleation of precipitates, even if the younger precipitates still 
grow as > 2 /(t~t a ), will give a failure in the expected kinetics if these are determined 
from the average precipitate size. 

It is probably useful to think of the regime of growth, as distinct from the regime of 
nucleation, as being characterized by an almost constant density of precipitates (the 
number per unit volume, N v ). During the nucleation regime the density of precipitates 
rises and during competive coarsening § 2.4.2, the density falls. 

As described in various reviews of overall transformation kinetics, for example by 
Turnbull [1956], Fine [1964] and Christian [1970, 1975], the rate of the precipitation 
reaction will be given by eq. (22) during the growth period that includes soft impinge- 
ment but not during competive coarsening. The references cited here include a general 
discussion of what is now called the Johnson-Mehl, Avrami, Kolmogorov (JMAK) 
equation, eq. (22). In the general case that includes precipitation where the volume 
fraction of the product does not tend to 1, the equation gives the amount of new phase 
formed but normalized by the amount of the new phase present at equilibrium. This 
ensures that the predicted reaction parameter, f{x), tends to unity as the reaction is 
completed. The values of the exponent n are related, see table 3, to the particular 
nucleation and growth conditions. Table 3 was originally published by Christian [1970]. 
The time constant r, is determined from the nucleation and growth rates and the form of 
the equation, 1 - exp (-//r) n , comes from competition between the growth centers, either 
by “soft” impingement of diffusion fields or by “hard” impingement between growing 
grains. A full discussion of this topic has not been provided here, firstly since the earlier 
reviews are rather complete and, secondly, because (with a few exceptions) use of the 
overall reaction kinetics seems to have played little role in understanding the mechanisms 
of transformations. The failure arises since similar values of n can he obtained for 
different mechanisms. In the opinion of the present author, the direct determination of the 
separate nucleation and growth rates is of much greater value in understanding the 
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Table 3 

Value of n in kinetic law /= 1 -exp(-kf)- 


(a) Polymorphous changes, discontinuous precipitation, eutectoidal reactions, interface controlled growth, 
etc. 

Conditions 

n 

Increasing nucleation rate 

> 4 

Constant nucleation rate 

4 

Decreasing nucleation rate 

3-4 

Zero nucleation rate (saturation of point sites) 

3 

Grain-edge nucleation after saturation 

2 

Grain-boundary nucleation after saturation 

1 


(b) Diffusion-controlled growth (early stages of reaction only) 


Conditions 

n 

All shapes growing from small dimensions, increasing nucleation rate 

> 2 W2 

All shapes growing from small dimensions, constant nucleation rate 

2 W2 

All shapes growing from small dimensions, decreasing nucleation rate 

1 1/2 _2 1/2 

All shapes growing from small dimensions, zero nucleation rate 

1 ,/2 

Growth of particles of appreciable initial volume 

Needles and plates of finite long dimensions, small in comparison with their 

1-1 1/2 

separations 

1 

Thickening of long cylinders (needles), e.g. after complete edge impingement 

1 

Thickening of very large plates, e.g. after complete edge impingement 

1/2 

Segregation to dislocations (very early stage only) 

2/3 


reaction mechanisms than is use of eq. (22). Fine [1964] points out that the JMAK 
equation is, in fact, only an approximation to the expected kinetics for precipitation with 
soft impingement. However, the equation is of value in providing a simple picture of the 
kinetics of the overall process and in confirming that individual nucleation and growth 
rates have been correctly measured. A nice illustration of the use of the equation is its 
application in biology to the two-dimensional spread of colonies of lichen on rocks or of 
oysters in freshly established oyster beds. The equation is also commonly used in 
describing the rates of recrystallization in deformed materials (chapter 28, § 3.2) and also 
in studies of isothermal crystallization of glasses particularly of metallic glasses (chapter 
19, §4.3). All these cases are for “hard” impingement between growing colonies or 
grains for which the assumptions of the JMAK equation are valid. Rollett etal. [1989] 
provide a recent review of the JMAK equation and the conditions needed for its 
applicability. They showed that one of the critical features required for the JMAK 
equation to be valid, is the spatial uniformity of the driving force for the reaction. When 
the driving force varies, for example between different grains in a deformed material, the 
impingement correction that yields the JMAK equation is no longer valid. This is, 
however, not likely to be a problem in structural changes driven by chemical free energy 
changes which should not vary from grain to grain. 
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2.4.2. Competitive coarsening: Ostwald ripening 

Following the soft impingement of diffusion fields, the precipitates continue to grow, 
draining the remaining solute from the matrix and approaching the equilibrium volume 
fraction of the new phase. The approach to the equilibrium volume fraction does not 
mark the end of the reaction, however, since the microstructure with a distribution of fine 
precipitates has a high area of interface, and this raises the free energy of the system. 
The process of reduction of this interfacial energy involves the growth of larger 
precipitates at the expense of smaller ones. In a similar way, after primary recrystal- 
lization of deformed metals the resulting grain structure usually has a significant amount 
of grain-boundary area that drives the process of grain growth. 

The process of competitive precipitate growth in a near-equilibrium matrix com- 
position is commonly described as Ostwald ripening after the discoverer of the same 
process for salt crystals precipitating from aqueous solutions (Ostwald [1900]). The 
mechanism of this process is provided by the Gibbs-Thomson equation, eq. (58). This 
equation shows that, with a range of precipitates sizes, the solubility will be least with 
the large precipitates and highest with the small precipitates. This difference in solubility 
produces a solute gradient from small precipitates to large ones and diffusion of solute 
between the precipitates then causes the smallest precipitates to shrink and disappear. The 
solute from the shrinking precipitates feeds the larger ones and so, for both reasons the 
mean precipitate size rises. The rate of growth is described by fig. 30 but with the 
critical radius of nucleation now replaced by the mean radius, r; r=r*. 

The topic has been reviewed by for example Martin and Doherty [1976], by 
Martin etal. [1995] and the topic is also discussed in chapter 9, §4.2.2, so full details 
of the process will not be presented here. For diffusion-controlled growth the classic 
(LSW) theory of the process due to Wagner [1961] and Lifshitz and Slyozov [1961], 
predicts that, even starting from a narrow size distribution, the ripening process will yield 
a steady-state distribution of particle sizes. After achieving this steady state size 
distribution, the mean precipitate radius will coarsen from an initial value, r^, to 7 l with 
time, t, as: 


3 -38 aDC a V m t 

' 0 9 RT 


( 66 ) 


It should be noted that this commonly quoted equation was derived for a particular set 
of assumptions. These include that the solubility in the matrix, C a , is given as atomic 
fraction, that both matrix and precipitate are terminal and dilute solid solutions, so 
C„ — » 0 and C p — » 1 , and that the matrix has a constant activity coefficient, y, or 
equivalently has a second differential of the free energy with composition of the matrix, 
fj, which is that of an ideal solution. In addition the volume fraction of the precipitates 
is assumed to be vanishingly small. The general expression, see for example Calderon 
etal. [1994] or Martin etal. [1996], is: 
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The changes in eq. (66a) can be very significant. This influence of having a value of the 
precipitate atomic fraction, C^, less than 1 is particularly important. If the precipitate is 
an intermetallic phase A n B in A then this leads to a predicted increase of the coarsening 
rate by approximately (n+ 1) 2 . For Ni 3 Al in Ni this is an increase of 16 and for Al 6 Mn 
in A1 the increase is almost 50 times! 

The general result of eq. 66 has been extensively tested both qualitatively and 
quantitatively many times and, in general, the model appears to be very successful, as 
shown for example by Ardell [1987 and 1990], Xiao and Haasen [1991] and Martin 
etal. [1995]. However, there remain a number of problems. The main problem is that 
although almost all tests on spherical and near-spherical precipitates give the expected 
qVf relationship, the observed rates of coarsening, the gradients of the r t 3 /r plots, are 
usually larger than expected. In addition, there is a corresponding somewhat wider size 
distribution than predicted by the LSW analysis. A further point is that the simple 
diffusion theory [eq. (60) and fig. 30] predicts that the fastest growth rate will be for 
particles twice the size of the average, yet the LSW-predicted spread of particle sizes for 
diffusion-limited growth cuts off at a maximum particle size of only J.5r. It is not 
immediately obvious why particles are theoretically limited to this size, since fig. 30 
suggests that after extended growth the maximum precipitate size is likely to be 2r, 
Greenwood [1956]. Experimental size distributions, even after very long periods of 
coarsening, do indeed show precipitates up to 2r in size, see for example fig. 36 from 
Bower and Whiteman [1969]. That is, the experimental results almost appear to fit 
better with the simple analysis than with the rigorous LSW analysis! It seems likely that 



r/r 

Fig. 36. Experimental results for the distribution of precipitate reduced radii ( r/r) against reduced radius after 
extensive diffusion-controlled coarsening of Fe 3 Si in Fe at 700°C, aged 312 h. The dotted line shows the 
expected distribution in the limit of a low volume fraction (/— > 0), (After Bowman and Whiteman [1969].) 
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the origin of the effect arises from the fact during the initial precipitation reaction the 
particles acquire a wider size distribution than the predicted steady-state size distribution 
of coarsening. Not all precipitates nucleate at the same time, nor since the sites of 
nucleation are not ordered in space do the precipitates have equal volumes of super- 
saturated matrix to draw the solute from. So the initial size distribution will usually be 
rather wide. 

The rates of change of initial particle size distributions with coarsening have been 
examined (i) by computer simulations, for example by Voorhees and Glicksman 
[1984c] and Brown [1989, 1990a and b, 1992a and b], (ii) analysed theoretically by 
Hillert etal. [1989 and 1992] and (iii) experimentally studied by Fang and Patterson 
[1993]. It seems established that, in the final limit of very extensive coarsening, the LSW 
size distribution is the only distribution that can, mathematically, maintain a steady state 
form (see for example Hillert et al. [1989], Mullins and Vinals [1989] and Hoyt 
[1990]). Brown [1989] found, by numerical simulation of coarsening, that particles 
initially given a wide size distribution maintained a significant number of particles in the 
size range above 1.5 r for extended times. Voorhees and Glicksman [1984c] also noted 
a similar effect. Fang and Patterson [1993] measured the effect of different initial 
particle size distributions on coarsening. They used tungsten powders in liquid nickel- 
iron matrix, albeit at a rather high volume fraction of solid particles (f s = 0.6) see below. 
They found, as might be expected, that the rate of coarsening was controlled by the size 
distribution (fig. 37). Narrow distributions of particles showed slow coarsening and wide 
distributions coarsened more quickly. The size distributions changed rapidly at the earliest 
times, as shown by changes in the natural logarithm of the standard deviation (fig. 38). 
During the time interval when the size distributions were changing rapidly, the coarsen- 
ing equation did not show the expected cube law for diffusion limited coarsening 
(fig. 37b). The kinetics however quickly achieved the expected cube-law after rather 
short periods of non steady-state coarsening. However, the distributions, particularly the 
wide distribution, did not reached a really steady-state form even after extensive 
coarsening. During the period when the distributions were changing only slowly, 
however, the cube law fitted the data to a rather high precision. This result is important 
in showing that a good experimental fit to the cube law does not guarantee that the 
kinetic coefficient will have the value expected for true steady-state coarsening. 

Another problem with the LSW analysis is that it was developed for point particles, 
that is, for volume fractions of second phase that are close to zero, so that, in the LSW 
analysis, individual particles are only aware of the average solute content, C=C a (r), and 
they are unaware of the local concentrations introduced by neighbouring particles. 
Ardell [1972] introduced an initial approximate, but physically reasonable, correction 
for the effect of finite volume fractions and his initial analysis showed that when the 
volume fraction of second phase was 0.25, the rate of coarsening implied by eq. (66) was 
expected to increase tenfold. Subsequent analyses such as that Brailsford and Wyn- 
blatt [1979] suggest that the acceleration of coarsening with volume fraction that 
though still significant is much smaller than predicted by Ardell. The computer simulat- 
ions of Voorhees and Glicksman [1984c] suggest only a doubling of the coarsening 
rate at a volume fraction of 0.25 and a fourfold increase at a volume fraction of 0.5. For 
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Fig. 37. Plots of r 3 -r„ 3 against coarsening time, experimental results for W-14Ni-6Fe showing the effect of the 
initial width of the particle size distribution on the rate of coarsening, (a) The entire coarsening period (b) the 
initial coarsening period. After Fang and Patterson [1993], 


several sytems of solid particles of transition metals in liquid copper as reviewed by 
Annavarapu and Doherty [1995] and for two solid/solid systems, Al 3 Li in A1 and Co 
in Cu reviewed by Ardell [1987, 1990] showed clear evidence of a significant increase 
in coarsening rate as the volume fraction increased. For the solid-solid cases, corres- 
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Fig. 38. Experimentally measured change of the standard deviation of the log normal distribution. In w, during 
coarsening of W particles in the W-14Ni-6Fe alloy. After Fang and Patterson [1993], 


ponding increases in the apparently steady state width of particle size distribution was 
also found. However Ardell’s reviews clearly found that although there were considerable 
differences in the rate constants found by different investigators for the system of Ni 3 Al 
in Ni there was almost no evidence, in any of the studies, of any acceleration of the 
coarsening rate with increase of the volume fraction of Ni 3 AI in Ni. The reasons for both 
the scatter in the data and for the lack of any variation in coarsening rate with volume 
fraction are not at present clear. The recent, very detailed study by Xiao and Haasen 
[1991] on coarsening kinetics in a single alloy of Ni-12at% A1 is worthy of note. They 
obtained, from careful measurements of precipitate sizes, precipitate volume fraction and 
the precipitate densities, data that, as discussed in §2.1.5, matched classical nucleation 
theory and also fitted the classical LSW coarsening theory (as modified by Ardell 
[1990]) very closely both as regards the coarsening kinetics, the growth of volume 
fraction precipitated and the decrease in precipitate density. These agreements, described 
below, were obtained despite a significant volume fraction of the precipitates in the system 
(about 15%) and of course the LSW theory was developed for negligable volume fractions. 
So for the Ni 3 Al/Ni system the volume fraction effect found in other systems is not apparently 
occurring. This is a major problem in current phase transformation studies. 
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Xiao and Haasen [1991] made two independent determinations of the interfacial 
energy, <x, and the operating diffusion coefficient D. cr was found to be 0.014 J/nT 2 
while D was found to be significantly larger (4 times) than expected from diffusion 
studied by more conventional means, the values of a and D determined from the various 
coarsening tests matched the values obtained from nucleation data astonishingly well — 
§2.1.5. 

The expected variation in volume fraction, f(t), with time is given by eq. (66b) 

= (66b) 

9 r m «- f 

4 R 2 T 2 C a D) 
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t l is the time (45 h in this experiment) at which “late stage” coarsening starts. The 
condition for late-stage coarsening is that C — > C a (r ). That is, when the only remaining 
supersaturation arises from the increase in solubility due to the interfacial energy, (f-f,) 
was the time used in eq. (66a) to obtain the slope of the TV time plot. 

The expected variation for the number density as a function of time, again (r-r')> 
is given by: 

N y (t) = k 3 (t-t l T l -k 6 (t-t l )-* n (66c) 

The rate constants, & 3 and k 6 , are given by: 
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The first term in eq. (66c) is that from the original LSW model and did not include the 
change in number density arising from the increase in volume fraction of precipitates 
arising from the decrease in solubility of the precipitates with time as the particles 
coarsen — that is from the Gibbs-Thomson effect. The second term in eq. (66c) is the 
correction, derived by Ardell [1987, 1990], that includes this effect. 

The start of late-stage coarsening, r, = 45 h, was close to the time (40 h) at which the 
maximum value of N v (t) was found, that is when the net rate of nucleation, that is the 
rate of nucleation less the rate of particle dissolution, reached zero. At this time, the 
diffusion distance (Dr) 1/2 was about 15 nm, three times larger than the mean spacing, 
A ~ (AT v (r))“ 1/3 , between the precipitates. The very high density of precipitates achieved in 
this system with a very low interfacial energy suggests that precipitates will interact 
diffusionally from early in the process — nucleation can only continue in regions with 
a locally lower density of precipitates while coarsening is occurring in regions with a 




Ch. 15, §2 


Diffusive phase transformations 


1443 


locally higher density of precipitates. Two results appear to follow directly from this 
result. The first result is that late-stage coarsening will be expected to follow soon after 
the maximum density of precipitates was seen. The second result is that only a very 
limited period of free diffusional growth with a near constant density of precipitates 
should be expected. Both of these results were reported in the study. The number density 
showed a continuous change with the density increasing to a maximum and then falling 
again with no significant period of near constant density. It was reported that at no time 
was the expected relationship for free growth of a constant density of precipitates, r 2 
increasing as t , found. As noted, in §2.4.1, such kinetics would not be expected if 
nucleation was continuing. For a high rate of homogeneous nucleation such as was found 
in the system even r max 2 might not be expected to increase linearly with t. With a very 
high rate of nucleation from the alloy composition, isolated precipitates surrounded by 
matrix at the alloy composition will not be found — the matrix under these circumstan- 
ces will be nucleating more precipitates — thereby depleting the solute. Xiao and Haasen 
reported that during the period of nucleation there was only a small increase in the mean 
precipitate radius — see similar earlier results shown in fig. 43 in the next section. It was 
also reported that after extensive coarsening from 45 to 256 h, during which time It 3 
increased tenfold, the size distribution did approached the LSW form with the maximum 
radius of 1.4 r. This match with the expected LSW distribution was found despite the 
observations being made in a system with a significant, equilibrium, volume fraction of 
precipitates /=0. 12. 

For a full discussion of the various other theoretical modifications to the initial LSW 
theory, to take account of both interface immobility, and alternative diffusion paths along 
grain-boundaries dislocations, etc., the reviews by Martin and Doherty [1976] or 
Martin et al. [1995] can be consulted. Developments in both the theory and the 
experiments of coarsening are also discussed in chapter 9. One recent development due 
to Yang etal. [1992], Higgins etal. [1992] and Annavarapu and Doherty [1995] 
should however be noted. This is coarsening by coalescence of particles at grain 
boundaries caused by particle drag. Particles at grain-boundary triple points and at grain 
comers can remove grain-boundary curvature and thus halt grain growth but the particles 
then experience a net unbalanced force. For particles on small grains this leads to the 
migration towards each other of the particles, causing both the loss of the smallest grain 
and the coalescence of the set of particles to a single larger particle. The rate determining 
step in this process is solute diffusion within the particles rather than between then so for 
small volume fractions of particles this leads to a rapid acceleration of the coarsening 
rate. Yang etal. [1992] showed this model gave good agreement with their experiments 
in a solid solid system (Ni-Ag) and Annavarapu and Doherty [1995] were able to use 
a modification of their model to account successfully for the rapid acceleration of coar- 
sening in high purity solid-liquid mixtures as the volume fraction of solid approached 1 . 

2.4.3. During initial nucleation and growth 

It was initially pointed out by Kampmann and Kahlweit [1967, 1970] that 
competitive dissolution of small precipitates will not be restricted to periods after 
precipitation was complete. This is because during all stages of precipitation the average 
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solute content of the matrix will be falling steadily owing to nucleation and growth of 
precipitates — for example as measured by the electrical resistivity studies of cobalt 
precipitation in Cu-Co alloys (Servi and Turnbull [1966]). The fall in solute content 
causes the critical radius of nucleation, r*, to rise, and any precipitate whose radius falls 
below this growing size will redissolve. The process is shown schematically in fig. 39. 
The alloy is steadily cooled below the solubility limit, becoming supersaturated at t = t s . 
The supersaturation, S, here defined as as @/C„, first rises with time (because C a is 
falling as the temperature falls) so r* initially falls with time. The first nucleus forms at 
when S is still rising and r* falling; When, at the last nucleus forms S is then 
falling and consequently, r*(t) is rising. The smallest precipitates, whose growth rate is 
less than dr*/df, then have too high a solubility and will dissolve up even though 
precipitation, measured by a fall in S, is still continuing. The critical result is not that 
coarsening, as measured by a fall in precipitate density, is occurring while the 
supersaturation, S = ©/C„, is falling. The significant feature is that there can be a fall of the 
precipitate density even though © > C a (r). There is always a finite Gibbs-Thomson 
supersaturation, S 0T =C a (r )/C a , for precipitates of a finite mean radius. With @ = C a (r) 
there is still precipitation occurring given by eq. (66b) above. 

Numerical simulations of combined precipitation and coarsening were published by 
Kampmann and Kahlweit [1970], assuming diffusion-controlled growth and dis- 
solution, using homogeneous nucleation theory and assuming that the supersaturation 
builds up steadily with time. Figure 40 shows the result of one set of calculations with 
fig. 40a showing the change of supersaturation, S; curve 1 is the change of S that would 
occur in the absence of precipitation and curve 2 the actual change of supersaturation 
that is determined by the nucleation and growth rates. The total number of precipitates 
larger than the critical size is seen in fig. 40b as the number of precipitates, Z/N v 
normalized by dividing by the final solute atomic concentration. The number of 
precipitates rises during the precipitation reaction to a maximum value and then starts to 



Fig. 39. Competitive coarsening during precipitation. An alloy is quenched becoming supersaturated at / = /,. 
The first nucleation occurs at /' and the last f . As the supersaturation, 5 = C/C a , falls, r*(t) rises and small 
precipitates redissolve. (After Kampmann and Kahlweit [1970].) 
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Fig. 40. Numerical solution to competitive coarsening: (a) the variation in supersaturation S with time 8, curve 
1 with no precipitation, curve 2 with precipitation; (b) the variation in the density of the precipitates Z, scaled 
with the equilibrium solute content N , (r = °°). (After Kampmann and Kahlweit [1970].) 


fall while the supersaturation is still large, approximately 4. Figure 41 shows the 
computed change of /*, the critical precipitate size in terms of the number of atoms of 
solute, during coarsening; this is compared to the “steady-state” loss of precipitates 
predicted by an analysis equivalent to the LSW analysis described in § 2.4. 1 , shown as 
the dashed line. Kampmann and Kahlweit [1970] suggested that the discrepancy is 
probably within the experimental error of the numerical analysis used. The dimensionless 
time parameter, 6, scales with the diffusion coefficient, so that with the diffusion 
coefficient expected for liquids D= 10“ 9 m 2 /s, 6-10 corresponds to only 10 -6 s, but with 
a substitutional diffusion coefficient of £>=10 -16 m 2 /s this would be 100 s. It is clear 
from this model, therefore, that the number of growing precipitates found in the later 
stages of precipitation should be considerably smaller than the original density of 
growing nuclei, for example in the studies of the nucleation kinetics in the experiments 



Fig. 41. Competitive coarsening during precipitation: (a) the variation in the critical size i* with coarsening 
time, where i* is the number of atoms in the critical cluster; (b) the rate of change of the critical size with time. 
(After Kampmann and Kahlweit [1970].) 
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of Servi and Turnbull discussed in §2.1.5. Langer and Schwartz [1980] analytically 
evaluated the effect of interfacial energy driven coarsening on precipitate densities. The 
results reported show similar trends to those of the numerical models described here but 
no detailed comparison between the evaluations was made. 

There are several experimental studies that have been made to test the idea of 
competive precipitate dissolution during precipitation in the Ni-Ni 3 Al system. These 
experiments used conventional transmission electron microscopy (TEM), Kirkwood 
[1970] and Hirata and Kirkwood [1977], field ion microscopy/atom probe techniques, 
Wendt [1981] and recently high resolution TEM, Xiao and Haasen [1991], The last 
study was discussed in §2.4.1 above. In all the studies a fall of at least an order on 
magnitude in precipitate density was found while the precipitation was continuing. The 
early results of Wendt [1981] are shown in figs. 42 and 43. The supersaturation, 
A C=@-C a , fell with time and. approached the initially expected f Ui kinetics of eq. 66b 
after about 50 minutes. The initial number density fell approximately linearly after this 
time. However this is not the correct kinetics which should be those of eq. (66c). Figure 
44 from the recent study of Xiao and Haasen [1991], shows that the product of (t-t { ) 
and N v (t) was not constant but fell linearly with (t-t l )~ ]/3 , exactly as predicted by 
eq. (66d), obtained from eq. (66c), 

N v m-t t ) = k 3 -k 6 (t-t i y m . (66d) 

The recent studies of Xiao and Haasen [1991], described above, show very clearly how 
both the nucleation and coarsening processes can strongly overlap — especially with very 
high densities of precipitates. It seems likely that this will be a general result for systems 
showing essentially homogeneous nucleation. For the reasons discussed in §2.4.1, any 
system with a high rate of homogeneous nucleation will very quickly achieve soft 
impingement — that is when the spacing between precipitate, approximately ( N v )~ l/ 3 , 
becomes smaller than the diffusion distance, ( Dt) l/ 2 . So free growth with (r nHI ) 2 increas- 
ing with time is unlikely to be seen. The only conditions where free growth with (r max ) 2 
increasing with time should be expected is for systems showing heterogeneous nucleation 
so that not only are the precipitates widely spaced, allowing (N V )~ I/3 » ( Dt) u 2 , but on a 
microstructural level the full supersaturation is expected to persist in regions of the 
matrix between the effective heterogeneous nucleation sites. Homogeneous nucleation will 
occur there at negligable rates at supersaturations where heterogeneous nucleation and 
growth dominates. Kampmann and Wagner [1984] and Haasen and Wagner [1992] 
show this result rather clearly through the Kampmann and Wagner “numerical” model of 
combined nucleation, growth and coarsening at high precipitate densities. In support of 
this idea is the observation that experimental studies showing (a^ff) 2 increasing with time 
appear to be only found in studies for heterogeneously nucleated precipitates, see, for 
example, the results discussed in § 2.2.3 for plate-like precipitates, where the precipitate 
and matrix have different structures. Homogeneously nucleated precipitates seem limited 
to systems where the interfacial energy is low and isotropic giving very high densities 
so preventing free growth from occurring at the undercoolings usually studied. If the den- 
sity of precipitates is greatly reduced then the free growth of spherical precipitates with 
('max) 2 increasing with time is still not seen since under these circumstances the spheres 
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Fig. 42. Precipitation in a Ni-14 at% A1 alloy, aged at 550°C: (a) super-saturation, Ac, versus ageing time, t A 
(from right to left); /is the precipitated volume fraction; (b) particle density, iV v , versus ageing time, compared 
with the theoretical slope (solid line). (From Wendt [1981].) 

become unstable and grow as dendrites with a constant tip growth velocity as discussed 
in § 2.2.6. Examples of homogeneously nucleated dendrites are reported in the Ni 3 Al-Ni 
system (Ricks et al. [1993]) and in the similar Al 3 Li-Al system (Baumann and 
Williams [1985]). An example of a system showing heterogeneously nucleated dendrites 
is the -y-brass//3 -brass system (Malcolm and Purdy [1967] and Doherty [1982]). 
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Fig. 43. Measured mean particle radius, r, and calculated radius of critical nuclei, R*, versus ageing time for 
the alloy of fig. 37 (from Wendt [1981]). 

2.4.4. Coarsening of Widmanstatten precipitates 

One of the early studies of precipitate coarsening was of the growth of the needle-like 
precipitates of fee copper in bcc iron (Speich and Oriani [1965]). Despite the anisotrop- 
ic shape, arising from the good atonic fit along the close-packed directions <110> fee 
and <11 1> bcc, the precipitates showed apparent diffusion limited coarsening with a 
constant precipitate aspect ratio of about 4. The authors used a modified version of the 
Ham [1958, 1959] analysis to model the diffusion controlled growth. They obtained 



Fig. 44. Experimental values of the product of (/— r,)W v (/) as a function of (r-t,)' 1 ' 3 for coarsening of Ni-12 at% 
AI at 773K, eqs. (66c and d). After Xiao and Haasen [1991], 
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reasonable agreement with their experimental results. Ferrante and Doherty [1979], 
as part of an investigation of the role of ledged interfaces in the growth of plate-like 
precipitates in Al-Ag, studied the coarsening of the precipitates after the completion of 
the initial precipitation. The plate-like precipitates in this system had much larger aspect 
ratios than expected for the equilibrium shapes: A> 100 while A eq = 3 (Aaronson etal. 
[1968]). Ferrante and Doherty showed that with the modified Gibbs-Thomson 

equations for the solubility at the incoherent rim, C r , and that at the semicoherent facet 
plane, C f , are given by: 
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Here, as in § 2.2.2, a^ is the half length (equivalent to the radius) of a cylindrical, plate- 
like precipitate and cr r is the interfacial energy of the incoherent rim of the cylindrical 
precipitate. Despite the difference in solubility between the rim and the facets of the 
precipitate, with A»A eq , the plates showed a reluctance to equilibrate towards the 
expected equilibrium shape, even with very extended coarsening times. Rajab and 
Doherty [1989] reported similar behaviour in a further study of the same system. The 
conclusion reached by both studies was that this reluctance of plate-like precipitates in 
the Al-Ag 2 Al system to change shape was due to the thickening reaction under the 
supersaturation arising from C f < C p being inhibited by a lack of growth ledges. An 
unexpected result from the initial coarsening study in 1979 with these out-of-shape Ag 2 Al 
precipitates was that despite inhibited thickening, the coarsening reaction, measured by 
the rate of lengthening, was very fast. However, further analysis of the results by use of 
the Ham, Horvay and Cahn analysis, supplemented by use of eqs. (67a and 67b), showed 
that the intuitive expectation, that inhibited thickening would slow coarsening, had been 
wrong. With inhibited thickening, so that A remains » A^, there is a much higher driving 
force for coarsening, as the only means of ridding the system of the excess interfacial 
energy, eq. (67a). Under the highly approximate condition of a constant aspect ratio, the 
analysis by Ferrante and Doherty [1979] yielded the following prediction for the rate 
of growth of the mean half-length of the precipitate: 
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Eq. (68) gave surprisingly close agreement with the experimental results, despite the fact 
that while the mean aspect ratio remained constant, the individual values of A were found 
to show a strong correlation with precipitate size. The correlation between A and a y 
indicated that there was a mean solute concentration determined by the Gibbs-Thomson 
solubility at the mobile ends or rims of the plates, eq. (67a). In the initial study by 
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Ferrante and Doherty a fine microstructure had been produced by low temperature ageing 
followed by coarsening at a higher temperature. In the later study by Rajab and 
Doherty [1989] coarsening observations were carried out isothermally at both high and 
low temperatures by first precipitating and then coarsening Ag 2 Al at the same temperat- 
ure. Under these circumstances a slow decrease of the mean aspect ratio was found, but 
at rates very much slower than expected for diffusion-controlled transport from the rims 
of the precipitates to the facets of the same precipitate. Again a lack of thickening ledges 
appeared to be the reason — despite the observation that precipitate/precipitate intersec- 
tions were still present. These intersections had been found to be effective sources of 
growth ledges under the larger supersaturations of the initial precipitation. That is, the 
supersaturations found during growth appeared to be sufficient to support nucleation of 
growth ledges while the smaller supersaturations found in coarsening appeared to be 
insufficient to allow a sufficiently density of ledges to form. 

Detailed investigations of the coarsening of plate-like precipitates with A»A eq were 
provided by Merle and Fouquet [1981] and Merle and Merlin [1981], They studied 
0' plates in Al-Cu and found, in that system, that the precipitates formed with high and 
variable aspect ratios. As in the study by Rajab and Doherty [1989] in Al-Ag, the 
highest aspect ratios of the 0' plates were found for those conditions with the lowest 
precipitate densities. During coarsening the aspect ratio fell towards what appears to be 
the equilibrium value of A^ ~ 20. The rate of fall was fastest in those microstructures 
which had the highest precipitate density, see fig. 45. The implication of these results is 
clearly that the highest precipitate densities lead to the greatest chance of precipitate 
intersection and thus a source of growth ledges. Merle and Merlin [1981] and 
subsequently Merle and Doherty [1982] showed that the kinetics of the change of 
shape could be described by a ledge model with solute transport at 225 °C by interface 
diffusion around the incoherent edge of the precipitate. However, the agreement required 
the experimental measurement of ledge spacing and as noted in the discussion of growth 
of ledged precipitates § 2.2.4. there remains no way, at present, of predicting the ledge 
spacings, owing to the lack of a successful model for ledge nucleation. As noted by 
Doherty and Rajab [1989] the inhibition of thickening during coarsening was more 
marked in Al-Ag than in Al-Cu for reasons that were not known. It would be good to 



Fig. 45. The variation of the mean aspect ratio of the plate-like 6' precipitates in Al-Cu at 225°C. T.I. is an 
interrupted quench from the solution-treatment temperature, R is a room-temperature quench followed by the 
225°C age, and e5 is a 5% room-temperature strain prior to ageing at 225°C. 3%, 4 % and 5% are the Cu 
contents. Note that A falls faster with the highest density of precipitates. (After Merle and Fouquet [1981].) 
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have more studies of the coarsening of plate like precipitates since these have the most 
common precipitate shape in successful precipitation hardening systems! 

2.5. Discontinuous reactions: moving two-phase boundary MTPB reactions 

The reactions discussed in this section are all those occurring at a moving interface, 
giving a two-phase, usually lamellar or rod-like, product. The moving interface may be 
an interphase boundary, as in eutectic and eutectoidal decomposition, or it may be a 
matrix grain boundary as in the precipitation reaction conventionally described as 
discontinuous precipitation (Gust [1979]). The moving interface is usually incoherent. 
This name, as previously noted in § 1 .4, is an unfortunate one since the general name for 
the second class of transformations considered by Gibbs, those that do not involve 
nucleation and growth, is continuous transformations (Soffa and Laughlin [1983]). The 
term “discontinuous” should then apply to all Gibbs type-I, nucleation-and-growth, 
transformations and not just the subset of them that occur at a moving boundary giving 
a two phase product. (Or a two phase product reacting at a moving interface to give a 
single phase product as in discontinuous dissolution.) The term “discontinuous transfor- 
mations” will be used here in the narrow meaning even though an alternative term, such 
as “moving two-phase boundary” (MTPB) reactions would be clearer. This review is not 
the place to make this unilateral change of notation, but care has been taken to avoid the 
expression “continuous precipitation” for reactions that either are nucleated entirely 
within a matrix grain or for those that are, nucleated at grain boundaries and then grow 
away from the boundary, for example Widmanstatten ferrite. The term continuous 
transformations will be restricted to reactions which do not involve nucleation and 
growth (§ 3). 

2.5.1. Eutectoidal decomposition 

In eutectoidal decomposition, the general reaction involves the change y —>a +/3. The 
most important example is the “pearlite” reaction, the decomposition of austenite, y, in 
an iron-carbon alloy containing 0.8wt% C, to give ferrite, a, and cementite, Fe 3 C: 
y -» a + Fe 3 C. The name pearlite comes from the similarity between the optical effect 
produced by light diffraction from the lamellar spacing (which is comparable wavelength 
of visible light) of the two-phase eutectoid and the same optical effect produced by 
“mother of pearl”, a biological composite material. 

The process can be understood from the schematic microstructure of fig. 46, for the 
transformation occurring isothermally at an undercooling AT below the eutectoid 
temperature of fig. 47. The extrapolations in the phase diagram, in fig. 47, give the 
compositions of the y in equilibrium with a and /3 as C y (a) and C y 03); the equivalent 
values in the a phase are C a (y) and CJfi). If the main diffusion path is along the 
moving boundary, as it often is metal-metal eutectoids such as Cu-Al, then the com- 
positions in the boundary are given as k b ay C y (a) and k^ y C y (fi) where k b y and k|j y are the 
partition coefficients for solute between the appropriate phase boundary and the 
decomposing austenite. Since the separate values of the different coefficients are rarely 
known, an average value, k y , is often used. 


References: p. 1497. 



1452 


R. D. Doherty 


Ch. 15, §2 


/£ZZ 


Ht 


777m) Crip) 

cM 


T \ 


C a (T) 

l 

Ca(Q! 

bim/Tf/T 


C T (o) v 
/ 

CrO) 


TV 


Fig. 46. Growth of an a +/3 lamellar structure, showing the variation in composition, C y (a), C y (/3) in the 
y-phase, and C y (y), C„(/3) in the a-phase, that drive the reaction. 


From figs. 46 to 48 it can be seen that there is a composition difference A C y within 
the y-phase, A C y = C y (a) - C y (/3), and an equivalent composition differences A C a in the 
a-phase and A C b in the boundary. In the initial treatment of the diffusional problem 
given by Zener [1946], it was assumed that diffusion took place down a solute gradient 
between the centers of adjacent lamellae. The mean gradient, dC/dy, is approximately 
2AC y /A s in the austenite, where A s is the lamellar spacing. The solute diffusion was 
assumed to occur within a thickness of approximately A/2 away from the interface, 
where the gradient is assumed to have the mean value, with no gradient further away 
from the interface. By a mass balance of the diffusing solute needed for the reaction, the 
following relationship between the growth velocity, v, and the spacing was obtained: 


2P y AC y |_ ]_ 

fJ P 'Cj-C.V 


(69) 


where f a and fp are the relative widths of the two phases, assuming that the molar 
volumes are the same in both phases (Hillert [1972]). Eq. (69) fails to take into 
account, however, the interfacial energy, cr a/3 , of the interface between the product 



Fig. 47. The equilibrium diagram showing the origin of the composition differences seen in fig. 46. 
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Fig. 48. The free-energy-composition diagram that gives the diagram seen in fig. 41. The composition 
difference AC, is reduced to AC y (A s ) by the rise in free energy per mole of a 4-/3, Tcr^VJ A s , caused by the 
lamellar spacing. 


phases.* The origin of the lamellar equivalent to the Gibbs-Thomson effect is shown 
in fig. 47. The interfacial energy per unit volume of the lamellar product is 2cr a/3 /\ s and 
so the two-phase product is raised in molar free energy by 2 <r a/3 V m /A s (J/mole -1 ) with a 
consequent reduction of A C y to A C y (A s ). If the critical spacing that reduces A C y to zero, 
A s (rev), is included, eq. (69) can be changed to give the velocity corrected for the 
interfacial energy effects. The description of A s (rev) is used since at that spacing, both 
the two-phase product with this spacing and the high-temperature austenite phase have 
the same free energy and so the reaction would be reversible at A s =A s (rev). With A C y 
changed to AC y (A s ), the velocity becomes: 


2D y AC y J_ f j _ A s (rev) 

fJp{Cp - c„) K { K 


(70) 


In deriving these diffusional equations the compositions are used in units of atoms per 
unit volume. However since the composition terms occur as ACV AC", use of atomic 
fractions will introduce only very small errors arising from small changes in the atomic 
volumes. 

In the iron-carbon system, the diffusion coefficient in the a phase product is 
significantly faster than in the close-packed austenite, y, so diffusion in a, is an 
alternative pathway which can be described by appropriate modification to eq. (70). For 
solute transport by diffusion in the interface the analysis by Turnbull (1955] showed 


* In the simple derivation of this Zener model, it is easiest, as here, to use the af} interface. For the point of 
changing the concentrations at the moving interfaces the energies of the ay and a/3 interfaces are critical. The 
parameters are interrelated by the triple-point angles where the three phases meet, see Jackson and Hunt 
[1966] and Kurz and Fisher [1989], 
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that not only must the compositions and the diffusion coefficient be changed but the 
diffusion thickness is reduced from A/2 to the boundary thickness, <5 b , so the resultant 
equation is: 


v{b) = 


SPb^AC/b 1 r A, (rev) 
fafefa ~ C a ) % l K j 


(70a) 


Hillert [1969, 1972, 1982] has discussed these equations and the more rigorous 
solutions to the diffusion equations in these various cases. He also considers the 
possibilities that there may be some significant supersaturation left in the product phases 
and that some free energy, or equivalently composition difference, may be required to 
drive the interface. That is, the reaction front may move faster than the rate required to 
achieved full equilibrium and that there may some element of interface control. For full 
analytical details and analysis of experimental results the reviews by Hillert, quoted 
above, should be consulted. 

It is apparent from eqs. (70) and (70a) that the problem is not fully specified, even 
in the simple treatments, until some means of determining the operating lamellar spacing 
can be found. Experiment shows that there is a constant spacing at an given undercooling 
and that this spacing decreases as the undercooling increases and consequently A s (rev) 
decreases. For a eutectoid reaction with a molar entropy change A S m , at an undercooling 
AT below the eutectoid temperature, A s (rev) is given by: 


A, (rev) = 


AS m AT 


(71) 


A discussion of the choice of spacing is given in §2.5.4. For the usually adopted 
assumption of a maximum velocity, we find A s = 2A s (rev). This gives the usual express- 
ion for volume diffusion control of: 


DyACy 1 
faftfa ~ C„) K 
DyACy 1 

uA c * - c °) 2A ‘ (rev) 


There is a simple relationship between AC y and undercooling, AT: 


AC, 


AT 


m a +m l3 
m a m p 


(70b) 


Here m a and are the slopes of the y solubility with temperature m a = dT/dC y (a)). So 
combining these equations together we find the often quoted form of this valid for all 
undercoolings: 






- 



(70c) 
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The equivalent equation derived by Jackson and Hunt [1966] has the same form as this 
but is expressed in terms of the interfacial energies, cr ay and cr fjy , and the triple point 
angles, of the moving interfaces with the parent y matrix. 

The growth velocity, eq. 70b, initially increases with undercooling (as AT 2 ) due to the 
effect of temperature on the spacing and A C y . However there is the usual exponential fall 
of the diffusion coefficient with temperature, so that although the eutectoid growth 
velocity initially increases with undercooling, it quickly reaches a maximum possible 
velocity and then the growth velocity decreases rapidly in the temperature range where 
diffusion becomes very slow. This qualitative pattern is always found and is of great 
importance in the heat-treatment of steels. In outline, the carbon-containing austenite 
needs to be quenched to form the high hardness martensite and to do this, the diffusional 
decomposition to pearlite, and the other diffusional products, allotriomorphic and 
Widmanstatten ferrite and also bainite (§ 2.6), must be avoided. To achieve this, either 
very rapid quenching is required or for samples too thick to be rapidly quenched, the 
transformation to pearlite needs to be slowed down by inhibiting either the nucleation of 
pearlite or its growth. The experimental diffusional kinetics are usually reported on time- 
temperature-transformation (TTT) diagrams showing the time for isothermal transfor- 
mation; these diagrams normally show “C-curve” behaviour (fig. 49), the “nose” of the 
C-curve being that for the maximum overall rate of transformation which has the usual 



Fig. 49. Isothermal TTT diagram for a low-alloy steel, containing 0.4% C, 0.3% Mo, 0.8% Cr, 1% Mn and 
1.8% Ni. The high-temperature transformation is firstly to precipitate ferrite (F) followed by pearlite (F + C), 
while the lower-temperature diffusional reaction is the formation of bainite (§2.6). (From the Atlas of 
Isothermal Transformation and Cooling Curves, ASM, Metals Park, OH, 1977.) 
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“5” kinetics of a JMAK type of nucleation and growth reaction (§2.4.1). In TTT 
diagrams, of course, the x-axis is not velocity but its effective reciprocal, the isothermal 
holding time to achieve a certain fraction transformed. The kinetics of the experimental 
TTT curve depend not only on the variation of growth rate with temperature but also the 
equivalent variation of nucleation rate. For the same reasons, the nucleation behaviour 
also show C-curve variation with temperature, § 2.1. The same type of C curve behaviour, for 
the equivalent reasons, is found for the diffusional precipitation of single phase a iron a 
(ferrite “F” in fig. 49) from y austenite (“A”) and the lower reaction of austenite to 
“bainite” (§2.6). These reactions as well as the formation of pearlite are seen in fig. 49. 

Although the transformation can be slowed down by an increase in austenite grain 
size, which increases the spacing between the grain-boundary nucleation sites, this is not 
the best solution, for reasons related to optimizing mechanical properties. The best 
method of improving the “hardenability” of steels is by slowing down the pearlite growth 
rate. This is achieved by addition of small amounts of substitutional metallic contents 
(nickel, manganese, chromium, etc.) which are soluble in the austenite but which at 
equilibrium will partition between the two product phases, but do so by slow substitut- 
ional diffusion. This idea the basis of low-alloy steels. At high temperatures the solute 
must partition since pearlite with uniformly distributed solute will have a higher free 
energy than the parent austenite. At lower temperatures, and higher undercoolings, there 
will be sufficient free-energy difference to allow transformation with “no partitioning” 
(Hultgren [1947]) but with an inevitable decrease in the free energy driving force and 
therefore an increase in the critical spacing, A s (rev). The “no partitioning” of ternary 
solute in eutectoids is comparable to the diffusionless precipitation of single-phase 
products discussed in §2.2.1. The reduction in growth velocity caused by these ternary 
solute effects increases with the amount of the low-alloy solute additions. 

There has been considerable experimental interest in studies of solute partitioning in 
steels (see, for example, Williams etal. [1979], Ridley and Lorimer [1981], Smith et 
al. [1981] and Ricks [1981]). Although it is well established that at high temperatures 
there is strong solute partitioning in low-alloy steel pearlites, and that with a fall in 
temperature the partitioning becomes incomplete, experimental difficulties appear, so far, 
to have prevented demonstration of the full no-partitioning of solute during the pearlite 
transformation. The difficulty is spatial resolution of the analysis, since at higher 
undercoolings the experimental spacings become rather too fine for the current analytical 
resolution. For the bainite reaction in steels there seems little doubt that there is no partition- 
ing of the substitutional metallic elements between the ferrite, a, and the carbide phases § 2.6. 

2.5.2. Discontinuous precipitation — MTPB precipitation 

An equation similar to eq. (70) describes the growth of a two-phase product behind 
a moving matrix grain boundary that relieves a supersaturated a matrix, the reaction 
being a —>a' +(3. This process occurs in many low-temperature precipitation reactions 
and arises after grain-boundary nucleation of the precipitate phase (3 which can then be 
supplied with solute by grain-boundary diffusion at temperatures where the competitive 
processes involve solute transport by very much slower bulk diffusion. The solute can be 
supplied from regions adjacent to the boundary — a process that is called the “collector 
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plate mechanism” if the boundary does not move. However, if the boundary can be 
induced to migrate into one of the grains, it can then collect and transport the solute from 
the supersaturated grain interior that it passes through, without requiring any bulk 
diffusion. The precipitates initially nucleated on the boundary can then grow while 
remaining attached to the moving boundary giving the characteristic lamellar structure. 
Gust [1979] published a very detailed review of the literature on this topic, starting from 
the original observation of the precipitation of copper at moving grain boundaries in 
Ag-Cu alloys by Ageew and Sachs [1930]. There are several points of interest in this 
type of transformation. They include the problem of how and why the reaction initiates, 
that is why the boundary starts to move, and how it is possible for different parts of the 
same boundary to migrate in different directions, as reported for example by Williams 
and Edington [1976]; the question of what determines the residual supersaturation in 
the product, since in many cases this is reported to be appreciable (e.g., Porter et al. 
[1974]); there is also the general problem of determining which reaction, the discon- 
tinuous one or growth by volume diffusion, will occur under any given circumstances. 
Studies by Sulonen [1964] and more recently by Chung etal. [1992] have shown that 
many of these questions may be answerable in terms of an elastic stress model for 
boundary migration in discontinuous precipitation. This is equivalent to the process 
driving the process usually called diffusion (or chemically) induced grain boundary 
motion — DIGM discussed in § 2.5.5. The role of elastic stress in both discontinuous, 
MTPB, precipitation and DIGM is considered in § 2.5.5. 

There is extensive evidence for significant residual supersaturation in the products of 
discontinuous precipitation. This comes from lattice-parameter measurements and from 
very high resolution microanalysis by electron energy loss techniques applied to Mg-Al 
alloys (Porter etal. [1974]) and to Al-Li alloys (Williams and Edington [1976]). 
Figure 50 gives the chemical analysis results across a matrix grain boundary in Mg-Al. 
This shows (i) a discontinuous change at the boundary, characteristic of a process 
proceeding by boundary diffusion, and (ii) that the resulting solid solution is still 
supersaturated in the aluminium solute. 
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Fig. 50. Microanalysis by electron energy loss, across the migrating grain boundary during discontinuous 
precipitation in a Mg-Al alloy (after Porter etal. [1974]). 


References: p. 1497. 




1458 


R. D. Doherty 


Ch. 15, §2 


The initiation of the discontinuous reaction was studied by Tu [1972] in the Pb-Sn, 
solder, system at room temperature. His micrographs show regular grain-boundary 
nucleation of a colony of Sn-rich precipitates which grow apparently with an orientation 
relationship with the grain that is not consumed, when the boundary moves. Once the 
boundary starts to move, further precipitates form to give the operating lamellar spacing. 

An important question concerns the conditions required for the discontinuous reaction 
to dominate. This will presumably require that the discontinuous process occurs faster 
than any alternative reactions such as grain-interior precipitation (usually of an easy-to- 
nucleate, usually metastable phase) or the growth of a boundary nucleated phase into the 
grain away from the stationary boundary. Both of these processes occur by mechanisms 
whose rates are determined by bulk diffusion. It might be expected, then, that the 
discontinuous process will take over from growth of boundary precipitates as the 
temperature falls, since the relative advantage of boundary diffusion over bulk diffusion 
increases as the temperature falls. Boundary diffusion has a lower activation energy. 
Results reviewed by Gust [1979] confirm this description with the discontinuous process 
in, for example, copper-rich Cu-In (Predel and Gust [1975]) dominating the precipitat- 
ion process as the temperature of the reaction fell below 300°C. However, if the 
temperature falls very far, general grain-interior nucleation dominates most precipitation 
reactions, at least in alloys selected for precipitation hardening (Kelly and Nicholson 
[1963]). For such alloys, high temperatures of precipitation frequently cause discon- 
tinuous precipitation of the stable phase in place of the general precipitation of the 
strengthening metastable phases. On this basis, there should be only a narrow “window” 
of conditions that allows the discontinuous reaction to proceed. This is just as well since 
discontinuously precipitated alloys do not have good mechanical properties; having 
neither the high strength of precipitation-hardened alloys nor the good toughness of 
single-phase alloys. The studies by Sulonen [1964a and b] and by Chung etal. [1992], 
discussed in § 2.5.5, show clearly that, in addition to these requirements, there must be 
an elastic stress effect due to a change in the lattice parameter of the a matrix by 
changes in the solute content is also required. It appears as if the elastic stress in the 
consumed grains drives the process so it should be possible to develop a model for the 
kinetics of this type of discontinuous, MTPB, precipitation based on the elastic stress 
model. Such a model, however, does not yet appear to have been published. 

2.5.3. Discontinuous (MTPB) coarsening 

The ordinary coarsening process, described in §2.4, involves the reduction of 
interfacial energy by growth of large precipitates, at the expense of smaller ones, by 
diffusion either through the lattice or — for a set of precipitates all linked by grain 
boundaries — by boundary diffusion in stationary boundaries. Livingston and Cahn 
[1974] discovered a discontinuous (MTPB) coarsening reaction in polycrystalline 
eutectoids in Co-Si, Cu-In and Ni-In, when the microstructures were annealed at 
temperatures close to the eutectoid temperature. Figure 51 shows a typical microstructure 
in which the upper grain is growing into the lower grain on the right while the opposite 
is happening on the left. The grain boundary is only visible in the grey Co 2 Si phase. The 
driving force for this reaction certainly involves the reduction in interfacial area between 
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Fig. 51. Discontinuous coarsening of eutectoid in Co-Si alloys; a polycrystalline sample annealed at 1000°C 
for 96 h. x 1875. (From Livingston and Cahn [1974].) 


the two phases. The reaction kinetics were successfully analyzed by Livingston and 
Cahn [1974] on the basis of boundary diffusion to give a predicted boundary velocity: 


v = 


j _ 

fj^-c^^RTy "Aj 


(72) 


where A, and A 2 are the spacings before and after coarsening, C b is the solute com- 
position in grain boundary <7 a/3 is the energy of the interface between the two phases and 
the other terms have their previous meanings. This analysis was consistent with all the 
observations made. Of particular interest was the clear demonstration of which way the 
boundary moves. As seen in fig. 51, the grain whose lamellae lie nearly parallel to the 
initial boundary plane is able to grow, since it automatically has a larger effective 
spacing along the boundary. The only difficulty that arises here is the inevitable change 
of interfacial plane that occurs with this process, even though the two phases have the 
same relative orientations; the properties, particularly the energy of the interface, <r u/3 , 
may vary with boundary orientation unless the interface is fully incoherent. 

A further very important point, discussed by Livingston and Cahn, was the magnitude 
of the spacing ratio, A^A,, which was shown to be between 5 and 7 for Cu-In and 
Co-Si and between 10 and 20 for Ni-In. This is discussed in more detail in §2.5.4. At 
present there is no evidence if discontinuous coarsening required movement of the grain 
boundary is, or is not, driven by elastic stress effects. 

Chuang etal. [1988] studied the discontinuous coarsening reaction in an Ni-7.5at% 
In alloy where the initial lamellar structure had formed by discontinuous precipitation. 
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They showed that the initial lamellar structure retained, as it commonly does, a signif- 
icant solute supersaturation in the matrix phase and that during coarsening this residual 
supersaturation was essentially removed. In this case, both the residual supersaturation 
and the interfacial energy were driving the reaction. The authors reported that the 
transport mechanisms, in both the discontinuous precipitation and the subsequent 
coarsening reaction, were by grain-boundary diffusion. Somewhat smaller increases (by 
3 to 5 times) of the lamellar spacing occurred here than in the coarsening reaction 
studied by Livingston and Cahn whose starting structure had been produced by eutectoid 
decomposition. 

Other examples of discontinuous coarsening had been published by Gust [1979], 
though without discussion. One further example has already been mentioned, the work 
of Williams and Edington [1976] using Al-Li alloys. At low ageing temperatures close 
to room temperature, the discontinuous reaction apparently involved formation by precip- 
itation of the two-phase lamellar structure from a fully supersaturated solid solution, but 
at higher ageing temperatures the discontinuous reaction was one of coarsening when a 
lamellar, Al + Al 3 Li, cube-cube oriented, product grew out from grain boundaries to 
consume the fine coherent precipitate nucleated in the adjacent grain. The reaction halted 
when the spherical Al 3 Li distribution had coarsened by the usual LSW process, and also 
presumably when the aluminium supersaturation had been removed by completion of the 
grain-interior reaction. An interesting result reported by Williams and Edington was that 
grain boundaries close to coincidence-site orientations (ch. 9, § 2.2) were less able to 
show the discontinuous reaction than general high-angle grain boundaries. 

Doherty [1982] discussed the formation of similar coarsened lamellar structures by 
deformation-induced boundary migration (recrystallization) in nickel alloys containing 
coherent Ni 3 Al precipitates. Here the driving force for boundary movement was clearly 
the stored energy of the dislocations. 

An interesting final reaction in this series is one described by Nakkalil and Gupta 
[1987]: this was the discontinuous (MTPB) dissolution of discontinuously precipitated 
Al-20% Zn when annealed close to the solvus temperature. Of interest here is that the 
spacing, of the dissolving two phase structure is fixed. Does the reaction have time to 
reach equilibrium giving a homogeneous solid solution and what role, if any, does the 
coherency stress play? 

2.5.4. Determination of lamellar spacing in discontinuous (MTB) reactions 

In the initial Zener [1946] analysis it was assumed that the system adopted the 
spacing that allowed the interface to migrate at the maximum velocity, which for the 
volume-diffusion mechanism gives A s = 2A s (rev) and for interface diffusion mechanism, 
A s = 1.5 A s (rev). Puls and Kirkaldy [1972] discussed alternative optimization criteria, 
including the spacing that gives the maximum rate of entropy production. This yields 
somewhat larger optimum spacings, which are 3A s (rev) and 2A s (rev), for volume and 
interface diffusion mechanisms, respectively. There is, however, considerable difficulty 
in estimating the appropriate values for A s (rev) since its evaluation requires knowledge 
of the interfacial energy as well as the free energy driving the eutectoid reaction. Free- 
energy calculations for discontinuous precipitation will be easier, particularly in dilute 
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alloys. However, as Livingston and Cahn [1974] pointed out, the discontinuous 
coarsening reaction does not have either of these difficulties, since the relevant parameter 
is the spacing ratio which can be easily directly measured, this is not true however if the 
initial lamellar structure still retains some solute supersaturation. The spacing ratios in 
coarsening were found by Livingston and Cahn to be considerably bigger than predicted 
by either of the proposed optimization criteria, maximum velocity or maximum rate of 
entropy production. The coarsening results therefore suggest growth well away from the 
maximum-velocity condition, at larger spacings. The same conclusion is also suggested 
by various experimental studies on discontinuous precipitation (Russew and Gust [1979] 
and Speich [1968]), results reviewed by Hillert [1982]. It was shown by Hillert that 
the experimental results lay close to the line, predicted by his diffusional model, of a plot 
of the residual supersaturation against lamellar spacing, but at larger values of the spacing 
than was predicted for the maximum-velocity criterion. 

By comparison with the equivalent problem in dendritic growth, it would appear that 
there are experimental reasons, firstly, for expecting growth on the large side of the 
spacing predicted by the maximum-velocity criterion, and secondly, for the potential 
successful application of some type of stability criterion for lamellar spacing. The choice 
of spacing will also need to be analysed in the context of the insights into the role of 
elastic stress described below. 

2.5.5. Diffusion-induced grain-boundary migration (DIGM) 

In discontinuous, MTPB, reactions the available free energy of the precipitation, for 
example IJ in fig. 1, might be applied solely to the growing /I -phase alone and the 
growth of the /1-phase then may pull the depleted a matrix along. Alternatively the 
energy could be applied to both phases. Hillert [1972, 1982] in his analysis of 
discontinuous precipitation has used this idea of the partition of the free energy between 
the two phases. In order to try to study this problem more fully, a simpler experimental 
situation was devised by Hillert and Purdy [1978]. Their experiment identified a 
process that is now called by the title of this section, diffusion- induced grain-boundary 
migration, DIGM. CIGM, for chemically induced boundary migration, is sometimes used 
as an alternative name for the same process. DIGM can be achieved by alloying, or 
dealloying, a polycrystalline sample from the surface at a temperature at which bulk 
diffusion has been almost “frozen out”, but where boundary diffusion still takes place at 
a significant rate. Hillert and Purdy [1978] introduced zinc into thin polycrystalline 
iron samples from a vapour source, an Fe-11.3wt% Zn alloy, at temperatures between 
545 and 600°C. They observed boundary migration which left a layer of zinc -enriched 
solid solution behind the migrating boundary. They treated the driving force for this 
reaction is the free energy of mixing per atom, AF rn . The effective or apparent mobility, 
Af a , of the boundaries was then estimated from 

v = M^ F m . (73) 

Apparent mobilities of grain boundaries in iron, at 580°C, of Af a of about 1(T 17 m 4 /J s 
were found which were much smaller than the values found from ordinary boundary- 
mobility experiments or from the work of Speich [1968] on discontinuous precipitation 
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in iron-rich Fe-Zn alloys (M= 10~ 5 and 10“ 12 m 4 /J s respectively). In studies of discon- 
tinuous precipitation in Cu-Co by Perovic and Purdy [1981] values of Af a lower by 10 4 
than boundary mobilities measured in recrystallization were also reported. It was not 
clear, at that time, whether the discrepancy arises from extraneous sources such as solute 
drag or if the use of eq. (73) to determine velocity is not justified. The process of DIGM 
had been previously seen, for example by Rhines and Montgomery [1938] and by den 
Broder [1972] and was subsequently identified in solid-liquid mixtures by Yoon and 
Huppmann [1979]. The paper of Hillert and Purdy [1978] created a great deal of 
interest in the process and numerous subsequent studies of the phenomenon followed as 
described by the reviews of Handwerker [1989] and Yoon [1989]. 

Cahn et al. [1978] and Balluffi and Cahn [1981] reviewed the phenomenon of 
DIGM and noted that it occurred under a whole range of conditions, including both 
alloying and dealloying in systems with either a positive or a negative deviation from 
ideal solution behaviour, that is, with positive or negative heats of mixing. Cahn et al. 
[1978] draw attention to the critical theoretical problem in DIGM and equivalently in 
discontinuous precipitation. This is how the overall free-energy decrease can couple with 
the individual atomic motions required for boundary migration. The problem is shown in 
the schematic migrating-boundary structure in fig. 52. The boundary contains solute in 
equilibrium with the solute content of the growing grain on the left but, apart from the 
boundary plane, there is no solute in the grain on the right. If an atomic layer of solute- 
rich alloy is formed on the grain on the left by atom transfer from the right, together 
with solute addition down the boundary, a decrease of free energy of mixing will have 
occurred. However, this difference in free energy is apparently not available to the atoms 
in the right grain, that must jump across the boundary. Cahn et al. [1978] proposed a 
model for the process based on differences in the rate of boundary diffusion , the grain- 
boundary equivalent of the Kirkendall effect. This has the result that extra lattice sites 


Boundary 

motion 



Grain 1 Grain 2 


Fig. 52. Diffusion-controlled grain-boundary motion; solute atoms, solid circles, have diffused down the 
boundary and are causing growth of grain 1, which is being solute-enriched and therefore causing a reduction 
in free energy by the free energy of mixing. What is the driving force causing a net transfer of atoms across 
the boundary? 


Ch. 15, §2 


Diffusive phase transformations 


1463 


must be created, or lost, at the boundary which provides, via the climb of grain-boundary 
edge dislocations, a means of moving the grain boundary. There is clear evidence, for 
example in the Fe-Zn alloy experiments of Hillert and Purdy, that there is surface tilting 
at the migrating grain-boundary regions, characteristic of volume changes predicted by 
the boundary Kirkendall effect. 

The topic of DIGM has been reviewed several times recently, by Handwerker 
[1989], Yoon [1989] and Doherty [1992] and it now seems clearly established that the 
driving force that moves the grain boundary is the elastic coherency stress due to the 
change of lattice parameter in the matrix in the layer of solute enriched material immedia- 
tely ahead of the moving grain boundary. An example of this is monolayer of such 
enrichment shown in fig. 52. If the grain boundary is moving with a velocity v and the 
bulk diffusion coefficient is D, then the thickness, <5 S , of the solute-enriched layer, in the 
grain being consumed, is given by: 

S s = D/v when D/v > b (74a) 

S s = b when D/v < b. (74b) 

b is the atomic diameter of the matrix atoms. Chung et al. [1992] suggested that 
eq. (74a) would be expected to apply even when D/v<b, but this fails to take into 
account that diffusion of solute down a grain boundary does not occur in a channel of 
atom sites within the boundary but occurs by atoms which are part of the surface layer of 
both grains. Figure 52 sketches the situation expected of eq. (74b). The large driving 
force of the free energy of mixing will ensure that atoms in a grain boundary should 
always be part of the surface layers of both grain in a boundary and this idea is clearly 
confirmed by the experimental results reported by Chung etal. [1992], In the situation 
where 17, 17 = (l/a)(da/dC) * 0, then the presence of such a solute-rich layer, of composi- 
tion C s different from the matrix composition C 0 with either S s = b or S s > b, will give rise 
to an elastic energy per atom, A / r Ea . This occurs since the solute modified layer which 
unstressed will have a different lattice parameter from the unsolute modified part of the 
grain but is coherently stressed to that lattice parameter. This elastic coherency energy is 
given by: 

AFfc, = VJ(n)8 2 (75a) 

Here V a is the volume per atom, T(n) is the elastic modulus that varies with the surface 
normal n and misfit 8 is the strain given by: 

5 = V (C s - C 0 ) (75b) 

What appears to happen during DIGM is that the solute diffuses down the initially 
stationary grain boundary and enters the grains on both sides of the boundary to a depth 
that is approximately ( Dt ) 05 , where D is the bulk diffusion coefficient and t the time of 
diffusion. There will be a steady build up of elastic strain until on one side of the 
boundary the strain is plastically relaxed by the formation of a cross-grid of edge 
dislocations. These dislocations take up the change of lattice parameter between the 
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solute enriched region and the original composition. Such dislocation arrays, at a position 
close to the original position of the grain boundary, are commonly reported, see for 
example Pan and Balluffi [1982]. Once the elastic energy is plastically relaxed on one 
side of the boundary, then there is a driving pressure to cause migration of atoms from 
the grain that is still elastically strained. The velocity of the migration is given by 
equation (73) but with the driving force that of the elastic coherency energy, A F^, not 
the full free energy of mixing, A F m . The free energy of mixing drives the diffusion 
process down the grain boundary and into the grains so giving rise to the elastic strain 
energy. Since A F m > A the low apparent value of boundary mobility in DIGM noted 
above arises since the incorrect larger driving force was used for its evaluation. 

The elastic stress energy explanation for DIGM was demonstrated by several studies 
from Yoon and his colleagues using liquid-solid mixtures, as reviewed by Yoon [1989] 
For example, Rhee and Yoon [1989], using liquid phase sintered Mo-Ni established a 
uniform Ni content in the Mo powder particles which, after sintering, produced fully and 
partially liquid wetted grain boundaries. Liquid wetted boundaries were ones where the 
grain boundary energy, cr gb , was larger than twice the solid liquid energy, er sl . The 
partially wetted boundaries were ones with <r gb < 2 <r sl . On adding either Co or Sn to the 
liquid, there was rapid liquid and grain-boundary diffusion of the ternary solutes but slow 
bulk diffusion into the solid particles — causing elastic coherency strain in the surface 
layers on both sides of the boundary. When this strain was plastically relaxed by an array 
of edge dislocations on one side of the boundary, it caused DIGM of partially liquid 
wetted boundaries and an equivalent process of liquid film migration (LFM) of fully 
wetted grain boundaries. The authors were able to vary the strain energy driving the 
reaction by use of different combinations of Co and Sn: One solute, Co, gave a negative 
strain and the other a positive strain. Figure 53 shows the predicted parabolic relation- 
ship between the migration distance, in a fixed diffusion time, and the calculated 
coherency strain, AFga, from eq. (75a) being used as e the driving pressure in eq. (73), 
with v=M a AF Ea . The study showed the same predicted type of parabolic dependence on 
coherency strain for migration of grain boundaries (“CIGM” for chemically-induced 
grain-boundary migration) as for grain boundaries wetted by liquid films. The kinetics of 
LFM are apparently slightly faster than that of DIGM/CIGM. The coherency energy 
model of DIGM is also confirmed by the initial observation by Hillert and Purdy 
[1978] that the position from which the grain boundary has moved in DIGM is marked 
by an array of dislocations that have relaxed the coherency strain in the grain that 
subsequently grew into the opposite grain. This plastic relaxation on the growing side of 
the boundary is what lowers the coherency energy of one grain and allows it to grow 
into the grain that is still elastically strained. It is commonly seen that after awhile DIGM 
stops — again a not unexpected feature if the grain which is being consumed also 
manages to relax the coherency strain plastically. 

The coherency strain energy model had been previously identified, at least qualitatively, 
as controlling discontinuous precipitation by the elegant experiments of Sulonen [1964a 
and b]. In these experiments the velocities of moving grain boundaries, giving discon- 
tinuous precipitation, were modified by the application of unidirectional tensile or 
compressive stresses. Boundaries parallel or normal to the stress axis had their velocities 
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Fig. 53. The observed variation of the average migration distance for DIGM (labelled CIGM) and Liquid Film 
Migration (LFM) for Mo-Ni alloys embedded in Mo-Ni-Co-Sn on holding at 1460°C for 2 hours as a function 
of the coherency strain. The numbers above the filled circles are the values of the increased Co content. The 
Solid and dotted lines are the parabolic fits to the experimental data. After Rhee and Yoon [1989]. 


increased or decreased in a predictable manner in a series of binary alloys dependent on 
the signs of the applied stress and the local elastic coherency strain in a range of 
systems. Hillert [1972] provided a clear analysis of the thermodynamics of the 
situation. Interestingly, despite the availability of this critical evidence of the role of 
coherency strain energy in discontinuous precipitation, the insights were largely over- 
looked for nearly 30 years in much, but not all, of the subsequent literature of discon- 
tinuous precipitation and DIGM. It is pleasing to note that the good work of Sulonen 
[1964a and b] was eventually recognized. Chung et al. [1992] repeated the Sulonen 
experiments in an Al-21.8at% Zn alloy that after solution treatment was held under 
different tensile stresses at both high, 215°C, and low, 75°C, temperatures. The changes 
in boundary velocities of discontinuous precipitation were determined as a function of 
applied stress. Figure 54 shows the averaged growth rates of the two types of boundaries, 
parallel and transverse to the stress, as a function of applied tensile stress. The predicted 
transverse and parallel velocities, v, and v p , using the elastic energy relationships, derived 
by Hillert [1972], were quoted by Chung et al. [1992] as: 


v t =M a 


Y8 2 
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The values of the apparent mobility, A/ a , and the misfit, 8, were calculated by applying 


References: p. 1497. 




1466 


R. D. Doherty 


Ch. 15, §2 



0 20 40 60 80 100 120 

Applied stress (MPa) 

Fig. 54. The average discontinuous precipitation growth rate as a function of applied tensile stress, for 
boundaries normal (transverse) to the stress and boundaries parallel to the stress in Al-22at % Zn held at 75°C 
for 1 hour. After Chung, Shin and Yoon [1992]. 

these equations to the the slopes and intercepts of the two plots in fig. 54. The values 
found were Af a = 4.1 x 1CT 15 m 4 F 1 s -1 and 8 =-1.6x 10 -3 for the transverse boundaries and 
Af a = 2.5x 1CT 15 m 4 F 1 s~ l and 5 =-2.0x 10~ 3 for the parallel boundaries. The estimated 
value of the strain, 5, due to the misfit with C 0 = 21.8at% Zn, the alloy composition, and 
C s = 12.8at% Zn, the equilibrium solute content at the transformation temperature of 75°C 
was, from eq. (75b), 5 =-2.5x 1CT 3 . It is likely that the value of C s will not fall all the 
way to the equilibrium solubility since most studies of discontinuous precipitation 
indicate that the product matrix retains some supersaturation. The very close qualitative 
and quantitative agreement between theory and experiment here is, however, very 
satisfactory and clearly indicates that the analysis appears valid. In other words, the 
identification by Sulonen [1964] of coherency stress as the central mechanism in 
discontinuous precipitation, that he had qualitatively demonstrated, is now quantitatively 
confirmed. It might be noted that exactly similar coherency effects should occur in 
eutectoid transformations and thus influence the migration kinetics. However the author 
of this review knows of no theoretical or experimental investigation of this idea. 

Chung et al. [1992] were disturbed that the analysis worked as well for the 
conditions at the low temperature, 75°C, where D/v was very much less than an atomic 
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diameter, b, than at the higher temperature where D/v was much larger than an atomic 
diameter. However, as noted above, this result appears very reasonable since the surface 
layer of a grain boundary exposed to solute should contain a high solute concentration. 
In fact, it is clear that the results from Chung etal. [1992], together with similar results 
that they quote from earlier studies, demonstrate that the solute must penetrate the 
surface layer during boundary diffusion even when the microscopically predicted 
penetration, D/v, fails to indicate this. In other words the analysis of Chung et al. shows 
clearly that, during grain boundary diffusion, the outer layer of both grains carries the 
solute content of the grain boundary and changes the equilibrium lattice parameter of the 
solute modified layer even for conditions (such as D/v<s.b) in which continuum 
modeling predicts no solute penetration. 

The clear demonstration of the role of elastic strain energy in driving boundary 
migration in discontinuous precipitation immediately explains why the resulting matrix 
may be supersaturated. The boundary migrates at a rate determined by the product of the 
boundary mobility and the elastic energy and not at rates determined directly by the 
precipitation. So the boundary migration could easily be too fast to allow the full release 
of the matrix supersaturation. The result also allows an easy prediction as to the 
likelihood of the discontinuous precipitation process to occur. The larger the values of 77 
and of (C 0 -C s ) the more likely is the discontinuous process to occur. Bohm [1961] had 
earlier noted the importance of the minimum magnitude of 77 in determining the onset of 
discontinuous precipitation in a range of copper alloys. A significant value of the elastic 
energy is only a necessary condition but it is not always a sufficient condition for this 
mode of precipitation since supersaturation may be released even faster by a high rate of 
nucleation and growth of grain interior usually metastable precipitates — with a low 
interfacial energy barrier to nucleation. 

An important phenomenon, described by Chongmo and Hillert [1981], remains 
however a major mystery. In their study of DIGM in Fe-Zn they observed a pheno- 
menon now called diffusion-induced recrystallization (DIR) on the surface of their 
samples. In some circumstances new small grains were nucleated at the surface of the 
sample in regions away from prior grain boundaries. The term “recrystallization” refers 
to the formation of new grains, usually as a result of annealing plastically deformed 
material. The process of nucleation of recrystallization, discussed in chapter 28, requires 
high angle grain boundaries to give the required mobility for new grains to grow. In 
plastically deformed materials the required misorientations are known to be produced by 
plastic deformation, see for example the review by Doherty [1978]. How solute entry 
into the surface of the material with its associated elastic coherency energy is able to 
produce the required large local misorientations remains a unsolved problem in DIGM. 
The phenomenon of DIR is reported as occurring more easily with rougher surfaces than 
with electropolished surfaces and it occurs more often at higher temperatures. Similarly, 
though the mechanisms of growth in DIGM now appears to be well understood, there is 
less insight into which grain DIGM moves into at arbitrary grain boundaries and what 
causes the common observation, see for example Hillert and Purdy [1978] and 
Chongmo and Hillert [1981], that in some grain boundaries, migration of different 
parts of the same boundary occurs at different places into one grain and elsewhere on the 
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same boundary into the other grain. As noted above similar two-way boundary migration 
can happen in discontinuous precipitation. That is, the nucleation stage of DIGM, 
discontinuous precipitation and particularly DIR is not yet understood. 

2.5.6. Experimental results on discontinuous eutectoidal reactions 

Puls and Kjrkaldy [1972] reviewed in detail the experimental results of eutectoidal 
decompositions. For the eutectoidal reaction in binary iron-carbon alloys, the reactions 
appear to go at the rates expected for volume diffusion of carbon, though with some 
indication of rate enhancement by additional boundary diffusion. In the observations on 
eutectoid growth in binary substitutional alloys several studies, such as the fee Al-78% 
Zn eutectoid (Cheetham and Ridley [1971]) showed boundary diffusion to be dominant 
while in the Cu-Al eutectoid where the decomposing “austenite” phase is bcc with a high 
bulk diffusivity, volume diffusion control was reported (Asundi and West [1966])*. In 
all these eutectoid systems, as in eutectic solidification where the reviewed results were 
considered along with solid-state reactions by Kjrkaldy and Sharma [1980], the results 
were tested against models which assumed a maximum-velocity, or the equivalent, a 
minimum-undercooling criterion, and showed physically sensible behaviour. 

From their study on Fe-C-X ternary alloys, Puls and Kirkaldy [1972] suggested 
that the data are apparently explicable on the reasonable basis that there is no partitioning 
of the substitutional solute and hence carbon diffusion control operates at low temperat- 
ures: At high temperatures, however, there is evidence for a shift to control by boundary 
diffusion of the substitutional elements, which are then partitioning. 

Unfortunately there is lack of full information on the crucial parameters, such as the 
surface energies and the relevant diffusion coefficients, to allow a really satisfactory test 
of the theories to be made. However, at the moment, the overall situation appears to be 
one of residual doubt about the operating criterion that determines the spacing, but apart 
from this problem the theories appear in good shape. However it would appear that a 
general account of all moving two-phase boundary transformations: eutectoid decom- 
position, discontinuous (MTPB) precipitation, discontinuous (MTBB) coarsening and 
dissolution, DIGM and LFM is needed that includes the role of coherency strain energy. 

2.6. Bainitic transformations 

In the isothermal TTT diagrams for iron-carbon alloys there is an important transfor- 
mation, seen in fig. 49, at temperatures below the “nose” of the pearlite reaction. The 
product of this lower-temperature transformation is called bainite and usually appears to 
be a mixture of a-Fe (ferrite) and finely dispersed cementite, Fe 3 C. Two varieties of 
bainite, known as upper bainite and lower bainite, have been identified, corresponding to 


* This result is interesting for the theory of precipitate shape stability (§ 2.2.6). The y-phase in Cu-Zn and in 
Cu-Al precipitated from supersaturated -phase as solid-state dendrites and it was argued in § 2.2.6. 1 that a 
high rate of bulk diffusion relative to interface diffusion is required for such instabilities to form. That the 
eutectoidal decomposition of bcc / 3 in Cu-Al is by bulk rather than interface diffusion is in agreement with this 
hypothesis. 
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their respective temperatures of formation. Upper bainite has carbides which appear to 
have formed with the usual orientation relationship found for carbides forming in the 
austenite matrix. Lower bainite appears to have carbides with an orientation relationship 
with the growing ferrite phase indicating that they precipitated within the ferrite, after it 
was formed. In some steels, particularly those with high levels of silicon, carbide free 
bainite is formed, at least initially. The subject has been reviewed and discussed many 
times by Aaronson [1969], Hehemann [1970], Hehemann etal. [1972] Bhadeshia and 
Edmonds [1980], Christian and Edmonds [1984], Aaronson et al. [1987-89], 
Bhadeshia [1985, 1987] Christian and Edmonds [1988/9], Aaronson etal. [1990], 
Bhadeshia and Christian [1990], Aaronson and Hall [1994a], Aaronson et al. 
[1994b], Hillert [1994], The last four references come from two recent conferences: 
International Conference on Bainite [1990] and Role of Shear and Diffusion 
in Plate Shaped Transformation Products [1994] devoted solely to the topic of 
bainite and related transformations. There are difficulties in making a clear interpretation; 
the following represents a summary of what appear to be significant results reviewed in 
the light of the ideas presented in this chapter. For full details of the experimental results 
and differing views on the transformation the papers at the conferences and their cited 
references should be consulted. It should also be noted that the author of this review has 
not carried out any experimental or theoretical investigations in the field of bainite. 

There are several interpretations of the experimental results on bainite, all of these 
disputes are connected with the mechanism of the interface reaction as the plate-like 
product, bcc ferrite in Fe-C bainite, grows from the matrix, fee austenite in Fe-C bainite. 
They concern the nature of the interfacial reaction, the carbon supersaturation in the 
bainitic ferrite and the mechanism of the incomplete reaction in bainite. 

i) Bainite: displacive or reconstructive? 

One interpretation is that the formation of bainite plates in bainite occurs by a 
coordinated atomic motion, a mechanism similar to that seen in martensite. This is 
described as a “military” or a “displacive” reaction (Christian [1965], Bhadeshia 
[1985] see §§ 1.1 and 1.6). The alternative viewpoint is that the transfer of atoms occurs 
by random, thermally-activated atomic jumps across the interface a mechanism described 
as “civilian” or as “reconstructive”. The term diffusive for the civilian interface reaction 
will not be used here since, at some stage, in the formation of low carbon ferrite from 
austenite with a higher carbon content, there is a composition change occurring by the 
diffusional motion of carbon atoms (This is why the reaction is discussed in the present 
chapter). For clarity, the two models will be described here as either displacive or 
reconstructive. One of the difficulties of interpretation concerns the significance of the 
surface relief that accompanies the formation of plates of bainite. This type of surface 
relief is a defining experimental characteristic of a displacive reaction, see the following 
chapter, and the observation by Ko and Cottrell [1952] of surface relief in the bainite 
reaction was for most, but not all, students of the subject, the critical observation 
supporting the displacive mechanism. 

The mechanism of the bainite reaction in iron-carbon alloys needs to be compared to 
the mechanisms of other austenite decomposition reactions in iron-carbon alloys. These 
other reactions include: 
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(i) The formation of carbon-depleted, grain-boundary “allotriomorphic” a ferrite or, 
at higher carbon contents, iron carbide, Fe 3 C. The name allotriomorph comes from the 
non plate-like shapes of the grain-boundary products that form at high temperatures and 
low supersaturations. 

(ii) The precipitation of non-carbon partitioned ferrite, the “massive” reaction, 
§ 2.2.52, forming from very low carbon austenites. 

(iii) The growth of lamellar sheets of a plus carbide, the “pearlite” reaction, §2.5.1 
and 2.5.6. 

(iv) The growth of coherent “Widmanstatten” plates of either carbon-depleted ferrite 
or equivalently of iron carbide, § 2.2.6.2. 

(v) The formation of “lath” martensite (with a high dislocation density) and, at higher 
carbon concentrations, “plate” (internally twinned) martensite in rapidly quenched iron 
carbon alloys. (These displacive reactions are discussed in the next chapter). 

The first three reactions are agreed to be reconstructive. There is no surface relief 
reported during their growth. In addition, it is generally accepted that these grain- 
boundary nucleated structures grow into the austenite grain with which the product or 
products have no good-fit orientation relationship. That is, the interface has the same type 
of incoherent structure as is found for a high-angle grain boundary. The martensite 
reactions are agreed to be displacive and to occur with, characteristic surface relief, no 
partitioning of carbon at least during the initial growth of the martensite plates. The 
growth of martensite can occur at rates that approach the speed of sound in iron 

1 km/s). The growth of plates of carbon depleted Widmanstatten ferrite (iv) also has 
some aspects of an apparent displacive reaction, notably its surface relief effects, as 
described by Clark and Wayman [1970] and by Watson and McDougal [1973], and 
so the interface reaction in Widmanstatten ferrite is thus a central part of the bainite 
dispute. As previously discussed in § 2.2.6, the Widmanstatten plates result from growth 
into the parent austenite grain with which the product does have a good atomic fit 
orientation relationship. The habit plane of the precipitate is the interface plane that 
shows the good atomic fit. The growth rate or at least the lengthening of a 
Widmanstatten plate is known to be constant at a fixed undercooling and to occur at a 
rate matching that expected for carbon-diffusion control, § 2.2.6.2. 

Purdy and Hillert [1984] have argued for the close morphological similarity 
between the formation of Widmanstatten ferrite and bainite and between Widmanstatten 
ferrite and martensite by some simple and elegant metallographic experiments. Bainite 
plates in partially transformed Fe-C alloys were shown, by up-quenching, to continue 
their growth as Widmanstatten plates while maintaining the same morphology they had as 
bainite. Similarly, continued growth of Widmanstatten plates as martensite plates parallel 
to the original ferrite plates was shown by down-quenching partially transformed 
structures. 

A mark of the essential crystallographic basis of the growth of both Widmanstatten 
ferrite and of bainite is that no demonstration of either structure growing across an 
austenite grain boundary appears to have been reported. Ko and Cottrell [1952] 
initially noted that bainite plates did not cross austenite grain boundaries. The crystallo- 
graphic model for martensite growth, as for mechanical twinning, forbids the continued 



Ch. 15, §2 


Diffusive phase transformations 


1471 


growth of a twin or a martensite plate across a grain boundary into a crystal with which 
it does not have the required orientation relationship. Precipitates growing by random, 
thermally activated, atomic motion (the “civilian transformation”, Christian [1965]) can, 
and certainly do, grow across matrix grain boundaries, an important example being 
pearlite, as reported by Mehl [1947]. 

There are other precipitation reactions which are also described as “bainitic” by virtue 
of showing both diffusional changes in composition as well as surface relief effects. 
Examples include Cu-Zn (Garwood [1954], Cu-Al (Smith [1962]), Cu-Sn (de Bondt 
and Deruyterre [1967]), Ag-Cd (Kostic and Hawbolt [1975]) in which plates of fee 
(a) precipitation from bcc (b) all growing with significant supersaturation showed surface 
relief, as discussed by Clark and Wayman [1970], Liu and Aaronson [1970] reported 
surface relief effects for the Widmanstatten precipitation of Ag 2 Al plates from super- 
saturated Al. Other reactions, in which there is a change of chemical composition and 
surface relief effects during precipitation, include: Cr-Cu, (Hall et al [1975]), Ti-Cr, 
(Lee and Aaronson [1988]) and Cr-Ni (Cliff et al. [1990]). In all these reactions there 
is long-range diffusion of substitutional solute combined with the interface reaction that 
gives rise to the surface relief. Cliff et al. [1990] recently demonstrated that at the 
earliest stages of growth of these substitutional bainites that there was a measurable 
concentration difference between the two phases, so confirming that the reactions are not 
fully martensitic. Surface relief combined with long range diffusion of interstitial solute 
has been reported for oxide (Landuyt and Wayman [1968]), hydride (Bowles et al. 
[1977], Cassidy etal. [1977]), and nitride (Dahmen et al. [1987] precipitation as well as 
for the bainite and Widmanstatten ferrite reactions in Fe-C alloys. 

For the precipitation of Ag 2 Al, previously discussed in § 2.2.3 and 2.2.5.2, it is now 
agreed by the leading protagonists of the reconstructive model for bainite growth, 
Aaronson et al. [1990], that the Ag 2 Al reaction does indeed occur by a combined 
diffusive flux of solute to the precipitate and a displacive interface motion. In this case 
the displacive reaction takes place by the glide of Shockley partial dislocations on every 
second [111] plane, giving a purely displacive transformation of the fee matrix to the 
eph precipitate, Christian and Edmonds [1988/9]. These partial dislocations are the 
growth ledges discussed in §2.2.3 and, as previously mentioned, the system shows 
surface relief effects. It appears therefore that a combined reaction showing long range 
substitutional solute transport by diffusion with an interfacial reaction that is completely 
described by dislocation shear model can and does indeed occur. It remains to be settled 
however if such a description applies to all, or even some, of the other reactions forming 
plate-like products with a good matching habit plane. Only for the Ag 2 Al reaction does 
there appear to be full agreement on the structure of the interface. It might be noted that 
the martensitic reaction between fee and eph is unusually simple in that the so-called 
“Bain-Strain” plus rigid body rotation that transforms, with minimum strain, one crystal 
structure into the product is the required “Invariant Plane Strain” for minimum distortion 
needed for the “Phenomenological Theory of Martensite” (see the next chapter for details 
of this theory). Most other martensite reactions are ones in which some other lattice 
invariant shear, usually dislocation slip or twinning, is needed, in combination with the 
Bain-Strain plus rigid body rotation, to produce the required invariant plane strain. In 
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these other martensitic reactions the interfacial structures needed for glissile displacive 
interface migration are much more complicated than that between fee and cph. 

Ko and Cottrell [1952] first observed the characteristic surface relief of the 
formation of bainite and Srinivasan and Wayman [1968] showed that the form of the 
surface displacements was the invariant plane strain characteristic of a martensitic 
reaction. There is also clear evidence from Clark and Wayman [1970], Watson and 
McDougal [1973] and Bhadeshia [1981]) that Widmanstatten ferrite also has a 
displacive character, with an invariant plane strain, demonstrated by the surface relief of 
the ferrite plates. Frequently the surface relief is described as being “tent-shaped” as 
though there were two back to back plates shearing in opposite directions, though both 
shears still have an invariant plane strain form. Such plates, with the tent shaped relief, 
are commonly reported to be mono-crystalline, Hall and Aaronson [1994], and thus 
they have not formed by the growth of two differently oriented product crystals. That is, 
the tent shaped surface relief is caused by growth of a single crystal plate showing 
opposite shear as the two opposite plate surfaces grow apart. In ferrite, there is also clear 
evidence for a high dislocation density in the product plates (both bainitic and Wid- 
manstatten) just as is found in lath martensite formed from low carbon austenite. The 
dislocation density is found to be lower in ferrite plates formed at higher temperatures 
(see for example Bhadeshia and Christian [1990]). One explanation of the surface 
relief effects and the dislocations is that the bcc ferrite grows from fee austenite by a 
displacive mechanism. A reasonable explanation for the lower dislocation densities might 
be faster dislocation recovery at higher temperatures. Another view, see for example by 
Aaronson etal. [1990, 1994], is that the surface relief is caused by the glide of growth 
ledges with dislocation character along the ferrite/austenite interface. It is not clear, to the 
author of this review, what distinction there can be between a true displacive transfor- 
mation and one occurring by such glide of interface dislocations. If the structural 
transformation from one structure to the other can be accomplished solely by the glide 
of the growth ledges that are dislocations (as in the fcc-cph transformation) then the 
reaction is displacive in character even if accomplished by ledge migration, just as in the 
agreed case of Ag-Al. The question that appears to remain unsettled, however, is if the 
glide motion of the dislocation/growth ledges, on the bcc/fcc interface in Widmanstatten 
ferrite and in bainite, can fully accomplish the transformation in structure or if some 
additional, thermally activated, atomic motion, not present in the fully glissile martensite 
reaction, is required in Widmanstatten ferrite and in bainite. It is perhaps significant, as 
noted by Wayman [1994], that the current phenomenological theory of martensite, 
described in the following chapter, does not as yet fit two the types of martensite formed 
in Fe-C alloys, in low carbon, dislocated lath martensite and in, medium carbon, plate 
martensite with near {225} habit planes. Given this uncertainty about two closely related 
martensite reaction, it is perhaps not a critical failure of the displacive mechanism for 
bainite and Widmanstatten ferrite that a full description of the process exactly matching 
the phenomenological theory is not yet available. 

There has been, in the recent conferences on this topic, considerable analysis of the 
crystallography of the reactions involving plate precipitates showing surface relief. These 
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analyses include those of Aaronson et al. [1994], Aaronson and Hall [1994], 
Christian [1990 and 1994], Dahmen [1994], Hirth [1994], Howe [1994] and Muddle 
et al. [1994], It seems rather well established that the motion of growth ledges which 
have dislocation character in the fully or partially coherent precipitates can give rise to 
a lattice site correspondence even if the occupation of these sites may change, becoming 
more solute rich as solute rich phase grows into the low solute matrix. That is, there is 
be atomic site correspondence but not atom correspondence. In other words, as the 
solute, needed for the change in chemical composition between the two phases, reaches 
the growth ledges, then these move forward with site correspondence by glide of the 
Shockley partial dislocations, structurally the riser of the ledge, and this causes two 
[111] atomic layers to change from fee to eph in structure and in composition. Thermally 
activated atom exchange between the sites as they displace appears to be required to 
allow the thermodynamically required composition change to occur. For a Widmanstatten 
or bainitic ferrite plate to grow from an interstitial rich austenite the two coupled 
processes, on the displacive model, are the rapid diffusion away of interstitial atoms 
combined with the ferrite growth by lateral glide of the dislocation like growth steps — 
giving the observed shape change and a true atomic correspondence for the substitutional 
atoms. However, this view though widely is not fully accepted, see for example 
Aaronson etal. [1994], 

An interesting question that appears not to have been addressed in the debate on the 
displacive or reconstructive mechanism of bainite is the role of austenite grain size in 
helping or hindering the reaction. For most reconstructive reactions, a finer grain size 
promotes the reaction by providing heterogeneous nucleation sites, however for marten- 
site reactions a fine grain size is known to hinder the reaction, see for example Inokuti 
and Cantor [1982], A fine grain size is known to degrade hardenability but it is not 
clear if this is due solely to the known acceleration of the high temperature recon- 
structive transformations giving allotriomorphic ferrite and pearlite or if a fine grain size 
can also accelerate the formation of bainite. Experiments to answer this question would 
be of considerable interest. 

ii) Carbon supersaturation in growing bainite plates 

The second dispute concerns the carbon supersaturation in bainite. Hillert [1975, 
1994] has demonstrated, using the Trivedi analysis for the linear lengthening of both 
Widmanstatten ferrite plates and bainite sheaves that the lengthening reaction is rather 
well described by near carbon diffusion control local equilibrium. Figure 55 gives, as the 
solid line, the predicted kinetics based on a simplified model of carbon diffusion- 
controlled lengthening. The predictions match the experimental points rather well despite 
the uncertainties in much of the data used. However, the agreement is based on a rather 
low value of the interfacial energy — only 0.06 J/ m -2 . A higher value would lower the 
theoretical line and would certainly increase the small deviations seen for some alloys 
especially at the lowest transformation temperatures of 650 K. The result of carbon 
diffusion-controlled lengthening appears to be fully accepted for Widmanstatten ferrite, 
seen at the higher temperatures in fig. 54, but this result is not universally accepted for 
bainitic ferrite, for example by Bhadeshia and Christian [1990] or by Olson et al. 
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Fig. 55. Experimental lengthening rates of Widmanstatten and bainitic a-femte in iron-carbon alloys as a 
function of the carbo supersaturation at different temperatures. The line is the rate predicated for carbon 
diffusion control that is, local carbon equilibrium at the a/fi interface. D is the diffusion coefficient of carbon 
in austenite, v is the growth velocity, T the absolute temperature the carbon content of the alloy and the 
equilibrium carbon content of y in equilibrium with a. After Hillert [1994], 


[1989, 1990].* 

Hillert [1994, 1995] has pointed out that given the known growth rate of bainitic 
ferrite combined with the high mobility of carbon at the temperatures of bainitic growth 
it is difficult to understand how there could be a significant difference of carbon activity 
across a slowly moving ferrite austenite interface. Local equilibrium for carbon across 
the ferrite and austenite interface does not, of course, mean that both phases may not be 
supersaturated with carbon with respect to the precipitation of carbide. Bainite formation 
typically occurs in a temperature well below that which at pearlite forms and for pearlite 
formation both austenite and ferrite are supersaturated with carbon, so this will be 


* The bainite reaction in Fe-C alloy steels is known as one in which there appears to be no partitioning of the 
substitutional, slow-diffusing solutes, (Cr, Ni, Mo, Si, etc.), see for example Bhadeshia and Christian [1990] 
and Stark etal. [1990]. The lack of measurable partitioning of the substitutional solutes during the growth of 
bainite is generally accepted. 
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certainly true for bainitic ferrite. The precipitation of carbide either at the austenite ferrite 
interface (upper bainite) or partially within ferrite for lower bainite is an essential feature 
of most but not all bainite reactions.* Precipitation of carbides during bainite growth 
has the important effect of allowing bainite plates to grow in “sheaths” of numerous 
adjacent plates. Growth of multiple plates in contact is not possible for Widmanstatten 



— log v (ftm/s) 


Fig. 56. Experimental lengthening rates of Widmanstatten a ferrite in two iron-carbon alloys (from fig. 55) and 
the experimental growth velocity of pearlite. After Hillert [1994]. 


* Hillert [1994] has shown using models based on the experimental growth rates for pearlite and diffusion 
limited linear lengthening rates for Widmanstatten and bainitic iron, that in plain carbon steels at temperatures 
below about 550°C that Widmanstatten ferrite lengthens faster than pearlite grows (fig. 56). His explanation of 
this faster low temperature growth of ferrite is in terms of the asymmetry of the extrapolated austenite 
solubilities below the eutectoid temperature. This explanation, however, seems rather incomplete given the 
differences between the theoretical kinetics for pearlite growth, §2.5.1 (eq. 70), and that for linear lengthening 
of Widmanstatten ferrite, eq. 64 § 2.2.6.2. The pearlite grows with a high energy ferrite/austenite interface with 
no orientation relationship, while the Widmanstatten ferrite grows with a good fit, low energy, interface. The 
energy of the interfaces partially determines the required carbon diffusion distances in both reactions and this 
may be a dominating effect at temperatures below the nose of the TIT diagram where the activation energy of 
diffusion dominates the kinetics. An increased driving force would be the dominant effect above the nose in 
the curve since above the nose, an lowering of transformation temperature increases the driving force and 
accelerates the reaction. However, as noted by Hillert [1994], the asymmetry which makes austenite at lower 
temperatures more supersaturated with respect to ferrite than to cementite is an additional reason why ferrite 
plates can grow faster than pearlite at low temperatures. 


References: p. 1497. 




1476 


R. D. Doherty 


Ch. 15, §2 


ferrite since here the excess carbon rejected by one plate will, by decreasing the super- 
saturation of iron, fig. 20a, prevent growth of an adjacent plate in the carbon enriched 
austenite next to the first plate. Enomoto [1994] has, by computer simulation, modelled 
the growth of ferrite plates as two adjacent ledges of height equal to a ledge height and 
again shown that plates of bainitic and Widmanstatten ferrite lengthen at rates controlled 
by carbon diffusion control. His numerical results support Hillert’s calculations. 

In contrast to this picture of apparent diffusion-controlled growth, Olson et al. [1989, 
1990] have modelled the bainite transformation as a displacive reaction. Their rather 
elegant models were for the velocity of a martensite-like plate growing with partial 
supersaturation of carbon (the interface reaction) matching the growth limited by removal 
of the additional carbon by the standard Ivantsov analysis, eq. 63. The essential feature 
of Olson etal.'s model is that carbon diffusion, away from the lengthening plate, reduces 
the barrier for the subsequent martensitic growth. Low carbon austenite can grow 
martensite at a higher temperature than can higher carbon austenite. The theory for the 
martensite reaction mobility was obtained from an earlier dislocation model of marten- 
sitic growth by Grujicic etal. [1985]. In the 1989 paper, the usual arbitrary, maximum 
velocity, assumption was needed to provide the determining criterion for growth, 
§ 2.2.6.2. In the later paper, however, the need for this additional criterion was avoided 
by use of a model for the velocity-modified change of carbon partitioning developed by 
Aziz [1982] for rapid solidification. The results of these calculations are shown in 
fig. 57. Figure 57a gives the calculated residual supersaturation in the bainitic ferrite in 
Fe-0.4wt% C for both plate nucleation and growth. Fully supersatured nucleation leads 
to martensite so the M s temperature is predicted to be close to 400°C, compare with 
fig. 49 for an alloy steel with 0.4wt% C. At higher temperatures, carbon depletion was 
needed for nucleation but fully supersatured bainite growth was predicted at the tip of the 
growing plates. At even higher temperatures, there was significant residual super- 
saturation even for growth and this supersaturation approaches zero, local equilibrium for 
carbon, only at about 550°C. At this stage, however, growth of bainite ceases, on the 
Olson etal model, at the bainite start temperature. The calculated TTT curve for bainite 
with a B % temperature of about 550°C is shown in fig. 57b. The conflict between the two 
calculated TTT diagrams for plane carbon steel in figs. 56 and 57 nicely illustrates the 
alternative models for the transformation. The obvious experimental solution to this 
conflict, is the experimental determination of bainite growth rates in plain carbon steel 
at 500 to 600°C. Unfortunately this growth is hidden from experimental examination by 
the competing pearlite reaction. In alloy steels however, where the pearlite reaction has 
been slowed down, there is indeed the well established “bay” in the TTT diagrams, see 
fig. 49, so here comparison is possible. This bay would appear to provide solid evidence 
in support of the displacive mechanism for bainite growth as predicted by the model of 
Olson etal. [1990]. However competing mechanisms giving rise to the known inhibited 
decomposition of austenite in low alloy steels, below the allotriomorphic ferrite and 
pearlite reaction temperatures and above those of bainite is then the subject of the third 
dispute in this field. 

iii) Mechanism of the incomplete reaction in Fe-C bainites. 

In many alloy steels there appears to be a well established upper temperature for the 
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Fig. 57. (a) Plot of the calculated normalized supersaturation (afV.r 0 ) of the carbon in ferrite as a function of the 
transformation temperature for an iron-0.4wt% carbon alloy for the situations of initial nucleation and for linear 
growth on a model for displacive interface mobility, carbon diffusion and solute trapping, (b) Calculated 
isothermal time temperature transformation diagram for the iron-0.4wt% carbon alloy filled points. The open 
points are from the authors’ earlier maximum velocity analysis. After Olson etal. [1990]. 
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formation of bainite, the bainite start temperature, B s , and below this temperature 
although the reaction starts, it is commonly reported as ending with only a partial 
transformation , see fig. 49. At B s , the fraction untransformed tends to 100%. This was, 
for example, found experimentally by Bhadeshia and Edmonds [1979] who noted, in 
an alloy of Fe-0.4% C-3% Mn and 2% Si, that at 450°C, 5% of the structure trans- 
formed to bainite formed after only 100 s, but there was little further transformation of 
austenite until, after 32 days, the remaining austenite then decomposed to pearlite. This 
type of behaviour appears to arise rather naturally from the models of Olson et al. 
[1990] as seen in fig. 57b. However there is an alternative explanation offered by 
proponents of the reconstructive viewpoint. This explanation is that the metallic alloying 
elements added to Fe-C to improve “hardenability”, manage to inhibit bainite growth by 
a “solute drag” mechanism. This has been argued recently for example by Aaronson et 
al. [1990], Aaronson and Hall [1990], who cite the earlier references, and by Hillert 
[1994, 1995]. Solute drag is well established for its role in inhibiting grain boundary 
motion in recrystallization, where very large reductions in grain boundary mobility are 
found for additions of a few parts per million of substitutional impurities (ch. 28, 
§3.4.1). The drag is proposed to occur both directly, as in recrystallization, and in an 
additional way, in the decomposition of austenite, by changing the activity coefficient of 
the interstitial solute. However it is not immediately obvious how it can cause the 
apparent complete stasis of the bainite reaction while, at a higher temperatures the 
mobility inhibiting (hardenability) effects of the solute merely slow down but do not halt 
the growth of ferrite and pearlite. It is also not clear how solute drag can allow a certain 
fraction of bainite to form quickly but then to stop further bainite nucleation and growth. 
In addition, it might be noted that current models of solution hardening by mobile solute 
atoms, such as Mg in Al, see for example Schmidt and Miller [1982] suggest that 
solute drag effects are responsible for inhibiting the movement of dislocations and thus 
promoting solution hardening and in particular inhibiting dynamic recovery during 
working and thus promoting strain hardening. If solute can inhibit dislocation migration 
under the influence of mechanical stress, solute does seem capable of exerting equivalent 
solute drag on at least slowing moving displacive interfaces. Martensitic interfaces may 
move too quickly for any significant solute drag effects which might account for the lack 
of significant solute hardenability effects of the formation of lath martensite in steel, 
beyond the solute effects associated with the thermodynamic stabilization of austenite by, 
for example C, Ni and Mn. All current models of solute drag can slow interface 
movement but they have not been shown to halt the movement. 

Bainite is clearly still a topic in which there is no universally accepted qualitative 
model for the basic mechanism of its formation. For the reactions described as bainitic, 
the evidence of surface relief is, for most students of the subject, critical evidence for 
some type of displacive interface reaction. For the group of investigators associated with 
Aaronson, a model of the surface relief based on glide of interface dislocations associated 
with growth ledges is argued as being, in some way, different from a true displacive 
reaction. It is agreed, for at least some reactions showing surface relief, the substitutional 
bainites, Cu-Zn, Ag-Cu, Cr-Ni etc., Widmanstatten ferrite, high-temperature bainite and 
in particular Ag 2 Al from Al, that surface relief is combined with a composition change 
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so that there is at least some reactions that do have these two characteristics, and which 
are currently describable as “bainitic”. What is clearly not yet established is (i) if the 
dislocation glide causing the surface relief is the only atomic processes needed for the 
structural transformation and (ii) what is the level of solute supersaturation in many of 
the reactions, with the largest discrepancy for the lower bainite in Fe-C forming at 
temperatures close to the martensite start temperature. There is also disagreement on the 
origin of the incomplete bainite transformations at high temperatures. 

Unusually for a topic in metallic solid-state phase transformations, the problem is 
here a lack of agreed theory rather than a shortage of quantitative data. It might be noted 
that there appears to be no disagreement that there can be: 

(i) Fully reconstructive reactions with both a change of composition and a civilian 
change of structure, for example growth of grain boundary ferrite or pearlite growing 
behind interfaces with no lattice coherency. 

(ii) Fully martensitic reactions with no change of composition (i.e., partitionless) and 
with a military or displacive change of structure. Examples are the lath and plate 
martensite reactions in Fe-C and Fe-Ni alloys. 

(iii) The so-called “massive” reactions. These are the formation of a different 
structure or orientation by a reconstructive interface process with no change of com- 
position, that is they are as in (ii) partitionless. Examples of the latter include the 
massive reaction in iron and many other metallic systems, described by Massalski 
[1970], In addition of course are the recrystallization and grain growth involving grain 
boundary migration in non supersaturated alloys. 

There appears to be no reason why the final combination cannot be found. This is: 

(iv) A diffusional change of composition with a displacive lattice transformation. This 
is agreed to occur in the Widmanstatten precipitation of Ag 2 Al in AlAg. 

The question that remains to be settled is: Does bainite and or Widmanstatten ferrite 
growth actually occur in this manner? There is an related and rather interesting, 
unanswered, question: Do all plate-like products, growing coherently in a crystalline 
matrix, have a displacive interface mechanism? 

Although it is not fully accepted that bainite does form by a displacive lattice 
transformation, in the absence of any satisfactory alternative explanation of the surface 
relief seen in bainite the name “bainitic” for a displacive transformation occurring with 
some change of chemical composition seems currently satisfactory. It would therefore be 
appropriate to describe the precipitation of silver-enriched cph y from fee a in Al-Ag, 
where the interface reaction is established to be displacive, as a further example of a 
bainitic reaction. If in the future, a non displacive origin was ever established for the 
surface shear in the bainite reaction in steel, then a new name would be needed for the 
reaction identified in Al-Ag as occurring by a combination of a change of composition 
and a martensitic, fully displacive, interface process. There seems, however, no need to 
invent a new name, at least at present. That is, in the terms used by Aaronson et al. 
[1990, 1994], though clearly not with their approval, this reviewer has adopted the 
“surface relief’ definition of bainite for reactions involving a change of composition as 
well as surface relief. 
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3. Continuous transformations 

Although the vast majority of phase transformations in physical metallurgy are of 
Gibbs type I-involving nucleation and growth, Gibbs second mode of transformation, 
type II, now usually called continuous transformations, are important and interesting 
diffusive transformations. A continuous transformation can only occur if the initial and 
final structures share either a common crystal lattice or, as in liquids and glasses, a lack 
of a lattice. 

The initial example of a continuous transformation was spinodal decomposition, the 
development of a composition fluctuation on a scale larger than that of the interatomic 
spacing. This transforms a phase, a, with an initially uniform solute composition into two 
phases of the same lattice type, a' and a", one solute depleted, the other enriched. The 
reaction is solely a diffusion reaction involving atoms exchanging lattice sites. Although 
binary equilibrium diagrams indicate only a few potential examples of equilibrium spino- 
dal decomposition in metals, there are numerous metastable reactions that could occur by 
the continuous mechanism (see Cahn [1968] and Soffa and Laughlin [1983]). A 
second example is the process of continuous ordering, see Cook et al. [1969] and DE 
Fontaine [1975]. This consists of a composition modulation but with a wavelength of 
the interatomic spacing, transforming a disordered solid solution into an ordered one. 


3.1. Spinodal decomposition 

Two excellent early reviews of this subject were provided by Cahn [1968] and 
Hilliard [1970]; The reviews gave clear accounts of the basic theory and they reviewed 
the relevant early experimental observations. The following will therefore give an outline 
of the initial theory, following Cahn [1968], and will then try to bring out the important 
advances made subsequently. A later review of the topic was provided by Soffa and 
Laughlin [1983] who considered the not uncommon reaction in which an ordered 
coherent precipitate forms from a disordered solid solution. 

In a region of a single phase where the curvature of the free-energy composition plot, 
d 2 F/dC 2 , is negative (fig. 4) uphill diffusion can occur, as this lowers the free energy. In 
the literature on spinodal decomposition, the discussion is usually in terms of this second 
differential, while in the literature on diffusion, the discussion is usually based on the 
“Darken term”, 1 + d ln£/d In C a , derived by Darken [1948] in his analysis of diffusion, 
ch. 7, §5. 3. 1.2). If D a * and D B * are the tracer diffusion coefficients of the two compo- 
nents of atomic fractions C A and C B , then the interdiffusion coefficient, D, is given by: 

D = (C A ZV +C B Z> A *) 1 + (77) 

It is readily shown, using standard solution thermodynamics, e.g., by Martin and 
Doherty [1976], that: 


d 2 F 
d C 2 


= wl 


l 

1-C B 


+ 



1+dln f a 
dlnC„ 


(78) 
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From here onwards, the atomic fraction C will stand, as elsewhere in this chapter, for the 
atomic fraction of the second component, B. So C A = 1 - C B = 1 - C. 

From eqs. (77) and (78), we obtain: 

D = (1-C)d; + CD;^--^^ (79) 


D = M d 


d 2 F 
dC 2 


(79a) 


Equations (79) and (79a) define the diffusional mobility, M D * 

As a consequence of the sign of D being determined by the sign of d 2 F/dC 2 , a 
homogeneous solid solution with d 2 F/dC 2 <0, is unstable in the presence of an initial 
infinitesimal fluctuation (fig. 5). So any such sinusoidal perturbations of initially minimal 
amplitude can grow in amplitude. The crucial problem concerns the wavelength of the 
perturbations that can grow. To discuss this, the thermodynamics of inhomogeneous solid 
solutions are needed (Cahn and Hilliard [1958]). If there were no restrictions as to a 
minimum wavelength, then the growth of short wavelengths, requiring diffusion over the 
shortest distance, would grow fastest, leading to the microstructure with the shortest 
possible wave length, an interatomic spacing. Unfortunately this yields an ordered solid 
solution. However spinodal decompositions only occur in systems that wish to phase- 
separate: They need to maximize the number of like neighbour bonds. An ordered solid 
solution however minimizes the number of like neighbour bonds. Some additional 
thermodynamic feature is needed to avoid this failure of an over-simple model. 

For an homogeneous AB alloy in which we exchange a mole of A for a mole of B, 
the change of free energy, A F, is: 


A/r = Ma _ 

v m dC 


(82) 


Here p A is the chemical potential of A, p. A = {dF/d« A ) T p Ca Cb , where n A is the number 


* It has been recently pointed out, Martin etal. [1995] that there is a simple physical explanation for the need 
to use the interdiffusion coefficient, D, rather than D B , where D B = D B * (I +dlnjfe/dlnC 0 ), in the second law 
of diffusion, eq. 80. 

(6C7S/)„=i>(S 2 C'/8jr) 1 (80) 

C is the volume concentration of B atoms, (C = C/V, where V, is the volume per atom) so the use of D a might 
have been expected. In the presence of a Kirkendall effect, where D A * A D B *. and where as a result there is also 
a change of volume, the concentration of B atoms in a small region, (8C/8t),. is affected not only by the flux 
of B in and out of the region but also by the change of volume of the region. Rederiving the second law in the 
standard way but in the presence of a Kirkendall effect, yields either eq. 80, Darken [1948], or a modified 
second law, eq. 80a 

(SC /St), = D B (8 2 C/8xr\ + C(8v/Sx\ (80a) 

Here v is the velocity of interface planes, as measured by the velocity of inert Kirkendall markers. The velocity 
v is given by Darken [1948] as: 

v= VfD B -D B (dC /dr) = (D n -D B )(dC/ dr). (81) 
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of moles of A, /x B is the chemical potential of B and F is the free energy of unit volume 
of the homogeneous solution. In this simple treatment it is assumed that the atoms are of 
equal size; a correction to account for the change of lattice parameter with composition 
in a real system is introduced later. 

The diffusive flux, J, is then given by the usual equation, the first law of diffusion; 

J = -DVC (83) 

with D given by eq. (79), and C is C/V m , the concentration expressed in atoms of B per 
unit volume. 

From the thermodynamics of inhomogeneous solid solutions, a correction for 
inhomogeneity is introduced as: 


dF v 

dC 


dF v 2 KV 2 C 
dC V ra 


(84) 


A' is a positive (for systems that wish to un-mix) gradient-energy coefficient which is 
determined by the difference in number of like atomic neighbours between an atom in a 
homogeneous alloy and an atom in an alloy which has a variation in composition. Cahn 
[1968] pointed out that the first differential, VC, will have no effect on the free energy 
since with a linear gradient, VC, any excess of one component in one direction will be 
balanced by an equal depletion of that component in the opposite direction. The first 
term in the series to produce an effect is V 2 C. The value of K is given by; 

K = N'lcT'i/j 2 , (85) 


where T c is the critical temperature, below which homogeneous alloys will wish to unmix 
into 4-rich and fi-rich regions, and i p is the chemical “interaction distance” of atoms 
(Cahn [1968]). In regions where V 2 C>0 the effect of K is to lower the free energy of 
B atoms since they sense a higher density of like atoms than they would do in a 
homogeneous solid solution. Where V^C < 0 the effect of K is to raise the free energy of 
B atoms since they sense a lower density of like atoms than they would do in a 
homogeneous solid solution. 

Comparison of a solution with a sinusoidal variation of composition about an average 
composition C 0 with the same alloy with a uniform composition C 0 shows, fig. 5, that the 
microstructure with the composition modulation has a higher free energy with K> 0. This 
occurs since when C>C 0 V 2 C<0, and with C<C 0 , V 2 C>0, so there are more atoms 
having their free energy increased than there are atoms having their free energy decreas- 
ed. The total free energy increase, due to the gradient energy term, with K> 0, increases 
both with an increase of amplitude and with a decrease in wavelength. Both of these 
changes increase the magnitude of the composition curvature, V^C. This extra K term is 
the new thermodynamic parameter needed in the analysis of continuous transformations 
in general and spinodal decomposition in particular. Physically, it acts in a similar way 
as does the interfacial energy in nucleation and growth reactions. 

Substitution of eqs. (79a) and (84) into the diffusion equation, eq. (83) gives: 
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J = 


-M d d 2 F 
dC 2 VC' 


+ 2 M d KV 3 C'. 


( 86 ) 


The change of composition with time, dC/dt, is obtained in the normal way for the 
derivation of the second law of diffusion (e.g., Shewmon [1963]), as: 


d C _ V m d C 
dt dr 


M D d 2 F 
dC 2 V 2 C 


2M d KV*C. 


(87) 


Cahn [1961] showed that this differential equation has the following solution. 
C = C 0 + exp (/?(/?) t) cos(jSr). const., 


( 88 ) 


with C 0 = initial composition;/? = wave number, 27r/A w ;A w = wavelength of the particular 
fluctuation. The “amplification” factor R(fi) is given by: 


*03) = -A/ D 0 ‘ 


r d 1 F 
d C 2 


+ 2 K0‘ 


(89) 


Since the diffusional mobility, M D , is inherently positive, it can be seen, from the term 
in brackets in eq. (89), that in a system showing unmixing, when K is positive, short- 
wavelength fluctuations will decay since K(3 2 >-d 2 F/dC 2 for large values of /3, but 
fluctuations below a critical wave number, /?*, that is with wavelengths longer than a 
critical wavelength, A*, can grow: 


/?* = 


' d 2 F 1 
dC 2 2 K; 


1/2 


(90) 


The corresponding critical wavelength, A*, is 27 t//3*. 

The fastest growing wavelength, A max , is determined from eqs. (88) and (89) as the 
wavelength having the maximum amplification factor, R( )0). This occurs at /3 *l\j2 , 
owing to the double effect of an increasing wave number in (i) reducing the diffusion 
distance, the term outside the bracket in eq. (89), as well as its effect in reducing the 
driving force, the terms inside the brackets. This is a similar result to that previously seen 
in other areas of phase transformation by the balance between interfacial energy requiring 
large spacings and rapid diffusion requiring short ones (§ 2.2.6 and § 2.5). In the present 
case, all wavelengths will be found to be present, but with initially very small ampli- 
tudes. These arbitrary amplitudes provide the undefined “const” in eq. (88). This result 
comes from the mathematical idea that any perturbation can be represented as a sum of 
sine or cosine waves. In addition, at very small amplitudes, all wavelengths are able to 
grow independently, so the fastest-growing wavelength will be present and will be able 
to grow faster than all other wavelengths and so must dominate the decomposition at 
least initially when the theory, given above, applies. 

An important addition to the theory comes from considerations of elastic strains that 
arise when there is a change of lattice parameter with a change of alloy content. With r) 
defined as the unit strain per unit composition difference, r] =(da/dC)/a = (dln a/dC), Y 
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is Young’s Modulus and v p is Poisson’s ratio, eq. (89) is changed by the elastic strain to: 




(89a) 


The strain term, in eq. (84a), acts, in addition to the gradient-energy term, to inhibit the 
reaction. It is often more convenient, however, to consider the second term together with 
the first term, as defining the coherent spinodal region. This is a region, in the phase 
diagram, in which the sum of the first two terms in the parentheses of eq. (89a) is 
negative. Only within this coherent spinodal region can fluctuations develop while the 
crystal remains fully coherent, but elastically distorted, as the solute-rich and solute-poor 
regions, with different values of the unstrained lattice parameter, a, interacting with each 
other within an elastically distorted lattice. 

Rundman and Hilliard [1967] tested the model for spinodal decomposition by 
small-angle X-ray scattering experiments with Al-Zn alloys. Al-Zn has a phase diagram 
very similar to that shown in fig. 3. Their results, fig. 58, show the behaviour expected 
for spinodal decomposition. The alloy, Al-22at% Zn, had been quenched from the single- 
phase, all fee, region and annealed at 65°C for the times indicated. The critical wave 
number, /?*, where there is no change of scattered intensity with longer aging time is 
seen, as the“cross-over” point. Also seen is the maximum rate of increase of intensity at 
a wave number about 0.7 of /3*. The interpretation of small-angle diffraction is discussed 
in ch. 12, § 5.1. Other experimental demonstration of the importance of the gradient term 
in solid solutions with short wavelength composition fluctuations were diffusion 
experiments in stable solid solutions, made inhomogeneous at a very short wavelengths 
by repeated depositions of alloys of different compositions, see in particular Cook and 
Hilliard [1969] and Philofsky and Hilliard [1969], Other early experimental studies 



Fig. 58. Small-angle X-ray spectra for an Al-22 at% Zn alloy quenched from 425°C and annealed at 65°C for 
the times indicated (after Rundman and Hilliard [1967]). 
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that supported the linear spinodal model described by Cahn [1968] were reviewed by 
Hilliard [1970], 

Following the initial development of Cahn’s theory of spinodal decomposition, there 
have been various modifications to the theory to try to deal with later stages in the 
reaction when the linear model of independent growth of all fluctuations is no longer 
valid. A further important modification was consideration of the influence of random 
thermal fluctuations, “Brownian Motion”, on the process. Random thermal fluctuations 
will give local increases of energy paid for by the increase of entropy that comes from 
the “disorder” introduced by the small groups of atoms having the increased energy. In 
nucleation theory this process gives the vital eq. (11) for the probability of creating a 
nucleus with a local increase of energy A/*. Cook [1970] introduced this idea of random 
solute movements into a treatment of spinodal decomposition and he showed that the 
effects seen in experiments on small-angle scattering could be fitted to the changes 
proposed by his model. The effect of the thermal fluctuations is essentially that for alloys 
close to the spinodal boundary the distinction between spinodal decomposition and the 
nucleation and growth of coherent “zones” becomes much less clear-cut than in the 
original model of spinodal decomposition. 

Langer [1975] discussed these ideas in more detail and showed that the initial model 
of spinodal decomposition, as described above, predicts that at the spinodal point, when 
d 2 F/dC 2 = 0, the critical wavelength becomes infinite and the only mechanism for 
transformation will then be by the nucleation of solute-rich “Guinier-Preston zones”. 
However, nucleation of such zones will occur frequently with very small critical radii 
and with the very low-energy, “diffuse” interface expected in these circumstances*. In 
other words, nucleation will result in a structure very much like that of spinodal 
decomposition. The same picture is given in recent models of spinodal decomposition 
modifed by thermal fluctuations. Figure 59a shows the structure factor S computed by 
Langer etal. [1975] as a function of q, a temperature-modified wave number and t, a 
modified annealing time, for an alloy held at the spinodal boundary. It can be seen that 
a spinodal-like fluctuation builds up even with d 2 F/dC 2 = 0, but the maximum intensity 
shifts to larger wavelengths with increased reaction times. 

The other modification to the theory deals with the later time development of the 
fluctuations. The results, also computed by Langer et al. [1975] for the expected 
development of the structure factor for an alloy at the centre of the spinodal region are 
shown in fig. 59b; q = 1 corresponds to the critical wave number (3*. The value q p is 
f}*/\f2 , the fastest growing fluctuation in the linear theory. It can be seen that for short 
times, this is where the maximum growth of the intensity actually appears. However, at 
longer times, the peak in the structure factor, the wavelength with the strongest intensity 


* Simple application of the “quasi-chemical bond” model to a coherent interface between solute-rich and solute- 
poor zones shows that the interfacial free energy, rr, is reduced at relatively high temperatures by making the 
interface extend over several atomic layers at some increase in energy but with a compensating increase in 
entropy Cahn and Hilliard [1958]. Atom-probe FIM has shown, however, that at temperatures much lower 
than the critical temperature, 7), where the solid solution becomes unstable, the coherent interface (in Cu-1.9 
at% Ti) was atomically sharp (von Alvensleben [1982]). 
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Fig. 59. Computed structure factor as a function of modified wavenumber for increasing times, r, for (a) an 
alloy at the edge of the spinodal region and (b) an alloy at the centre of the spinodal region. (After Langer et 
al. [1975].) 

moves to smaller wave numbers, larger wavelengths. In addition, the peak intensity does 
not continue to grow exponentially with time. These changes appear to indicate the 
movement towards the LSW-type coarsening, with larger wavelengths growing at the 
expense of shorter wavelengths. Within spinodal decomposition, a process that is 
equivalent to interfacial energy-induced coarsening appears to be occurring during the 
decomposition and is not restricted to the last stages of the reaction when the solute-rich 
and solute-poor regions have discrete interfaces and are close to the equilibrium form of 
a' and a". Coarsening is expected with spinodal decomposition even before any well 
defined interfaces have formed. It arises directly from the gradient energy term, K. As 
described above, the increase of energy due to the K term in eq. 84, for a composition 
fluctuation of a given amplitude increases as the curvature rises due to shorter wave- 
lengths and higher composition amplitudes. So competition between wavelengths will 
favour the development of longer wavelengths at the expense of shorter ones, with the 
same composition amplitude. This is a true coarsening process. 

Experimental data on the later stages of spinodal decomposition are shown in fig. 60 
for Al-22at% Zn (Hilliard [1970]). The data here are for a higher temperature, 150°C, 
than in fig. 57 which was obtained at 65°C, so the diffusional unmixing is much more 
advanced than in the earlier study. It is seen in fig. 59 that the peak of intensity is shifted 
to much smaller wave numbers as the reaction proceeds; this is exactly the effect pro- 
duced in the computer model of Langer et al. [1975]. Assuming an activation energy for 
diffusion of Zn in Al of 120 kJ/g-mole gives the ratio of diffusion coefficients at the two 
temperatures, D( 1 50°C)/£>(65°C), as 5000. The much higher intensities in fig. 51 than 
in fig. 49 can be seen, confirming that the reaction is, indeed, much further advanced. 

Tsakalakos [1977] and Tsakalakos and Hilliard [1980] provided some analytical 
insight into the later stages of spinodal decomposition, when the compositional amplitude 
of the fluctuation is no longer small, but begins to approach the difference in compos- 
ition A C a . a ~ between the solute-rich and solute-poor regions of the two phase systems 
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Fig. 60. Experimentally observed small-angle X-ray spectra for an Al-22 at% Zn alloy annealed at 150°C for 
the times indicated (after RundmaN etal. [1970]). 

(fig. 3). The difficulty is readily seen from the free-energy-composition curve (fig. 4), 
since, as the composition fluctuation reaches the spinodal points, d 2 F/dC 2 =0, the driving 
force for further unmixing by spinodal decomposition then vanishes. It should be noted 
that at the spinode points where, d 2 F/dC 2 = 0, solute transfer will occur easily across the 
“interface” since the chemical potential of the solute, the intercept of the tangent to the 
free energy curve is higher at the low composition spinode point than at the high- 
composition spinode point. Ditchek and Schwartz [1980] discussed the theory and 
extend it beyond the single wavelength considered by Tzakalakos, to a range of 
wavelengths. For a single wavelength the amplitude grows until it reaches a critical wave 
form which has a tanh (fir) function (Cahn and Hilliard [1958, 1959]). 

The system can then continue to lower its free energy by perturbing the wavenumber 
to smaller values; this will reduce the gradient energy which is opposing further 
decomposition. Ditchek and Schwartz’s extension, to consideration of a range of 
wavelengths, allows the reaction to continue by the growth in amplitude of waves with 
smaller wavenumbers. In the application of the analysis to experimental results in 
spinodally decomposing alloys, several fitting parameters are required. These take into 
account the initial composition waves produced during the quench after the solution 
treatment in the single-phase region. This is a weakness in their test of the theory, but 
nevertheless the comparison of the experimental results and the theory, shown as the 
points and the solid lines, respectively, in fig. 61, is rather satisfactory. The results were 
obtained for a Cu-10.8at% Ni-3.2at% Sn alloy, solution-treated at 800°C, quenched to 
room temperature and then aged at 350°C for various times, before examination of the 
composition fluctuation; e =0.015 corresponds to the metastable equilibrium value of the 
composition difference of about 2.4at% Sn between the two phases; the modulation is 
almost entirely in the tin content. Figure 61 shows satisfactory agreement with the theory 
and, in addition, various features of the theory and the results can be seen. These features 
include the initial growth of amplitude at a fixed wavelength, A w = 5 nm (50 A); the early 
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Fig. 61. Experimental results (points) and theoretical predictions (solid lines) for the amplitude, e, and dominant 
wavelength, A, for the spinodal decomposition of Cu-Ni-Sn after slow and fast quenches (after Ditcher: and 
Schwartz [1980]). 


departure from exponential growth of the amplitude, which occurs before the onset of the 
increase in the dominant wavelength. Finally it can be seen that the faster quench has 
given an initially smaller modulation in composition than does the slower quench. In the 
Cu-Ni-Sn alloys the tin-rich regions show an ordering reaction to give a D0 22 super- 
lattice at later stages of the decomposition. 

In a high-resolution study using the field-ion microscopy/atom-probe technique, 
Biehl and Wagner [1982] were able to study the details of spinodal decomposition in 
an alloy of Cu-2.7at% Ti. They reported a steady rise in the titanium content of the Ti- 
rich regions as the reaction progressed at 350°C indicating a spinodal process. This rise 
in solute content, saturated at 20at% Ti, the composition of the ordered phase, Cu 4 Ti. 
The dominant wavelength grew with time both during, and after, the time that the 
modulation was increasing in magnitude, with the dominant wavelength of the modulat- 
ion increasing with reaction time to the j power. The diameter of individual titanium-rich 
clusters also grew, in this case with time to the } power, the time exponent expected for 
LSW coarsening both during and after the change in solute content. 

It is striking that in the study of decomposition of Ni-14at% A1 by Wendt [1981], 
previously discussed in § 2.2.6, it was clear that the reaction in the nickel alloy was much 
better described as a nucleation-and-growth reaction, rather than as a spinodal reaction. 
This was seen since the solute-rich regions, when first detected, had the equilibrium 
composition of Ni 3 Al. Wendt had used the same techniques, FIM/AP, as had Biehl 
[1980]. It seems surprising that an alloy whose composition, 14at% Al, is quite close to 
the composition of the final precipitate, should show a clear nucleation behaviour, while 
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the Cu-2.7at% Ti alloy that was much further from the composition of the ordered 
phase, Cu 4 Ti, should show all the indications of spinodal decomposition. The difference 
may be due to there being a very different form of the free-energy-composition curve in 
Cu-Ti at 350°C than in Ni-Al at 550°C (their homologous temperatures, T/T m , are 0.46 
and 0.48, respectively); this is discussed further in §3.2. von Alvensleben [1982] also 
found by FIM/AP that a Cu-1.9at% Ti alloy decomposes by nucleation and growth. So 
the spinodal point appears to be be between 1.9 and 2.7at% Ti at 350°C. Subsequent 
reanalysis by von Alvensleben and Wagner [1984] showed, however, that while 
Cu-1.9at% Ti certainly decomposed by a nucleation, growth and coarsening mechanism, 
§2.1.5., but it was less certain that decomposition of the 2.7% Ti alloy was by spinodal 
decomposition. Soffa etal. [1984] described FIM/AP studies of spinodal decomposition 
in Fe-Cu at times where the metastable copper rich regions remained bcc and in all bcc 
Fe-Cr alloys. 

The classic electron microscopy studies of spinodal decomposition were provided by 
Butler and Thomas [1970] and by Livak and Thomas [1971] using ternary Cu-Ni-Fe 
alloys. In the first investigation, a symmetrical composition (51.5at% Cu, 33.5at% Ni, 
15at% Fe) near the centre of the spinodal region, was investigated. In the second study, 
an asymmetrical composition (32at% Cu, 45.5at% Ni, 15at% Fe) nearer to the edge of 
the spinodal region was used. In both studies, Curie-temperature measurements, which 
are very sensitive to the composition of the Fe-Ni rich phase, supplemented the 
microscopy. In the “as-quenched” alloys no sign of decomposition could be seen but, 
since the scattering factors of the components are similar in the Cu-Ni-Fe alloys, any 
initial perturbations will be very difficult to detect, unlike the case in the Cu-Ni-Sn alloy 
used by Ditchek and Schwartz [1980], Butler and Thomas showed that the waves 
developed along <100>, which are the elastically softest directions. They also found that 
the two-phase structure was initially composed of rod-like particles with diffuse 
interfaces, but the “precipitates” developed planar interfaces with extended coarsening. 
At long times of ageing, the interfaces lost their coherency by the usual formation of a 
cross grid of edge dislocations. 

Butler and Thomas [1970] reported that the wavelength of the modulation 
coarsened, with a time to the power y dependency, both during the later stages of the 
reaction and also earlier while the Curie temperature was varying. The change in the 
Curie temperature indicated a variation in composition of the copper-depleted regions, 
that is, spinodal decomposition was still occurring. For alloys aged at 625°C, these 
simultaneous changes can be clearly seen, fig. 62a, and these results confirm that the 
coarsening reaction is occurring during the initial decomposition as well as after the 
decomposition. In their study on the asymmetrical Cu-Ni-Fe alloys, Livak and Thomas 
[1971] gave similar results, but there were some important distinctions. One of these 
distinctions was the slower development of the composition fluctuations; the change of 
Curie temperature continued for 100 h at 625°C for the asymmetrical alloy (fig. 62b) as 
distinct from only 1-5 h (fig. 62a) for the symmetrical alloy. This distinction is to be 
expected since the value of d 2 FVdC 2 is smaller in alloys away from the symmetry point. 
In the asymmetrical alloy there was, in addition, no sign of growth in the dominant 
wavelength for nearly 10 h at 625°C (fig. 61b) so that coarsening appears also to develop 
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(a) (b) 

Fig. 62. Variation of the Curie temperature and the dominant wavelength with time on ageing at 625°C, for two 
Cu-Ni-Fe alloys: (a) the symmetrical alloy (after Butler and Thomas [1970]); (b) the symmetrical alloy with 
32 at% Cu (after Livak and Thomas [1971]). 


later in the asymmetrical alloy than in the symmetrical alloy. 

In an important extension of the application of TEM to spinodal decomposition, 
Sinclair etal. [1976] and Wu et al. [1978] used high resolution lattice imaging tech- 
niques to demonstrate the change in lattice parameter on a very fine scale, produced in 
spinodally decomposing alloys. Figure 63 shows the measured variation in lattice spacing 
in a Cu-29at% Ni, 3at% Cr alloy aged at 700°C for 10 min. The apparent wavelength of 
the fluctuation, 4.8 ±0.8 nm, was very close to that obtained from electron diffraction, 
5 ± 0.5 nm, in the same sample. In this high resolution study, it was clearly shown that, 
as would be expected, the interface was diffuse at the early stages of the reaction and 
became much sharper at the later stages; the lattice was then no longer continuous. 

3.2. Continuous ordering 

In the discussion of nucleation-and-growth reactions in § 2, no specific discussion was 
offered for the precipitation of ordered phase from a disordered matrix, since there seems 
no real distinction between growth of an ordered phase from a disordered matrix and the 
nucleation and growth of any other phase, in a Gibbs type-I transformation. For 
continuous reactions, however, continuous ordering is distinctly different from spinodal 
decomposition and will therefore be briefly considered here as separate reaction. Much 
of what follows is derived from the original paper by Cook etal. [1969] and the detailed 
review by Soffa and Laughlin [1983]; reference should also be made to reviews by de 
Fontaine [1975, 1981] and by Cook [1976] for more detailed accounts of the topic. 

It has already been mentioned that in Cu-Ti and in Cu-Ni-Sn the end-result of 
spinodal decomposition can be the formation of an ordered structure in the solute rich 
regions of the microstructure. That is, spinodal decomposition of the solid solution is the 
first continuous reaction and then order later develops as the composition increases 
towards the ideal value for a particular long-range-ordered structure. However, con- 
tinuous ordering can occur without change of composition if the alloy is close to the 
correct composition. This was analyzed by Cook et al. [1969]. 
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Fig. 63. The variation in fringe spacing from lattice images of a spinodally decomposed Cu-Ni-Cr alloy, aged 
at 700°C for 10 min (after Wu et al. [1978]). 


The spinodal decomposition model discussed in §3.1 was based on a continuum 
analysis (Cahn [1961, 1962]). However, Hillert [1956, 1961] had previously con- 
sidered a discrete lattice model for decomposition, albeit in only one dimension. Hillert’s 
approach was extended by Cook, de Fontaine and Hilliard [1969] to a three- 
dimensional, discrete lattice model. Their model considered not only spinodal decom- 
position, the unmixing reaction with a negative d 2 F/dC 2 and a positive gradient energy 
parameter K, as discussed previously, but also the opposite case, a strongly positive 
d 2 F/dC 2 , and a negative K, a situation expected for a system that can go, without a 
change in composition, from a disordered high-temperature structure to an ordered low- 
temperature structure, as the temperature falls below a critical temperature T c . 

The results of the discrete lattice calculation are very similar to the continuum model 
in the spinodal case. Figure 64 shows the amplification factor, here called a, as a 
function of a/k , where a is the simple cubic lattice parameter, for three situations. Figure 
64a is for a spinodal decomposition, that is, with negative d 2 F/dC 2 and positive K. The 
critical wavelength, A c , and the fastest-growing wavelength, A m , are shown. Figure 64b 
shows the amplification factor for a continuous ordering reaction, with positive d 2 F/dC 2 
and negative K, where the maximum amplification occurs at a/A = 1, that is, the maxi- 
mum and minimum in composition occur on adjacent lattice sites, giving an ordered 
structure. Figure 64c shows the situation with a negative amplification factor at all values 
of a/k, which will occur above the critical instability temperature 7’ inslab . This temperature 
is predicted by this treatment to occur when: 


d 2 F _ -32 K 
dC 2 ~ a 2 


(91) 


In a real situation, in any one region of the lattice, perturbations of different “phase” will 
conflict with each other, but eventually one perturbation will dominate the rest and 
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Fig. 64. The amplification factor, a , for the discrete lattice model of continuous reactions: (a) for spinodal 
decomposition; (b) for ordering below the instability temperature; (c) above the instability temperature, a is the 
lattice parameter of the simple cubic structure. (After Cook etal. [1969].) 


establish a particular domain of order in that region. The order in each domain will then 
steadily increase. Different regions will be dominated by waves with differing phases of 
modulation, giving the multiple domain structure of the ordered alloy. (For a discussion 
on domain structure see ch. 4, §4.2) Cook etal. [1969] extended the analysis to bcc and 
fee lattices. As in the original continuum model of spinodal decomposition, the analysis 
of Cook etal. [1969] is restricted to small perturbations where d 2 F/dC 2 and K remain 
constant, so again only the early stages of the reaction are described by this theory. 
Extensions to the initial discussion on continuous ordering have been provided by de 
Fontaine [1975, 1981] and by Cook [1976], who also reviewed some of the early 
experimental investigations on continuous ordering. 

Soffa and Laughlin [1983] reviewed the considerable amount of published work on 
continuous ordering reactions but they concentrated mainly on the more commonly 
studied case of decomposition of metastable solid solutions in which composition 
fluctuations occur on a large scale, spinodal decomposition, with ordering developing in 
the solute-rich regions. They discuss only a few studies of continuous ordering occurring 
without any change of compositions. However, the two-stage reactions of the spinodal 
decomposition to give an ordered precipitate discussed by Soffa and Laughlin are 
probably more important technologically. 

Chen and Cohen [1979] investigated a process of continuous ordering without 
changes of composition, by studies on the intensities of scattered X-rays at superlattice 
positions in various alloys at temperatures above the ordering temperatures. For Cu 3 Au, 
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apart from some additional, heterophase, fluctuations just above the critical temperature, 
the data behave in the expected way for this type of theory (Cook [1976]). Extrapolation 
indicated an instability temperature, below which continuous ordering is expected. 
Between T c and T inslab , ordering is expected to occur but by a nucleation and growth 
mechanism. Similar results were reported by Chen and Cohen for a higher copper alloy, 
for Co-Pt and for Fe-Al. Continuous ordering has also been demonstrated in /3 -brass 
(Kubo etal. [1980]) and in Ni-Mo alloys as reviewed by de Fontaine [1975, 1981], It 
appears from the present data that continuous ordering can occur in the manner proposed 
and that to date the proposed model appears to be reasonably successful. 

Figure 65, from Soffa and Laughlin [1983], illustrates various possible free energy- 
composition curves that can give ordered second phases by continuous reactions, 
involving a change of composition. In fig. 65a, the disordered curve is unstable against 
ordering at all compositions. An alloy of composition C 0 can lower its free energy by 
ordering, whereupon the ordered phase can then spinodally decompose. 5, and S 2 are the 
spinodal points for the ordered structure. This type of process is described as a condi- 
tional spinodal (Allen and Cahn [1976]). An alloy of lower solute content, C 0 \ that 
falls outside the spinodal region of the ordered structure will after ordering not then 
decompose spinodally but will precipitate the ordered phase via a nucleation and growth 
process. In fig. 65b, however, the disordered curve also shows an instability so that 
compositions y 0 can initially decompose in the normal spinodal manner, with the solute- 
rich cluster becoming ordered in the normal course of the reaction when the composition 
of the solute-rich regions has reached a value where the ordered phase has a lower free 
energy than the disordered phase. The distinction between a C 0 and a y 0 alloy could be 
the difference between the systems, Ni-Al and Cu-Ti, discussed in § 3.1. It was reported 
there, that experiments had shown that in the nickel alloy the ordered phase was 
precipitated with the correct composition and order right from the start of the decom- 
position while in the copper alloy there was steady increase in the titanium content of the 
titanium-rich regions for C 0 >1.9at% Ti. The schematic diagrams of fig. 64 appear to 
describe, qualitatively, the processes that can take place in various important alloys that 
precipitate ordered phases either by nucleation and growth or by spinodal decomposition. 
Laughlin etal. [1984] discussed the decomposition of more concentrated Cu-Ti alloys 




Fig. 65. Free-energy-composition curves of alloys that show various types of spinodal decomposition and 
continuous ordering, see the text (after Soffa and Laughlin [1983]). 
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(5at% Ti) than were also studied by von Alvensleben and Wagner [1984] and also 
decomposition in Fe-Be. They showed that both of these alloy systems first underwent 
homogeneous ordering before subsequently spinodally decomposing, eventually forming, 
in the high solute regions, the ordered phase, Cu 4 Ti in Cu-Ti. 

4. Application of phase transformation theory to specific alloy systems 

The application of phase transformation theory to specific systems is a huge topic and 
can only be treated in the briefest outline here. The main alloy system is the one based 
on iron which takes advantage of the allotropic transformation between the high 
temperature fee (austenite) phase and the low temperature bcc (ferrite) phase since fifty 
times as much steel is used compared to te second most important alloy system, that 
based on aluminium. However the growth rate of aluminium use is faster than that of 
steel (Finniston [1978]). Chapter 17 of this volume deals specifically with the physical 
metallurgy of steels and gives the most appropriate discussion of the topic, and further 
references. For completeness however, two additional references to this topic can be 
given: Pickering [1978] and Honeycombe [1981]. 

For aluminum, an excellent review of the various alloy reactions is provided by 
Polmear [1981] whose book also includes a discussion of the phase transformations in 
magnesium and titanium alloys. A recent discussion of the physical metallurgy, proces- 
sing and properties of aluminum alloys is the reviewed edited by Vasudevan and 
Doherty [1989]. The review by Mondolfo [1976] provides exceptional detail on the 
equilibrium diagrams and microstructures of the aluminum alloys in use at that time. 
Lorimer [1978] reviewed the physical metallurgy of aluminum precipitation reactions. 
The same volume on precipitation processes, edited by Russell and Aaronson [1978], 
includes accounts of precipitation in nickel alloys by Merrick [1978], in titanium alloys 
by Williams [1978] as well as Edmonds and Honeycombe [1978] on iron. For the 
nickel alloys, Sims and Hagel [1972] and Betteridge and Heslop [1974] provide 
detailed additional information and for titanium alloys there are also the conference 
reports on titanium published by Pergamon Press [1971] and Plenum Press [1973]. The 
recent series of International Conferences on Aluminum Alloys listed, in further reading, 
give a wide range of recent industrial applications of diffusive phase transformations, 
often coupled with thermomechanical processing. 

As discussed in § 1 for metals such as iron and titanium, the change of matrix crystal 
structure offers tremendous scope for many heat-treatments while in the alloys with a 
single matrix crystal structure, aluminium, copper, nickel, magnesium etc., only 
precipitation is readily available as a means of modifying the microstructures. However, 
although this is not discussed in this chapter, various additional microstructural modify- 
ing processes are available and are of increasing importance; these include rapid 
quenching from the liquid (ch. 19) which gives a much wider range of starting 
microstructure for subsequent treatment to work on than is provided by the conventional 
ingot casting route; the production of composite structures by powder metallurgy (ch. 31) 
and also the wide control of microstructure made possible by thermomechanical 
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treatments in which phase transformations are combined with deformation. This has been 
widely exploited for grain-size control in low-carbon alloys of iron and for ausformed 
iron, see ch. 17, but is now of increasing interest for modification of microstructure by 
precipitation during deformation. Hornbogen and KOster [1978] have shown something 
of the interesting results obtained by cold deformation, followed by precipitation, of 
supersaturated solid solutions. The next step is the combination of warm deformation 
with precipitation, an area only just being scientifically explored in alloys other than iron. 
The process occurs every day of course during current warm-rolling of alloys that are 
subsequently precipitation strengthened. An important review of the field was provided 
in the book edited by Morris [1979], on thermomechanical processing of aluminium 
alloys. There are also excellent reviews by Ferton [1981] on thermomechanical 
processing of aluminium alloys and by Luetjering and Peters [1981] on thermo- 
mechanical processing of titanium alloys. Recent conference reports on aluminum alloys 
and nickel-based superalloys update the situation with respect to all aspects of the 
physical metallurgy of these industrial alloys. 

5. Problems in phase transformations 

A very interesting and unusual approach in materials science and technology was the 
challenge offered by Maddin [1976] in the 25th volume of the journal. Materials Science 
and Engineering to 45 scientists to write a brief account of what was not known in the 45 
topics of major interest to the authors. Many of the topics were from the area of phase 
transformations. 40 of those invited accepted the challenge and produced an important 
series of articles. However, most of the reviews were conventional accounts of what was 
established and sadly did not really address the challenge of discussing what was not 
known or not understood. From the previous pages in this chapter, some of the important 
questions raised but not satisfactorily answered can be listed and there are no doubt more 
not described in this chapter because of the authors insufficient knowledge. Questions and 
uncertainties that seem most important, or at least interesting, include the following: 

(1) More experimental data to test all the theories of the transformation but in 
particular quantitative data are particularly needed on the rates of heterogeneous 
nucleation on both grain boundaries and dislocations, to answer the questions raised by 
coherency of the interface and the higher-level heterogeneities in grain boundaries. 

(2) Extension of atomic modeling of nucleation such as the cluster model described 
by Haasen and Wagner [1992]. 

(3) What are the mechanisms of nucleation and the nucleation kinetics of the growth 
ledges on coherent interfaces between phases of different structures, both during 
precipitation and during coarsening? 

(4) The rationalization of the role and significance of surface relief in transformations 
showing a change of composition, particularly the formation of Widmanstatten ferrite and 
the bainite reaction in steels, the extent of the departure for local equilibrium in the 
solute contents in these bainitic products and the origin of the incomplete formation of 
bainite in some alloy steels. 


References: p. 1497. 
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(5) Do all, or only some, Widmanstatten plates grow by a displacive interface 
mechanism, giving surface relief? 

(6) What determines if precipitates that only have a good fit direction with the matrix 
(e.g., bcc/fcc) grow as needles or as plates? 

(7) Understanding the determining principle of the lamellar spacing in all types of 
discontinuous reactions, and the tip radius in linear lengthening of Widmanstatten plates 
and needles. 

(8) The problem of shape stability for non-spherical precipitates. 

(9) The problem of understanding the effect, if any, of the volume fraction of second- 
phase particles on the rate of Ostwald ripening, especially in the Ni-Ni 3 X systems. 

(10) What are the conditions in which discontinuous precipitation at grain boundaries 
occurs in preference to nucleation and growth without the interference of a moving grain 
boundary? 

(11) To what extent does the coherency stress, known to be the controlling mechan- 
ism of diffusion-induced grain-boundary migration, play in the growth kinetics of all 
discontinuous (moving two-phase boundary) reactions? 

(12) How does diffusion-induced recrystallization manage to produce the high angle 
grain boundary between the new grain and the old grain into which it grows? 

(13) The need for a great deal more data on free energies, interfacial energies and the 
role of impurities in modifying interfacial energies, needed for accurate prediction of 
phase-transformation kinetics and thus of resultant microstructures. 

(14) The problem of the complications to all aspects of phase transformations 
produced by external variables such as elastic stress, plastic deformation, radiation 
(Hudson [1978]) and temperature gradients (McLean [1979]) and the effects of these 
variables both on microstructures and on the phase transformations that produce these 
microstructures. 

It would appear from this list that despite the great successes achieved in the past 50 
years in understanding phase transformations in metals, much remains unknown and 
therefore fascinating in this field. 
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1. Overview 

This chapter deals with phase changes where there are no long range movements of 
atoms in the sense of, say, their diffusional flow down a concentration gradient according 
to Pick’s Law. Indeed, in most cases to be considered the atoms move less than an inter- 
atomic distance and retain the relationship with their neighbors during the phase change. 
The prototype of this behavior is a martensitic transformation, which is equivalent to the 
deformation of the parent crystal lattice into that of the product (martensite). Because of 
their nature, martensitic transformations are frequently referred to as “displacive”, “shear- 
like” or “diffusionless”. A vast amount of geometrical or crystallographic information on 
martensitic transformations has accrued over the years, and we shall see that all 
martensitic transformations have certain common crystallographic characteristics. These 
characteristics help distinguish martensitic reactions and have led to a successful 
phenomenological theory of martensite crystallography, which will be discussed in this 
chapter, along with a consideration of the thermodynamics and kinetics of the transfor- 
mation. As will be seen, there are other phase changes, which although not commonly 
recognized as martensitic (e.g., bainite formation), are described well by the martensite 
crystallography theory. Finally, some other types of diffusionless, displacive solid-state 
transformations such as omega phase and charge-density wave formation are considered, 
both of which involve lattice distortions with correlated atomic displacements. 

2. Martensitic transformations 

2.1. Introduction and general characteristics 

The name martensite is after the German scientist Martens. It was used originally to 
describe the hard microconstituent found in quenched steels. Many materials other than 
steel are now known to exhibit the same type of solid-state phase transformation, known 
as a martensitic transformation frequently also called a shear or displacive transfor- 
mation. Martensite occurs in, for example, nonferrous alloys, pure metals, ceramics, 
minerals, inorganic compounds, solidified gases and polymers (table 1). 

It has long been recognized that martensitic transformations are diffusionless; this is 
why martensite can grow at temperatures below 100 K where the mobility of atoms is 
negligible. It is not implied that all martensitic transformations occur at low temperatures; 
indeed, many of them occur at comparatively high temperatures (table 1), but these too 
are diffusionless. Martensite can also grow at speeds in excess of 1000 m s' 1 , speeds 
which are inconsistent with the diffusion of atoms. In most cases, the amount of 
martensite that is obtained depends on the transformation temperature rather than the 
time at that temperature, in which case the transformation is called athermal. As pointed 
out by Christian [1970], the overall kinetics of a martensitic reaction depend on both 
the nucleation and growth stages and will largely be dominated by the slower of the two. 
For example, slow thermal nucleation may give rise to isothermal transformation 
characteristics. As will be seen later, the martensitic transformations do not obey classical 
nucleation theory. 
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Table 1 

The temperature M s at which martensite first forms on cooling, and the 
approximate Vicker’s hardness of the resulting martensite for a number of 
materials. 


Composition 

M/K 

Hardness HV 

ZrOj 

1200 

1000 

Fe-31Ni-0.23C wt.% 

83 

300 

Fe-34Ni-0.22C wt.% 

<4 

250 

Fe-3Mn-2Si-0.4C wt.% 

493 

600 

Cu-15A1 

253 

200 

Ar-40N 2 

30 



The interface between the martensite and its parent has to have a structure which is 
consistent with the absence of diffusion and with the rapid rate of growth. A glissile 
interface like this does not require significant thermal activation in order to move, and 
must be coherent or semi-coherent, depending upon the crystallography of the particular 
material undergoing transformation. In most cases, e.g., the formation of martensite in 
ferrous alloys, the interface is semi-coherent and the parent and product lattices are 
coherently accommodated only over local regions of the boundary. The accumulating 
misfit is then relieved periodically by auxiliary deformation processes in such a way that 
the interface remains glissile. On the other hand, e.g., the fee -» hep transformation in Co 
and its alloys, the martensite-parent interface is fully coherent, and the deformation it 
causes when it moves (normal to itself) is sufficient to accomplish the required lattice 
change. In contrast, the macroscopic deformation caused by the movement of a semi- 
coherent martensite interface is related in a more complex way to the strain which 
converts the parent to the product lattice. More discussion of interfaces will follow later, 
but for now it is emphasized that both semi-coherent and coherent martensite-parent 
interfaces must be glissile. 

Martensitic transformations are most readily distinguished from other solid-state phase 
changes on the basis of their crystallographic characteristics, which imply a “military” or 
coordinated transfer of atoms from the parent to the product phase. In a “civilian” 
transformation the atoms move across the phase boundary in an uncoordinated manner 
by diffusion. They feature a coordinated structural change involving a lattice correspon- 
dence and an ideally planar parent-product interface which during movement (trans- 
formation) produces an invariant-plane strain shape deformation. A fine-scale in- 
homogeneity in the martensite, such as slip, twinning or faulting, is usually observed at 
the electron microscope scale. This secondary deformation, an intrinsic part of the 
transformation process, produces the invariant-plane condition at the macroscopic scale 
and provides a semi-coherent glissile interface between the martensite and the parent 
phase. Crystallographic features between the martensite and the parent phase such as the 
habit (invariant) plane and orientation relationship are usually not expressible in terms of 
exact relations involving integral Miller indices. The various crystallographic features of 
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martensitic transformations will be discussed and illustrated with representative experi- 
mental examples. After this, a brief account of the development of the phenomenological 
crystallographic theory will be presented, followed by an algebraic analysis. 

2.2. Experimental observations of crystallographic features 

Figure 1 is an optical micrograph of a polycrystalline Fe-24.5Pt at.% alloy taken after 
transformation of the parent phase into martensite by cooling. The specimen was polished 
flat prior to transformation to martensite. The observed contrast results purely from 
deformation caused by the formation of martensite. The light and dark shades correspond 
to regions of martensite which have undergone distortion in different senses with respect 
to the initial surface. This surface tilting or macroscopic distortion is known as the shape 
deformation or shape strain. It can be demonstrated by serial sectioning that the 
martensite has a three-dimensional shape which is that of lenticular plates. An X-ray 
analysis would reveal that the plates showing different optical contrast also feature 
different lattice orientations with respect to the initial parent grain. X-ray analysis would 
also show that a structural change had occurred, which for this Fe-Pt alloy is an 
fee— » bcc transformation*. The differently oriented plates are different crystallographic 
variants of the habit plane and orientation relationship. 

Referring again to fig. 1, if straight scratches were purposely abraded on a flat 
specimen still in its parent phase, then the formation of martensite would displace the 



Fig. 1 . Optical micrograph showing surface relief due to a martensitic transformation in an Fe-24.5% Pt alloy. 
The specimen was polished to a flat condition at room temperature and then cooled to produce the martensite. 


* The terms fee, bcc, bet and hep are abbreviations for face-centered cubic, body-centered cubic, 
body-centered tetragonal and hexagonal close-packed, respectively. 
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scratches in a characteristic manner. After analyzing several nonparallel scratches 
crossing a given plate and noting the initial position of the same scratches in the 
untransformed parent phase, a distortion matrix which describes the shape deformation 
can be derived, which shows that a deformation which is homogeneous on an optical 
scale has taken place. This shape deformation resembles a simple shear, but in general 
is better described as an invariant-plane strain (fig. 2) in which one plane is left 
undistorted and unrotated; this invariant plane is also known as the habit plane. 

An invariant-plane strain is a homogeneous distortion such that the displacement of 
any point is in a common direction. The magnitude of the displacement is proportional 
to the distance from a fixed plane of reference, which is the invariant-plane (itself un- 
affected by the strain). In most martensitic transformations a volume change accompanies 
the structural change which produces a normal component S to the invariant-plane strain. 

It is well established that the shape deformation caused by the growth of martensite 
is an invariant-plane strain. Nevertheless, it is also true that for many martensitic 
transformations, an invariant-plane strain cannot convert the crystal structure of the 



r ? 5 

/ general 
/ invariant 
/ plane strain 


Fig. 2. (a) An invariant-plane strain characteristic of martensitic transformation. The initially straight line DE 
is displaced to the position DF when the martensite plate with habit plane ABC is formed. The plane ABC is 
invariant (unrotated and undistorted) as a result of the formation of martensite, (b) An illustration of the 
difference between a simple shear and a general invariant-plane strain, s is the shear component of the strain 
and S the dilatational component of strain. The dilatational strain defines the volume change of transformation. 
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parent into that of the product phase. In iron alloys the martensitic transformation 
converts an fee parent into a bee (or bet) product, but the measured shape deformation 
matrix when applied to the parent fee will not produce a bee structure. This apparent 
inconsistency will be explained later. 

Martensite plates in a given alloy usually possess a unique habit plane, as shown for 
example in fig. 3. The stereographic projection shows the experimentally determined 
habit plane poles for different martensite plates, which, as can be seen, cluster near the 
plane in the parent phase with Miller indices (3 10 15)y> (subscripts P and M, respectively, 
are used to designate the parent and martensite). 

Early work suggested that, apart from a small relative rotation of corresponding unit 
cells in the parent and product phases, a homogeneous, lattice distortion would account 
for the known structural change in martensitic transformations. As early as 1924, BAIN 
suggested that the austenite (parent phase) — » martensite transformation in steels could be 
explained by a homogeneous “upsetting” of the parent fee lattice into the required bcc 
(or bet) lattice, as shown schematically in fig. 4. There are in fact many ways (corres- 
pondences) to generate a bcc product from an fee parent by means of a homogeneous 
distortion, and another possible correspondence is shown in fig. 5, but analysis shows 
that the Bain deformation involves the smallest principal strains and should therefore be 
favoured. The lattice correspondence, a unique relationship between any lattice point in 
the initial lattice and the point it becomes in the final lattice, implied by the Bain 
distortion has also been verified experimentally using an ordered Fe 3 Pt alloy which undergoes 
nominally a fee to bcc transformation. By observing corresponding superlattice reflections in 
the parent and martensitic phases by means of transmission electron diffraction it was 
deduced that the Bain correspondence actually applies (Tadaki and Shimizu [1970]). 

From the lattice correspondence shown in fig. 4 one would expect for example, 
[001]* II [001],, [0 1 0]* || [1 1 0]„ (1 1 2)* || (1 0 1)„ (0 1 1)* || (1 1 IV, etc. However, 
such exact parallelisms are not observed. For the Fe-Pt alloy shown in fig. 1 the 
following orientation relationship was observed: 


(in) 



Fig. 3. Unit triangle of stereographic projection showing experimental habit planes (open circles) for martensite 
formed in an Fe-7Al-1.5C wt.% alloy. The average habit plane cannot be expressed in terms of simple Miller 
indices, and is near the plane {3 10 15) P of the parent phase. 
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Fig. 4. Lattice distortion and correspondence proposed by Bain [1924] for the fcc-bcc (bet) martensitic 
transformation in iron alloys. The correspondence related cell in the parent phase (indicated by bold lines) 
becomes a unit cell in the martensite as a consequence of a homogeneous “upsetting” with respect to the z' 
axis. 


[001] P - [001] M : 9.10° apart, 

[1 0 1] P - [IT 1] M : 4.42° apart, 

(1 1 1) P - (0 1 \) M : 0.86° apart, 

which is typical of the orientation relationships found in iron alloys and steels. Further 
consideration of the above orientation relationship shows that the correspondence cell is 
not only distorted according to the Bain strain, but also is rotated (about 10° from [0 0 1] P 
towards [1 1 0] P ). This is termed a rigid body rotation. The combined distortion-rotation 



Fig. 5. Alternative lattice correspondence for the fcc-bcc (bet) martensitic transformation which involves larger 
principal distortions than the Bain correspondence. 
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is known as the lattice deformation. Additional examination of the above orientation 
relationship shows that the close-packed planes (1 1 1) P and (0 1 l) w are nearly parallel to 
each other and the close-packed directions [1 0 1] P and [1 1 l] w are a few degrees apart. 
This is an example of the Greninger-Troiano orientation relationship. When the close- 
packed planes and directions in two coexisting structures are almost parallel, the 
Kurdjumov-Sachs orientation relationship is obtained. 

In summary, the Bain strain correctly transforms the crystal structure of the austenite 
into that of martensite, and when combined with an appropriate rigid body rotation leads 
to the correct orientation relationship. In fact, the combined lattice deformation (Bain + 
rotation) is an invariant-line strain which leaves a single line unrotated and undistorted. 
This line lies in the interface between the austenite and martensite, and permits the 
interfacial structure to be semi-coherent and glissile. Hence, two crystals can only 
transform into one another by a martensitic mechanism if they can be related by a lattice 
deformation which is at the very least an invariant-line strain. 

2.3. The phenomenological crystallographic theory of martensitic transformations 

We have seen that the Bain strain (B) can transform the parent into the product 
crystal structure with minimal atomic displacements. In itself the Bain strain alters every 
vector, but it can in combination with a rigid body rotation (R) be converted into a 
lattice deformation which is an invariant-line strain (RB) which also yields the experi- 
mentally observed orientation relationship. However, this invariant-line strain is incon- 
sistent with the experimentally established invariant-plane strain shape deformation (Pj). 

The reason for this is as follows. Referring to fig. 4 again, it is noted that the 
correspondence cell is contracted (-20%) along the f axis and expanded (-12%) along 
the x and y axes. Such a homogeneous distortion will leave no plane invariant (i.e., 
undistorted and unrotated). But suppose that the distortion along y vanished. One of the 
principal distortions would then be less than unity (along z ), one greater than unity 
(along x') and the remaining one (along y') exactly unity. These conditions imply the 
distortion of an initial sphere (parent phase) into a triaxial ellipsoid (martensite) 
following which the sphere and ellipsoid can fit together along an undistorted plane of 
contact (habit plane) as shown in fig. 6. But this special set of conditions is not generally 
found in practice; the principal distortions are determined by the lattice correspondence 
and observed lattice parameters of the two phases. 

The apparent inconsistency described above is resolved by envisioning a lattice- 
invariant deformation P involving no structural change, such as slip or twinning to occur 
in conjunction with the lattice deformation RB. The additional deformation must be 
lattice-invariant because the necessary structural change is effected by the Bain distortion 
alone. The role of the additional deformation is essentially to shear (distort) the ellipsoid 
resulting from the Bain distortion into tangency with the initial sphere; then one of the 
principal distortions (OX in fig. 6) becomes unity and an undistorted contact plane exists. 
This additional deformation (slip, twinning, or faulting) being a shear is known as the 
inhomogeneous shear or complementary shear of the crystallographic theory. 

These are the three phenomenological steps B, R, P describing the total transform- 
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Fig. 6. The Bain distortion has a mathematical analog wherein an initial unit sphere is distorted into an 
ellipsoid. If one of the principal distortions is unity and the other two are of opposite sign, then the sphere and 
ellipsoid can fit together along the undistorted planes AOX or BOX. See text for discussion. 


ation crystallography. There is no time sequence implied as to which step occurs when. 
Of course the combined effect of these three operations must be equivalent to the shape 
deformation. 

Working with the theory just described, different crystallographic features such as the 
habit plane and orientation relationship can be predicted by supposing the inhomogeneous 
shear to occur on different crystallographic planes and directions. For example, in most 
iron alloys an inhomogeneous shear on the (1 1 2)— [I I l] w twinning system will predict 
a habit plane near (3 15 10),,, but assuming the inhomogeneous shear system to be 
(0 1 1)-[1 1 1] M predicts a habit plane near (1 1 1) P . Since the lattice parameters and 
correspondence are usually known, the Bain distortion for a given transformation is 
specified, and the flexibility in the theory comes through different suppositions concern- 
ing the plane and direction of the inhomogeneous shear. Once the inhomogeneous shear 
system is assumed, a shear of a certain magnitude will produce an undistorted plane, 
which when rigidly rotated to its original position becomes the invariant habit plane. 

The previous discussion and examples cited above have centered around iron alloys. 
Because of the importance of steels there has been substantial work on them and thus an 
abundance of experimental data exists. However, the principles presented are quite 
general and apply to all martensitic transformations. 

2.3.1. Summary of crystallographic theory 

The Bain strain converts the structure of the parent phase into that of the product 
phase. When combined with an appropriate rigid body rotation, the net homogeneous lat- 
tice deformation RB is an invariant-line strain (step a to c in fig. 7). However, the obser- 
ved shape deformation is an invariant-plane strain P, (step a to b in fig. 7), but this gives 
the wrong crystal structure. If, however, a second homogeneous shear P 2 is combined 
with Pj (step b to c), then the correct structure is obtained but the wrong shape since 
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INVARIANT-LINE 
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(b) 
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LATTICE 
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DEFORMATION 




Slipped 

Martensite 


Correct macroscopic shape, correct structure 
Fig. 7. Schematic illustration of the phenomenological theory of martensite crystallography. 


p,p 2 = RB 

These discrepancies are all resolved if the shape changing effect of P 2 is cancelled 
macroscopically by an inhomogeneous lattice-invariant deformation, which may be slip 
or twinning as illustrated in fig. 7. 

The theory illustrated elegantly explains all the observed features of the martensite 
crystallography. It is easy to predict the orientation relationship, by deducing the Bain 
strain and adding a rigid body rotation which makes the net lattice deformation an 
invariant-line strain. The habit plane does not have rational indices because the amount 
of lattice-invariant deformation needed to recover the correct the macroscopic shape is 
not usually rational. The theory predicts a substructure in plates of martensite (either 
twins or slip steps) as is observed experimentally. The transformation goes to all the 
trouble of ensuring that the shape deformation is macroscopically an invariant-plane 
strain because this reduces the strain energy when compared with the case where the 
shape deformation might be an invariant-line strain. 

We now proceed to discuss the details of this theory. 
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2.3.2. The inhomogeneous shear and martensite substructure 

The inhomogeneous shear was introduced in the crystallographic description of 
martensite transformations to ensure that the habit plane is macroscopically undistorted. 
Figure 8 is a schematic representation of the appearance of internally twinned and 
internally slipped martensite plates. Although there are localized distortions at the 
interface, the “saw tooth” effect because of alternating twins (called transformation twins) 
or slip lamellae prevents accumulation of any strain at the interface over large distances. 

For internally twinned martensite the Bain distortion is envisioned to occur along 
different contraction axes in the two twin-related regions, and the twinning plane in the 
martensite is derived from a mirror plane in the parent. In the case of internally slipped 
martensite the Bain distortion is the same in all regions of a plate. Figure 9 shows that 
the same effective shear angle y can be accomplished by slip or twinning. Note that the 
relative thickness of the two twin components determines the angle y. 

Because of the inhomogeneous shear mentioned above, one would expect to observe 
some kind of substructure in the martensite. Such observations have been made since the 
introduction of the theory. Figure 10 is a transmission electron micrograph of a marten- 
site plate in an Fe-Ni-C alloy. The fine striations crossing the plate are transformation 
twins ({1 1 2} m twinning plane). Regions adjacent to the martensite are retained austenite. 
If the twin plane is indexed specifically as (1 1 2) M , the habit plane trace becomes 
specifically (3 15 10),,. If (1 1 2)— [1 1 1] M twinning is used in the habit plane calculations, 
the predicted habit plane is in fact (3 15 10),,. Thus the particular variant of the twin 
plane is found to be consistent with the particular variant of the habit plane, and for the 
Fe-Ni-C alloy the experimental observations are in excellent agreement with those 
features which are predicted using the phenomenological crystallographic theory. This is 
also the case for many other martensitic transformations which have been studied in 
some detail using transmission electron microscopy and diffraction. Notable exceptions, 



Fig. 8. Schematic representations of the inhomogeneous shear in a martensitic transformation involving (a) 
internally twinned, and (b) internally slipped plates of martensite. The serrated effect at the interface (habit 
plane) in each case prevents the long-range accumulation of strain and consequently the interface remains 
macroscopically undistorted. 
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Fig. 9. As shown for twinning (a) and slip (b) the same magnitude of the inhomogeneous shear, as given by 
the angle y, can be accomplished by either. 


however, are certain steels which transform to martensite with a {22 5},, habit plane. The 
martensite in these materials has a complex substructure, as seen in the electron micro- 
scope, and is not very well explained using the theory described above. In addition, there 
is no adequate crystallographic description of the lath martensites found in ferrous alloys. 

2.3.3. Mathematical description of the phenomenological crystallographic theory 

The basic equation of the crystallography theory just described is: 



Fig. 10. Transmission electron micrograph showing a martensite plate and adjacent retained austensite in an 
Fe-30% Ni-0.4% C alloy. The striations within the plate running from left to right are transformation twins. 
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P, = RPB (1) 

where B represents the Bain distortion, P is a simple shear (following in a mathematical 
sense the Bain distortion), R is the rigid body rotation previously mentioned, and P, is 
the invariant-plane strain shape deformation. B, P, R and P t can all be represented as 
(3 x 3) deformation matrices. A vector, on the other hand, can be represented by a 1 x 3 
single-row matrix or a 3 x 1 single-column matrix. The product of a deformation matrix with 
a vector gives a new vector which may in general be of a different length and direction. 

The matrix product RPB is equivalent to the shape deformation P, and the rotation 
matrix R rotates the plane left undistorted by PB to its original position. That is, P ( = 
RPB is an invariant-plane strain. 

Although eq. 1 indicates that the inhomogeneous shear P follows the Bain distortion, 
the same end result can be obtained by “allowing” the shear to occur in the parent phase 
prior to the Bain distortion. This alternative procedure simplifies computations. In this 
case the basic equation becomes: 

P, = RBP (2) 


where P as before is a simple shear. 

The matrix representation of the invariant-plane strain shape deformation is: 

P = I + mdp' 



o 

o 



= 

0 1 0 

+ m\d l d 2 

d 3 \Px Pi p4) 


o 

o 




fl + md l p l md\Pi 

md l p 3 

= 

mdiPi 

1 + md 2 p 2 md 2 p 3 


^ md J p l 

md 3 p 2 

1 + md 3 p 3 ) 


where p' (prime meaning transpose) being a plane normal is written as a (1 x 3) single- 
row matrix, in contrast to d, which is a vector written as a (3 x 1 ) single-column matrix. 

The Bain distortion illustrated in fig. 4, for the fee -» bcc (or bet) transformation can 
be represented as: 


B = 


'42 a/a 0 
0 


0 

42a/a 0 

0 


0 


(4) 


For typical values of the lattice parameters of the fee lattice, a, and of the bet lattice, a 0 
and c, the matrix B becomes 
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B = 


f 1.12 
0 


0 0 N 
1.12 0 
0 0 . 8 , 


(5) 


It is apparent from eq. 5 that two of the principal distortions are greater than unity (i.e., 
1.12) and the third is less than unity (i.e., 0.8). Therefore, the Bain distortion cannot 
leave a plane invariant; as stated earlier, for a strain to be an invariant-plane strain, one 
of the principal distortions must be zero, one greater than unity and the other less than 
unity. 

Referring back to eq. 2, it is noted that P is a simple shear, and therefore can be 
represented as (I + mdp'). Also, the inverse of P, P _1 = (I-mdp') is a simple shear of the 
same magnitude, on the same plane, but in the opposite direction. That is, both P and P~* 
are invariant-plane strains. It is now convenient to rewrite eq. 2 as: 

P,P 2 = RB, (6) 

where P 2 = P” 1 . Since both P, and P 2 are invariant-plane strains, their product RB is an 
invariant -line strain S. The invariant-line is defined as the line of intersection of the 
planes which are invariant to P, and P 2 . If the invariant-line strain S is known, all the 
crystallographic features of a given martensitic transformation can be predicted. It is not 
within the scope of the present account to go into the details of the invariant-line strain 
analysis (Bowles and Mackenzie [1954]), but certain highlights should be mentioned. 
The Bain correspondence and distortion B are known from the lattice parameters of the 
parent and martensite phases. R is determined once the plane p 2 ' and the direction d 2 of 
P 2 are assumed. It should be noted that RB = S constitutes the lattice deformation of the 
transformation. 

The important results of the invariant-line strain analysis, noting that the shape strain 
is P, =I + m 1 d l Pi' and that the simple shear (mathematically preceding the Bain distort- 
ion) is P 2 = I + m 2 d 2 p 2 ' are as follows (where the magnitudes, directions and planes of the 
component invariant-plane strains are given respectively by m, d and p'): 

= [Sy 2 - y 2 ]/pjy 2 » (7) 


Pi = (<li -q^j/q'S^d,, (8) 

where y 2 is any vector lying in p 2 (except the invariant line x) and q 2 is any normal 
(other than n', the row eigenvector of S -1 , i.e., n'S M =n') to a plane containing d 2 . The 
normalization factor for d, in eq. 7 is 1/m, and therefore P„ m„ d, and p{ are all 
determinable. The matrix R is determined from the requirement that x and iT which are 
displaced by the Bain distortion must be totally invariant. R defines the orientation 
relationship within any small region of the martensite plate not involving P 2 . Thus, the 
assumed correspondence and lattice parameters determine B, the assumption of p 2 and d 2 
allows R to be determined, and RB = S defines the elements of P,. 

The above description of the crystallographic theory parallels the analysis given by 
Bowles and Mackenzie [1954] but the treatments of Wechsler et al. [1953] and 
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Bullogh and Bilby [1956] are equivalent. 

Some variations in the basic theory just presented include the introduction of a 
dilatation parameter, which relaxes slightly the requirement for the habit plane to be 
undistorted, and the incorporation of two inhomogeneous shear systems such that 

P, = RBS 2 S, (9) 

where S 2 and S, are the two inhomogeneous shear systems involved. Neither of these 
modified approaches is without criticism, and it has been argued that the double shear 
approach loses generality. 

2.3.4. Some other crystallographic observations 

As pointed out earlier in this discussion, the martensite habit plane is near [3 10 15},, 
for an Fe-24.5Pt at.% alloy. A similar habit plane is also found for martensite formed in 
Fe-Ni alloys containing approximately 30 wt.% Ni. However, in some steels, e.g., 
Fe-8Cr-l.lC wt.%, the habit plane is near {225},,, and the habit plane of lath marten- 
sites typical of low -carbon steels and dilute iron alloys is near [1 1 1 },,. However, the 
martensite-parent orientation relationship is almost the same in the [3 1015], {225} and 
{111} transformations. 

It is unclear why different steels transform to martensite with a different habit plane, 
but it is generally thought that there are different operative modes of the inhomogeneous 
shear, i.e., different planes and directions, or even the operation of two or more modes 
within a given martensite plate or lath as suggested by the multiple shear theories. Fig- 
ure 1 1 shows some experimental habit plane determinations for a variety of iron alloys. 

Techniques are improving for measuring the shape strain associated with martensitic 
transformations and the information so obtained is leading to a clearer understanding of 



Fig. 11. Unit triangle of a stereographic projection showing the results of habit plane measurements on 
martensite formed in five different Fe-alloys. 
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the transformation crystallography. In addition, the use of transmission electron micro- 
scopy has greatly enhanced the understanding of the crystallography of martensitic 
transformations. 

23.5. Further observations on the martensite morphology and substructure 

In general, martensite in ferrous alloys takes the form of lenticular plates as seen in 
fig. 1 for an Fe-Pt alloy. This morphology is readily understood because a lenticular 
plate, as a mechanical twin, is a low strain energy shape in a “shear” transformation. 
However, other morphologies are not uncommonly found in ferrous alloys, and these are 
briefly mentioned here. 

In low-carbon steels (up to 0.4 wt.% C) and in some other ferrous alloys the 
martensite takes the form of laths rather than plates. A typical lath has dimensions 0.3 x 
4 x 200 /j,m. As mentioned above, the habit plane of such laths is near { 1 1 1 } P . In 
Fe-Ni-C alloys, three distinct morphologies are found and are classified as the lenticular, 
thin plate, and butterfly morphologies. In one well-studied case it has been shown that 
butterfly martensite (fig. 16, below) forms at the highest temperatures, lenticular 
martensite forms at intermediate temperatures, and thin-plate martensite forms at the 
lowest temperatures. In some materials, particularly ferrous alloys, a needle-like 
morphology known as surface martensite is formed under some conditions. Apparently 
this morphology is easily nucleated at surfaces and can develop into a “needle” form 
because matrix constraints are relaxed near a free surface. 

Whereas the plate types of ferrous martensite frequently form in a self-accom- 
modating manner involving several habit plane variants, lath martensites do not appear 
to do so. Instead, the laths tend to cluster together into packets, each adjacent lath having 
the same habit plane, orientation relationship, and shape deformation. The intrinsic lath 
morphology (as opposed to plate morphology) is thought to be related to the larger shape 
strain associated with lath martensite. Unlike the plate morphology which typically con- 
tains a high density of transformation twins as shown in fig. 10, martensite laths are typi- 
cally untwinned and contain a high density of internal dislocations as shown in fig. 12. 
Since the inhomogeneous shear in ferrous lath martensite is not twinning, it is expected 
that slip is the mode involved and that the lattice-invariant deformation would be effected 
by the movement of interface dislocations. An example of such interface dislocations in 
an Fe-Ni-Mn martensite lath is shown in fig. 13. The high density of dislocations 
usually observed within lath martensites may not be directly related to the transformation 
crystallography. It may result from the constraints a lath experiences during growth, i.e., 
the internal dislocations probably arise from accommodation distortion. 

Another martensite morphology found in certain steels, notably austenitic stainless 
steels and Fe-Mn-C alloys, is the banded morphology (fig. 18, below). However in these 
alloys, the transformation is not fee— » bcc, but fee— » hep instead. This transformation is 
accomplished by distorting the fee parent phase on { 1 1 1 },, planes by the passage of 
partial dislocations which generate the hep martensite. As would be expected, the bands 
delineate { 1 1 1 } P planes. This type of fee — » hep martensite transformation is similar to 
that which occurs in cobalt and cobalt alloys, and, as mentioned earlier, involves a fully 
coherent interface and no lattice-invariant deformation in the usual sense. 
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Fig. 12. Transmission electron micrograph showing dislocations (rather than twins) in a martensite lath formed 
in an Fe-20% Ni-5% Mn alloy. 



Fig. 13. Transmission electron mmicrograph showing interface dislocations (parallel to the arrow) in a lath of 
martensite in an Fe-20% Ni-5% Mn alloy. 
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Figures 14-18 are optical micrographs showing typical martensite morphologies in 
ferrous alloys. 

2.4. Martensite-parent interfaces 

The phenomenological theory of martensitic transformations permits all the crystal- 
lographic features of the parent and product phases to be mathematically related, but does 
not yield detailed information about the mechanism of transformation. The latter depends 
critically on the structure of the transformation interface. 

The fact that martensite can grow at very low temperatures and very high velocities 
means that the transformation interface must be very mobile and be able to move without 
any need for diffusion. The interface must be glissile. A fully coherent interface is 
glissile, but since the parent and martensite crystals cannot in many cases be converted 
into one another by a lattice deformation which is an invariant-plane strain, the interfacial 
structure is more likely to be semi-coherent. 

A semi-coherent interface consists of coherent regions separated by discontinuities 
which prevent the misfit in the interface plane from accumulating over large distances. 
There are two kinds of semi-coherency [Christian and Crocker, 1980]; if the 
discontinuities mentioned above are intrinsic dislocations with Burgers vectors in the 
interface plane, not parallel to the dislocation line, then the interface is said to be 
epitaxially semi-coherent. The normal displacement of such an interface requires the 
thermally activated climb of the intrinsic dislocations. A martensite interface cannot 
therefore be epitaxially semi-coherent. 



Fig. 14. Optical micrograph showing lenticular martensite formed at -79°C in an Fe-31Ni-0.23C wt.% alloy. 
The more heavily etched linear central region of the plates is known as a midrib. 
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Fig. 17. Optical micrograph showing surface relief following the formation of lath martensite in an Fe-0.2C 
wt.% alloy. 


In the second type of semi-coherency the discontinuities are screw dislocations, or 
dislocations whose Burgers vectors do not lie in the interface. This is the kind of semi- 
coherency associated with glissile martensite interfaces, whose motion is conservative 
(i.e., it does not lead to the creation or destruction of lattice sites). Two further condi- 
tions must be satisfied before even this interface can be said to be glissile [Christian 
and Crocker, 1980]: 

(i) A glissile interface requires that the glide planes of the intrinsic dislocations 
associated with the product lattice must meet the corresponding glide planes of the 
parent lattice edge to edge in the interface, along the dislocation lines. 

(ii) If more than one set of intrinsic dislocation exist, then these should either have the 
same line vector in the interface, or their respective Burgers vectors must be parallel. 



Fig. IS. Optical micrograph showing bands of hep martensite (A and B) formed in the fee austenite matrix in 
an Fe-12Mn-0.48C wt.% alloy. 
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This ensures that the interface can move as an integral unit, and places severe 
restrictions on any theory involving more than one lattice-invariant deformation. 

The intrinsic dislocations accomplish the lattice-invariant deformation as the interface 
glides. The interface also contains small, atomic height steps, which are coherent and 
whose motion leads to the transformation of the parent into the product phase. These 
have the character of transformation dislocations, and their Burgers vector can be defined 
in terms of the lattice deformation S (Christian [1982]). The density of such steps 
depends on the local curvature of the interface, but it must be emphasized that their 
motion is glissile and that it is their motion which leads to transformation, whereas the 
intrinsic dislocations cause the lattice-invariant deformation. 

When the transformation dislocations (the coherent steps) generate the final lattice in 
a deformation which is an invariant-plain strain, as in the fee -» hep cobalt transform- 
ation, no other defects are required in the interface. 

2.5. Energetics of martensitic transformations 

2.5.1. Transformation hysteresis and the reverse transformation 

Two representative but widely different cases are considered here: the non-thermo- 
elastic martensitic transformation in Fe-Ni (“30 wt.%) alloys, and the thermoelastic 
martensite in typical /3 -phase alloys such as /3 -brass, often known as the § electrons per 
atom Hume-Rothery phases. In both cases the shear component of the macroscopic shape 
strain is about 0.2 (shear angle about 12°) but the way in which this strain is accommodated 
by the parent phase is quite different for the two cases, as is the transformation hysteresis. 

In the Fe-Ni alloys the martensite-parent interface appears to become immobilized 
after a plate of martensite has thickened to a certain extent, and when the martensite is 
heated, the interface will not move backwards, apparently being pinned by its damaged 
environment. Instead, the reverse martensite — » parent (austenite) transformation takes 
place by the nucleation of small platelets of the parent phase within the martensite plates 
(Kessler and Pitsch [1967]). And in ferrous alloys containing carbon, i.e., steels, the 
usual stages of martensite tempering occur (Krauss [1980]) within which the martensite 
decomposes through diffusional processes. 

On the other hand, in alloys of the /3-phase type, the shape deformation of the 
martensite is elastically accommodated by the matrix with no dislocation or “debris” 
generation; the interface therefore remains glissile, capable of “backwards” motion 
leading to the shrinkage of martensite plates during heating. In addition, in such 
thermoelastic transformations (Kurdjumov and Khandros [1949]), stored elastic energy 
apparently predisposes the reverse martensite-parent transformation, allowing it to occur 
“prematurely” (Kaufman and Cohen [1958]). In such cases, the stored elastic energy 
contributes to the driving force for the reverse transformation. 

Whether martensitic transformation in any particular alloy will be thermoelastic or not 
can be predicted by an analysis of the stress field around a plate of martensite. If the 
stress induced by the shape deformation causes the matrix to yield plastically, then the 
transformation is not thermoelastic [Olson and Owen, 1976; Ling and Owen, 1981]. 
Austenite in iron-platinum alloys undergoes ordering which greatly affects the elastic 
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properties. The shear modulus decreases and the yield strength increases, so that it 
becomes possible to elastically accommodate the martensite. The ordered alloy therefore 
has thermoelastic martensite whereas the disordered alloy has non-thermoelastic 
martensite. The plastic deformation for the two cases is illustrated in fig. 19. 

2.6. Thermoelastic and non-thermoelastic martensitic transformations 

The differences between thermoelastic and non-thermoelastic martensitic transfor- 
mations may be highlighted by comparing Au-Cd and Fe-Ni alloys, respectively 
(Otsuka and Wayman [1977]). Figure 20 shows that there is a substantial difference in 
the transformation hysteresis for the two alloys, which is the difference between the A F 
(completion temperature for the reverse transformation) and M s temperatures (Kaufman 
and Cohen [1958]). The thermoelastic transformation (Au-Cd) is characterized by a 
small hysteresis. Another difference is in the manner of the martensite-parent reverse 
transformation. In the thermoelastic case, the martensitic transformation on cooling 
proceeds by the continuous growth of martensite plates and the nucleation of new plates. 
When cooling is stopped, growth ceases, but resumes during further cooling, until the 
martensite plates impinge with each other or with grain boundaries. The reverse 
transformation upon heating occurs by the backwards movement of the martensite-parent 
interface, and the plates of martensite shrink and revert completely to the initial parent 
phase orientation. 

On the other hand, a plate of non-thermoelastic martensite generated at a given tem- 
perature does not grow further upon subsequent cooling because the interface apparently 
becomes immobilized. Nor does the immobilized interface reverse its direction of motion 
during heating. Instead, the parent phase has to nucleate within the immobilized 
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Fig. 19. Ling and Owens’ calculations showing the contours where the matrix satisfies a yield criterion, as a 
function of the aspect ratio of the martensite plate and the degree of ordering (5). The region between each 
contour and the martensite plate represents plastically deformed austenite. Only half the martensite plate is 
illustrated in each case. 
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Fig. 20. Graph showing electrical resistance change during heating and cooling for Fe-Ni and Au-Cd alloys, 
indicating the hysteresis for the martensitic transformations on cooling and the reverse transformation on heating 
for non-thermoelastic and thermoelastic transformation, respectively (after Kaufman and Cohen [1957]). 


martensite plates; since the parent phase can nucleate in many orientations, the original 
parent phase orientation is not recovered on heating (Kessler and Pitsch [1967]). 

2.6.1. Free energy change due to martensitic transformation 

Figure 21 shows schematically the change in chemical free energies of martensite and 
austenite (parent phase) with temperature. T 0 is the temperature at which the austenite and 
martensite are in thermodynamic equilibrium and M s is the temperature at which the trans- 
formation starts upon cooling. The difference in free energies between austenite (y) and 
martensite ( a ), AG^ at the M s temperature is the critical chemical driving force for the 
onset of the martensitic transformation (other features of fig. 21 will be described later). 

In general, the free energy change associated with a martensitic transformation is 
given by: 

A G™ = A G^ m + A G£ m (10) 



Fig. 21. Schematic diagram showing the free energy change for a martensitic transformation. See text for details. 
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where A G^ M is the chemical free energy change (per unit volume of transformation) 
associated with the transformation from the parent to martensite, and A G^ M is the non- 
chemical energy opposing the transformation (consisting of elastic strain and surface 
energy). Since the martensite-parent interface is semi-coherent the surface energy term 
should be small, and the elastic energy term will dominate. As will be seen later, the 
elastic strain energy per unit volume of martensite scales with the aspect ratio c/ r, where 
2c and 2 r represent the plate thickness and length respectively. 

During thermoelastic equilibrium, the plate of martensite continues to increase its 
aspect ratio unit the chemical and non-chemical terms balance. Consequently, a plate of 
constant length can be thickened or thinned when the transformation temperature is 
lowered or raised, respectively. This is the origin of the terminology thermoelastic 
transformation. 


2.6.2. Nucleation of martensite 

If a martensite transformation were able to nucleate (by a random fluctuation of 
structure) in an ideally perfect crystal of the parent phase, the attendant free energy 
change could be written (Cohen and Wayman [1981]) as: 

A G = (477/3)/- 2 cAG£ _> ‘ m + (47t/3 y 2 c(Ac/r) + 2vr 2 a (11) 

assuming that a martensite plate can be represented as an oblate spheroid of radius r and 
semi-thickness c. The plate volume is then (A / rr/3>)r L c and its surface area - 2m 2 . The 
elastic strain energy per unit volume, due to the shape deformation, is given by Ac/r, 
where 


A = 


77(2 - v ) 
8 ( 1 -") 



( 12 ) 


where s and S are the shear and dilatational components of the invariant-plane strain 
shape deformation, the dilatational strain being directed normal to the habit plane of the 
martensite. The Poisson ratio (n) and the shear modulus (p.) are each taken to be the 
same for the parent and product phases. With typical values for the martensitic reaction 
in ferrous alloys, A is about 2.4 x 10 3 MJ m' 3 . 

Most martensitic transformations, and certainly the fee — » bcc (or bet) structural 
change in iron-base alloys, involve semi-coherent rather than fully coherent interfaces, 
and so the interfacial energy is estimated to be about 100-200 mJ m -2 (100-200 
erg cm -2 ) rather than to lie in the coherent range of 10-20 mJ m -2 (10-20 erg cm -2 ). 
A Gc^ m is approximately -170 MJ m -3 for a typical ferrous alloy at its martensite-start 
temperature ( M s ), and when this quantity is substituted into eq. 1 1 along with the above 
values of A and tr, the resulting function of AG versus r and c is a saddle-shaped surface 
with a barrier on the order of 8 x 10“ 16 J per nucleation event (- 5 x 10 3 eV). This is about 
10 5 kT at temperatures where the transformation typically occurs. Clearly, the available 
thermal energy is much too small for homogeneous nucleation by heterophase fluc- 
tuations, and so special nucleation sites must be considered. It is frequently thought that 
dislocation arrays serve as martensite nuclei. 
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Martensitic reactions in ferrous alloys can occur either isothermally or athermally 
(i.e., on continuous cooling). Most transformations in steels have been observed 
athermally as a function of dropping temperature, but the best quantitative insights into 
the reaction kinetics are obtained from isothermal transformations in which case 
nucleation and transformation rates can be determined experimentally. In any event (i.e., 
athermal or isothermal reaction kinetics) the transformation kinetics are controlled by the 
nucleation rate and not the growth rate. It is the relatively rapid growth rate which leads 
to this situation. 

The important problem of martensite nucleation was previously addressed by 
Christian [1970] who concluded that the critical stage in martensite nucleation involves 
the attainment of a condition from which non-thermally activated growth can begin, 
which implies the formation of a semi-coherent interface. He further suggested a 
framework within which to consider the martensite nucleation process involving (a) 
classical nucleation utilizing thermal fluctuations (either homogeneously or at randomly 
distributed preferred sites), (b) the quenching-in of a pattern of embryos or compositional 
fluctuations which are subcritical at high temperatures but become supercritical at lower 
temperatures, and (c) the non-thermally activated rearrangement of the structure of a 
defect, or the interaction of different defects to produce a configuration resembling the 
martensitic structure. Recent thinking (Clapp [1973], Olson and Cohen [1976] would 
tend to favor the last of these categories. 

Whatever the details of the mechanism of martensitic nucleation, one very important 
result is that martensite is triggered when the temperature T is such that free energy 
change for the diffusionless transformation of the parent phase y to the product phase a', 
i.e., A becomes less than a critical value A G]^ a (Kaufman and Cohen [1958], 
Magee [1970]). This critical value is about -1100 J mole" 1 for low-alloy steels. 
However, it depends on the strength of the parent phase; dissolved elements which 
strengthen the austenite make martensitic transformation more difficult and this effect 
alone can depress the M s temperature. Thus, the value of A G^ a may be much smaller 
(say -3000 J mole" 1 ) for steels which have large concentrations of strengthening elements 
(Ghosh and Olson [1992]). 

2.7. Mechanical effects in martensitic transformations 

2.7.1. Introductory comments 

The most familiar mechanisms of plastic deformation are slip, mechanical twinning 
and diffusion-induced creep. For very small plastic strains, the first two of these 
deformation modes are conservative — i.e., they preserve an atomic correspondence 
between the deformed and undeformed parts of the crystal so that the crystal contains a 
memory of its original shape. All of these deformation modes are lattice-invariant 
because although they cause a change in the shape, the crystal structure remains as it was 
prior to deformation. 

Martensitic transformation can also be regarded as a mode of deformation, but a 
mode which at the same time causes a change in the crystal structure. The shape change 
is of course, an invariant-plane strain with a large shear component and a relatively small 
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dilatational strain. The distinguishing features of a variety of deformation modes are 
listed in table 2. 

Given that the growth of martensite causes a change in shape, it is natural to expect 
the transformation to be influenced by an externally applied stress. The permanent strain 
caused by martensitic transformation is called transformation plasticity. A phase change 
in a stress-free material is usually triggered by heat treatment, when the parent phase 
passes through an equilibrium transformation temperature. Alternatively, the application 
of a stress in isothermal conditions can trigger transformation in circumstances where it 
would not otherwise occur. Unusual effects can occur when stress and temperature work 
together. The transformation may occur at remarkably low stresses or at very small 
deviations from the equilibrium temperature. This is why even minute stresses can have 
a large influence on the development of microstructure. It is not surprising that transfor- 
mation plasticity can be obtained at stresses which are much smaller than the conven- 
tional yield stress of the parent phase. 

2.7.2. Chemical and mechanical driving forces 

The interaction of an externally applied stress (which is below the yield strength of 
the parent phase) can manifest in two ways: 

(i) The stress can alter the driving force for the transformation. 

(ii) It can change the appearance of the microstructure by favouring the formation of 
those variants which best comply with the applied stress. 

Both of these factors are illustrated in fig. 22, where a fine grained polycrystalline 
sample of austenite was stressed at a temperature above its normal martensite-start 
temperature. The amount of martensite obtained is seen to vary directly with the 
magnitude of the applied stress. The stress manifests as a mechanical driving force whose 
contribution assists the chemical driving force, which on its own is inadequate to trigger 
transformation. Not only does the stress induce the formation of martensite, but only 
those variants which comply with the tensile stress grow in profusion (fig. 22b). Thus, 
most of the plates are tempted to grow on those planes which are close to the plane of 
maximum shear stress (45° to the tensile axis). The microstructure would have been 
much more chaotic in the absence of the stress. The stress caused the alignment of 


Table 2 

Characteristics of different modes of deformation/transformation 



Slip 

deformation 

Mechanical 

twinning 

Displacive 

transformation 

Reconstructive 

transformation 

Causes permanent change in shape 

Yes 

Yes 

Yes 

Yes 

Invariant-plane strain shape change 
with a large shear component 

Yes 

Yes 

Yes 

No 

Changes crystallographic orientation 

No 

Yes 

Yes 

Yes 

Changes lattice type 

No 

No 

Yes 

Yes 

Can lead to a density change 

No 

No 

Yes 

Yes 
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plates, and a more ordered, organised microstructure. 

The chemical free energy change associated with a martensitic transformation was 
depicted in fig. 20. Referring to fig. 20 again, when a stress is applied to the austenite at 
7, (between M s and T 0 ), the mechanical driving force, U, due to the stress is added to the 
chemical driving force, AG/ - *" , and the martensitic transformation starts at the critical 
stress where the total driving force is equal to AG/ - *" . U' (= AG,/ - *" - AG/ - *" ) in fig. 
20 is the critical mechanical driving force necessary for the stress-induced martensitic 



(a) 
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Fig. 22. Stress-affected martensitic transformation in a Fe-28Ni-0.4C wt.% alloy tested at a temperature above 
the martensite-start temperature, (a) Volume fraction of martensite as a function of stress, (b) Optical 
micrograph from a low stress region, (c) Optical micrograph from a high-stress region. The arrow indicates the 
direction of the tensile stress. 

transformation at T { . The mechanical driving force is a function of stress and the 
orientation of a transforming martensite plate, and can be expressed (Patel and Cohen 
[1953]) as: 

U = rs + aS, (13) 

where r is the resolved shear stress along the transformation shear direction in the mar- 
tensite habit plane, cr is the normal stress resolved perpendicular to the habit plane, and 
s and 5 are the shear and dilatational components of the shape deformation due to mar- 
tensitic transformation. For a small applied stress <r,, r and a (fig. 23) may be expressed 
as follows for any given orientation of a martensite plate (Onodera et al. [1976]); 


^( 7 , sin 26 cos a. 

(14) 

± ] ( 7,[1 + cos 20], 

(15) 


where <7, is the magnitude of the applied stress, 0 is the angle between the applied stress 
and the normal to the habit plane, and a is the angle between the transformation shear 
direction and the maximum shear stress in the habit plane. The plus and minus signs in 
eq. 15 correspond to uniaxial tension and compression, respectively. From eqs. 13-15 the 
mechanical driving force is 

U = \ <7, [s sin 20 cos a ± 5(1 + cos 20)]. 


(16) 
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Fig. 23. Schmid-factor diagram for the influence of an applied stress cr ] along stress-axis A in inducing a 
martensitic transformation on habit plane P with normal N. S is the direction of the shape strain for the 
martensite and S n , is the maximum shape strain elongation parallel to the habit plane. 


2.7.3. Critical stress to induce martensitic transformation 

When martensite is induced in polycrystalline austenite (containing randomly oriented 
grains) by an external stress, the martensite plate whose orientation yields a maximum 
value of U, eq. 16, will be formed first. If it is assumed that the angle a is zero, then the 
maximum value of U is obtained by setting dU/d9-0. For Fe-Ni alloys (s = 0.20, 
S =0.04), the value of 9 which would maximise U is therefore found to be as 39.5° for 
tension and 50.5° for compression. These angles would become exactly 45° if S = 0. 

If the chemical driving force (AG y_ " ) decreases linearly with an increase in temperature 
above M s , then a proportionally greater mechanical driving force is required before martensite 
formation is triggered. The necessary stress might then be expected to increase linearly 
with temperature, and this is in fact observed experimentally (Olson and Cohen [1972], 
Onodera et al. [1976]) over the range M s —> M” (fig. 24). However, above M a s (e.g., at 
T 2 ), martensite is induced at a stress a b after plastic deformation of the austenite occurs. 

The microstructure also changes when martensite is generated under the influence of 
an applied stress. It has been observed (Otsuka and Wayman [1977]) that only a single 
variant, out of (up to) twenty four that might be possible, of martensite is induced when 
an elastic stress is applied. This is the variant which gives rise to maximum elongation 
in the direction of the tensile axis. 

When the austenite is deformed at temperatures above A/J (e.g., at T 2 in fig. 24), it 
begins to deform plastically at a stress a a , and is strain-hardened up to a b . Then the 
martensitic transformation starts to take place. cr b is considerably lower than cr c , which is 
obtained by extrapolating the critical stress-temperature line between M s and Ml . This 
decrease (i.e., tr c - <r b ) in the critical applied stress for martensite formation is due to 
plastic deformation of austenite. As to the role of plastic deformation of austenite on the 
deformation-induced martensitic transformation, two different views have been expressed. 
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Fig. 24. Schematic stress-temperature diagram showing the critical stress to initiate the formation of martensite 
as a function of temperature. The various regimes of behavior are described in the text. 


One is the strain-induced martensite nucleation hypothesis proposed by Olson and 
Cohen [1972, 1976, 1979], and the other holds that the stress is locally concentrated at 
obstacles (e.g., grain boundaries, twin boundaries, etc.) by plastic deformation of 
austenite, and thus the concentrated stress becomes equivalent to cr c in fig. 24, the latter 
proposed by Onodera and Tamura [1979] and others. 

In contrast to the M s temperature where a martensitic transformation will begin as a 
consequence of the chemical driving force, reference is frequently made to the M d 
temperature. M d is the temperature above which the chemical driving force becomes so 
small that nucleation of martensite cannot be mechanically induced, even in the plastic 
strain regime. 

2.7.4. Transformation-induced plasticity (TRIP) 

As an example of TRIP behavior (Zackay et al. [1967]), we consider an austenitic 
Fe-29Ni-0.26C wt.% alloy (M s = -60°C, M d = 25°C) deformed in tension at various 
temperatures. The relationship between tensile properties and test temperature for this 
alloy is shown in the upper part of fig. 25 (Tamura et al. [1970]). Note that the 
elongation is nearly 100%. An inverse temperature dependence of the yield stress is 
observed between M a s and M s . This results from the shape strain of martensite formed 
before yielding of the austenite occurs. The tensile strength is increased with decreasing 
test temperature just above Mg . Such an elongation enhancement is attributed mainly to 
the suppression of necking, due to the increase in work-hardening rate by the formation 
of martensite. Furthermore, it is thought that the initiation and propagation of cracks is 
suppressed by the formation of martensite during deformation, because the stress 
concentrations may be released by the formation of preferential variants of martensite at 
regions of stress concentration. Similar results are seen in fig. 25 for two other alloys. 
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Fig. 25. Experimental data showing the effect of test temperature on the tensile properties of three metastable 
austenitic alloys strained at the rate e = 5.5 x 10~Vs (after Tamura el at, [1970]). 


With an increase in carbon content, the martensite hardness is increased substantially, 
but the austenite hardness only slightly increases. Accordingly, in the case of high-carbon 
steels, since a larger work-hardening rate can be obtained even with a small amount of 
martensite formation, the uniform elongation by TRIP becomes much larger (Tamura et 
al. [1972, 1973]). 
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2.8. Mechanical effects specific to thermoelastic martensitic transformations 

2.8.1. General 

Thermoelastic martensitic transformations are accompanied by very unusual mechan- 
ical effects in both the parent and martensite phases. In addition to the now popular 
shape memory effect (SME) and the mechanical behavior of alloys exhibiting this type 
of behavior, closer inspection shows that SME alloys also show a variety of other kinds 
of interesting mechanical behavior. Examples are the superelasticity associated with the 
formation of a reversible stress-induced martensite, the rubberlike effect, “training” and 
the two-way shape memory effect, extensive deformation resulting from stress-induced 
martensite-to-martensite transformations, unusual damping behavior, and finally, high 
stresses generated during the reverse martensite-to-parent transformation. 

The number of materials exhibiting the shape memory effect is now extensive, 
including many Cu-based alloys, those of the noble metals based on Ag and Au, the 
classic Ni-Ti alloys (NTTINOL), ternary variations of the same such as Ni-Ti-Cu and 
Ni-Ti-Fe, Ni-Al alloys, and Fe-Pt alloys, to mention a few. The martensite that forms 
in these alloys is thermoelastic and crystallographically reversible, and both the parent 
and product phases are in general atomically ordered [Shimizu and Wayman, 1972], The 
SME martensites are either internally twinned or internally faulted due to the lattice- 
invariant deformation. In addition, the parent phase is usually of the B2 or D03 type and 
the martensite crystal structure is given by the stacking sequence of planes originating 
from the parent phase {110} planes, e.g., 2H, 3R, 9R and 18R. 

2.8.2. The shape memory effect (SME) 

A brief description of the SME is as follows. An object in the low temperature 
martensitic condition* when deformed and then unstressed regains its original shape on 
heating through the A s — >A F temperature interval, as the parent phase grows at the 
expense of the martensite. The strains (6-8%) describing the original deformation of the 
martensite are completely recovered. The recovery of the original shape is entirely due 
to the reverse transformation of the deformed martensite. 

The SME effect can be explained as follows (Schroeder and Wayman [1977], 
Saburi etal. [1979]). A single crystal of the parent phase will usually transform into 24 
orientations of martensite (variants of {hkl}). But when this mixture of many variants of 
martensite is deformed, the microstructure changes into a single orientation of martensite. 
This change occurs by the motion of martensite-martensite boundaries, during which the 
variant most compatible with the applied deformation grows at the expense of the others. 
The motion of the martensite-martensite boundaries involves some twinning/detwinning 
deformation, because the twins are in fact simply other orientations of martensite. It has 
been shown that twinning can convert one orientation (variant) of martensite to another 
(Schroeder and Wayman [1977a]). Consequently, when a fully martensitic mixture of 
many orientations of martensite is deformed, one plate orientation grows at the expense 


* Usually below M F , the temperature at which the martensitic transformation is completed. 
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of the others. The favoured martensite is that whose shear component of the shape 
deformation permits the maximum elongation of the specimen in the direction of the 
tensile axis (Saburi et al. [1979]). The entire deformation is therefore due solely to 
martensite variant reorientation. Any other form of deformation (e.g., ordinary slip) tends 
to destroy the shape memory effect. 

Although a single crystal of the parent phase transforms into many orientations of 
martensite, the reverse situation does not occur. Rather, the single crystal of martensite 
resulting from deformation below the M F temperature, transforms during heating into a 
single orientation of parent phase. This is a consequence of the martensite and parent 
lattice symmetries involved and the necessity to maintain atomic ordering during the 
reverse transformation. The highly symmetric parent phase (usually cubic) has many 
crystallographically equivalent ways in which the Bain distortion can occur, giving 
numerous variants of martensite. On the other hand, the relatively unsymmetric marten- 
site (e.g., monoclinic in Cu-Zn-Al) does not have such a multiplicity of choices, so that 
only the original orientation of parent phase can form. The single crystal of martensite 
“unshears” to form a single crystal of the parent phase, thereby restoring the specimen 
to its original shape. A schematic representation of the shape memory process is shown 
in fig. 26 (Saburi etal. [1979]). 
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Fig. 26. Schematic representation of deformation processes associated with the shape memory effect. 
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This description of the shape memory effect is quite general, irrespective of the 
particular alloy system or crystal structure of the martensite. 

Figure 27 (Schroeder and Wayman [1977a] shows a stress-strain curve for a 
Cu-Zn SME alloy single-crystal specimen deformed below the M P temperature. It is seen 
that the martensite begins to deform ( a ') at a relatively low stress, 35 MN rri\ The 
residual strain at point c was completely recovered during heating. 

2.8.3. The two-way shape memory effect 

With the shape memory effect, as previously described, a specimen deformed by 
martensite variant conversion will undeform and regain its original shape during heating 
from A s to A f . In contrast the two-way shape memory (TWSM) involves a reversible 
deformation. A specimen will spontaneously deform during cooling from M s to M F and 
then undeform during heating from A s to A F . Such behavior occurs as a result of 
deformation in either the parent phase or in the martensite (Wasilewski [1975]). There 
are in fact, two types of two-way shape memory depending on the way in which the 
alloy is trained (Schroeder and Wayman [1977b]): 

(i) SME cycling involves cooling a specimen below the M F temperature, deforming it 
to produce a preferential martensite variant, as described earlier, and heating it to 
above the A F temperature. This procedure is repeated several times, the manner of 
martensite deformation (e.g., tension, compression, bending) remaining unchanged. 

(ii) SIM cycling involves the deformation of a specimen above the M s temperature in 
order to produce stress-induced martensite (SIM), followed by reversal of the SIM 
when the applied load is released. This process is also repeated several times, 
using the same means of stressing each time. 

The two-way shape memory is observed after both SME and SIM cycling, and the terms 
SME training and SIM training have been suggested to describe the process of cycling in 
the desired manner (Schroeder and Wayman [1977b]). In either case, the two-way 
behavior comes from the preferential formation and reversal of a trained variant of 
martensite formed after either SME or SIM cycling and cooling the specimen from M s 



Fig. 27. Stress-strain curve for a Cu-Zn shape-memory alloy deformed below its M f temperature. The low flow 
stress a' associated with the martensite deformation is to be noted. 
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to M F . That is, the training of a specimen to form a preferential variant of martensite 
upon cooling to M F can result from prior deformation of the martensite formed by 
cooling followed by heating, or SIM cycling above the M s temperature. In both cases the 
first part of the TWSM occurs upon cooling. In either case, the result is that a major 
portion of the martensite in a given specimen corresponds to a preferred variant, which 
produces a spontaneous strain during cooling, the amount of the strain corresponding to 
the shape strain of the preferred variant. 

The effectiveness of SME training is found to be inferior to that of SIM training. 
Nevertheless, in both cases, the preferential formation of a selected variant of martensite 
is explained by the progressive introduction of a pattern of stresses or defects into the 
material during the training processes. These stresses (or defects) apparently become 
better defined and established with increased cycling (Schroeder and Wayman [1976]). 

2.8.4. The engine effect in shape memory alloys 

Although SME materials generally deform in the martensitic condition at com- 
paratively low stresses, as seen in fig. 27, surprisingly large stresses are generated when 
the deformed martensite is heated from A s to A F . As a case in point (Jackson et al. 
[1972]), a nearly equiatomic Ni-Ti alloy when martensitic will undergo deformation (as 
at point a' of fig. 27) at a stress of about 70 MN m“ 2 . Yet when it is constrained and 
heated to A F it is found that a thermomechanical recovery stress of about 700 MN m~ 2 is 
generated. In other words, heat can be used to create a mechanical force which can do 
work. This is the basis for the design of engines based on shape memory alloys 
(Goldstein and McNamara [1979]). 

2.8.5. Pseudoelastic effects 

Pseudoelasticity refers to a situation where large strains, in excess of the elastic limit, 
are completely recovered upon unloading at a constant temperature. There are two 
categories, superelasticity and the rubber-like effect (Otsuka and Wayman [1977]). In 
superelasticity, the martensite is stress-induced and reverts to the parent phase on the 
removal of stress; the stress therefore contributes a mechanical driving force for 
transformation. By contrast, rubber-like behavior involves the deformation of existing 
martensite with no phase transformation. 

2.8.6. Pseudoelastic effects: superelasticity 

The application of a suitable stress above the M s temperature (but below M d ), induces 
the formation of martensite. In thermoelastic alloys, this stress-induced martensite 
disappears when the stress is released, giving rise to superelasticity. A stress-strain curve 
showing typical superelastic behavior for a single-crystal sample is shown in fig. 28. 
Two plateau regions are seen. The upper plateau corresponds to the formation of a 
preferred variant of stress-induced martensite plates. The preferred variant is that whose 
shape strain best complies with the applied stress (Schroeder and Wayman [1979]). 
The parallel plates of martensite nucleate, lengthen and coalesce, leading eventually to 
a sample which is a single crystal of martensite, at the end of the upper plateau region 
in fig. 28. Upon releasing the load, the stress-strain curve follows the lower plateau 
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Fig. 28. Stress-strain curve for a thermoelastic Cu-Zn shape-memory alloy deformed above the M, temperature, 
showing superelastic behavior as a consequence of the formation and reversion of a reversible, stress-induced 
martensite. 


region, which involves the formation of parallel plates of only one variant of the parent 
phase. The levels of the plateau stresses shown in fig. 28 depend on the test temperature. 
The upper plateau stress is naturally zero at the M s temperature. The temperature 
dependence of the stress to produce SIM is shown in fig. 29. The slope of the M s 
variation with temperature may be used to estimate the latent heat of transformation 
(Otsuka and Wayman [1977]). Stress-strain curves of the type shown in fig. 29 are 
frequently referred to as superelastic loops. 

2.8.7. Pseudoelastic effects: rubber-like behavior 

Rubber-like behavior occurs in the martensite phase of some alloys which undergo 
thermoelastic transformation. It does not involve a phase transformation, but is related to 
the reversible movement of transformation twin boundaries or martensite boundaries. 

It is useful to compare the usual shape-memory effect (fig. 26) with rubber-like 
behaviour, since the latter often occurs in SME alloys which are aged in their martensitic 



Fig. 29. Stress-temperature plot of experimental data showing the temperature dependence of the applied stress 
required to produce stress-induced martensite in a Cu-39.8% Zn alloy. 
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state. In the single-crystal shape-memory effect, applied deformation can convert a 
microstructure which consists of many variants of martensite into a single crystal of 
martensite. Heating then displacively transforms the martensite into the parent phase 
single-crystal, thereby removing the applied deformation. 

However, if the microstructure consisting of many variants of martensite is first 
allowed to age, and then deformed to a single crystal of martensite, the removal of the 
applied stress causes the single crystal of martensite to revert to the many variants, 
thereby reversing the applied deformation. This restoring force only arises after ageing, 
and its origins lie in the changes that occur in the state of order on ageing (Ahlers et al. 
[1978]; Marukawa and Tsuchiya [1995]; Tsuchiya et al. [1995]). 

The long-range order in the parent structure is usually imperfect because the parent 
phase occurs at relatively high temperatures. On quenching, the martensite inherits this 
imperfect order. Ageing at ambient temperature can, with the help of quenched-in excess 
vacancies, allow the state of order in the martensite to become more perfect. The 
martensite also has a lower crystallographic symmetry than the parent phase, so the 
orientation of the more perfect state of order is different for each martensite variant. This 
makes the conversion of one martensite variant into another more difficult, since the 
conversion would lead to wrongly oriented order in the growing variant. The resulting 
increase in energy provides the restoring force which tends to make the single-crystal of 
martensite revert to the original many variants when the applied stress is removed. 

2,8.8. Martensite-to-martensite transformations 

Figure 30 shows the deformation behavior of a Cu-39.8Zn wt.% single crystal 
strained at -88°C, some 35°C above the M s temperature (Schroeder and Wayman 
[1978]) The first upper plateau corresponds to the formation of SIM as described 
previously. However, the second plateau which starts at about 9% strain, corresponds to 
a second martensitic transformation which is stress-induced from the first martensite 
“mother”. The two lower plateaux are a result of the reverse transformations, occurring 
in an inverse sequence. By means of these successive martensite-to-martensite transfor- 
mations, completely recoverable strains as high at 17% can be realized. Figure 30 shows 
a double superelastic loop. It should be noted that the second stress-induced martensite 



Fig. 30. Stress-strain diagram showing the deformation behavior of a Cu-39.8% Zn alloy single crystal 
specimen deformed above the M, temperature. A double superelastic loop is observed which is the result of two 
successive stress-induced martensitic transformations. The first stress-induced transformation is that depicted 
by fig. 25, while the second stress-induced transformation originates from the first-formed (under stress) 
martensite. 
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in the Cu-Zn alloy can be formed from the first SIM, or from the normal thermally 
formed (upon cooling) martensite. 

In the case of the Cu-Zn alloy discussed above, the initial martensite has a 9R 
structure, and additional deformation changes this into a 3R structure. This occurs by 
shearing on the basal (close-packed) plane of the 9R such that the structural stacking 
sequence is changed from ...ABCBCACAB... to ..ABCABCABC... 

3. Crystallographically similar transformations 

3.1. The bainite transformation in steels 

It is widely agreed that the bainite reaction in steels has certain martensitic character- 
istics but there may be some diffusion of carbon at some stage of the reaction. The 
surface relief accompanying the growth of bainite is an invariant-plane strain with a large 
shear component. There are also other martensitic-like features such as an irrational habit 
plane and transformation substructure, and various studies of the austenite-bainite 
orientation relationship imply that the Bain correspondence between the austenite and 
bainitic ferrite exists. (See also ch. 15, §2.6). 

Whereas the detailed role of carbon in the mechanism of the bainite transformation 
requires further work, it is clear that the iron and substitutional atoms are transferred in 
a coordinated manner across the bainite-austenite interface. Consequently, it is reason- 
able to expect the phenomenological crystallographic theory to apply to the bainite 
transformation, and this is indeed found to be the case in practice. 

3.2. Oxides and hydrides 

Various metallic oxides and hydrides form with a plate morphology and exhibit an 
invariant-plane strain shape deformation with a large shear component. Such plates are 
also internally twinned or internally slipped. An example of an internally twinned plate 
in a tantalum oxide is shown in fig. 31. In this case the observed crystallographic 
features of the oxide plates are accurately predicted by the phenomenological crystallo- 
graphic theory developed for martensitic transformations (Van Landuyt and Wayman 
[1968]). A lattice correspondence therefore exists between the metallic atoms in the two 
phases and remains intact despite the occurrence of interstitial diffusion (of oxygen) 
during the phase change. 

33. The CuAu II ordering reaction 

The formation of the ordered orthorhombic CuAu II phase from the disordered cubic 
phase parent at about 390°C generates an invariant-plane strain relief with a large shear 
component, and conforms well with the phenomenological crystallographic theory of 
martensitic transformations. This led Smith and Bowles [1960] to conclude that the 
mechanism by which the internally twinned plates of the CuAu II form, is similar to that 
of a martensite, even though place changes between neighboring atoms are necessarily 
involved during the ordering reaction. But Aaronson and Kinsman [1977] dispute this. 
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Fig. 31. Transmission electron micrograph showing plates of the sub-oxide TaO y following the oxidation of 
tantalum. The striations within the plates are twins, and the general crystallygraphy of the process is well 
described by the phenomenological crystallograpic theory developed for martensitic transformations. 


claiming that the atomic jumps required for ordering are inconsistent with a shear 
transformation and that conformity with the crystallographic theory is not a sufficient 
criterion to identify a shear transformation. 

The CuAu II ordering reaction was later considered by Bowles and Wayman [1979] 
who suggested that the place exchanges between neighboring atoms, required to achieve 
the ordering, are redundant in the sense that they do not contribute to the transformation 
shape change so that the theory still prescribes the displacements involved in producing 
the lattice change. They proposed that the growth of CuAu II plates occurs by the glide 
of transformation dislocations, i.e., the structure of the interface is glissile as required for 
displacive transformation. 

Christian [1994] has analysed these difficulties in detail. Solid-state phase trans- 
formations are often classified as either reconstructive or displacive depending on 
whether or not individual atoms are mobile. In an entirely displacive transformation, each 
atom moves, relative to its neighbours, by less than an interatomic distance. Thus, the 
number of atoms must be conserved in any incremental volume transformed from the 
parent to the product phase by an entirely displacive (i.e., martensitic) mechanism. This 
is not necessarily true for a reconstructive transformation where the atoms are highly 
mobile, and material transport can occur in a way which reduces the strain energy so that 
the shape change does not have a shear component. The ordered products or precipitates 
(such as CuAu II) whose growth involves very short-range diffusion, but nevertheless 
exhibit shape changes with appreciable shear components, are not simply reconstructive, 
but diffusional-displacive . The interfacial structure of this kind of a transformation 
product must be the same as that of martensite, and the transformation will exhibit all the 
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crystallographic features of martensite, with the plate-shape determined by a need to 
minimise the strain energy associated with the shape deformation. 

4. Omega phase formation 

The (3 —>(o phase change (cubic-hexagonal) involves correlated atomic displacements 
and occurs in Ti, Zr and Hf alloys and in /3-phase alloys of the noble metals. It has been 
extensively investigated in recent years. The reader is referred to two reviews: Sass 
[1972] and Williams etal. [1973]. The /3 — » <o phase change occurs both athermally and 
isothermally, with the same mechanism for both paths (Williams etal. [1973]). 

Omega phase formation can occur reversibly and without diffusion at quite low 
temperatures, and the w -phase cannot be suppressed by rapid quenching. These character- 
istics parallel those found in martensitic transformations, but unlike martensite, the 
w-phase “particles” exhibit a cuboidal or ellipsoidal morphology, depending upon the 
relative parent and product lattice parameters. At least during the early stages of 
transformation, the a) particles are quite small (“1.5 nm), and they are aligned along 
<11 1> directions. Their observed number-density is very high (10 18 -10 19 cm -3 ). The 
smallness of the particles and their high number-density suggest that their nucleation is 
not as serious a barrier as their growth, in contrast to martensitic transformations where 
the opposite is the case. Figure 32 shows an example of omega particles. 

A number of alloys exhibiting the /3 — » <a transformation feature the following 
orientation relationship: 

(1 1 1 )^ | ( 0001 ). 

[1 10]^ || [2 1 1 0] M 

Interestingly, these same alloys (although at different compositions) also undergo a 
/3 — » a martensitic transformation in which case the orientation relationship is of the 
Burger’s type: 

(110) , II (000 1)„. 

[111] ^ | [11 50k 

It should be noted that the /3 — » <a orientation relationship shows a multiplicity of only 
four, which, when compared to that of martensitic transformations (where the multiplicity 
is much higher) suggests that the respective lattice correspondences are different. 

Another comparison can be made between the diffusionless (3 —><o and /3 a' 
(martensitic) transformations. When martensite forms, the atomic displacements are 
mostly homogeneous, as given by a Bain-type distortion. Each atom (apart from 
additional shuffles in some cases) is homogeneously transported to its final position 
according to the Bain deformation and correspondence. But in the /3 — » tu transformation, 
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Fig. 32. Transmission electron micrograph (dark field) showing w-particles formed at 480°C in a Ti-Mo-Sn-Zr 
alloy. Only one of four variants of the or- phase is in contrast. 


some of the corresponding atoms undergo movements whereas others do not. Figure 33 
(Sass [1972]) shows a (1 0 1) section through a /3 unit cell. Atoms A, B, E and F remain 
in place during the /3 — » a> transformation, whereas atoms C and D undergo shuffle 



Fig. 33. Schematic diagram showing atomic movements involved during w-phase formation. This is a (lOl) 
section through the parent bcc unit cell. The 2/i < 1 1 1 > longitudinal displacement waves cause atomic 
movements (in directions indicated by the two arrows) which are those needed to form the w-structure (after 
Sass [1972]). 
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movements in opposite sense along [I I I]p, in the end reaching the position j[1 1 1], 
initially having been situated at the \ and j positions. Accordingly, the 1st, 4th, 7th, 10th, 
etc. planes remain unchanged while 2 and 3, 5 and 6, 8 and 9, etc., shuffle past each 
other in opposite senses. The /3 — » w atomic movements are formally consistent with atom 
movements given by a §<111> displacement wave having nodes at n<lll> /3 positions 
(n= any integer). 

Advances in understanding the /3 — » a> transformation have come from the application 
of transmission electron microscopy and diffraction. Diffraction patterns frequently 
exhibit networks of diffuse intensity (sheets of intensity on <1 1 l>p planes) which obscure 
the identification of diffraction maxima from the w-phase. These diffuse diffraction 
effects are observed at temperatures above which those from the w-phase are clearly 
identified, and the former have been ascribed to pretransformation “linear defects”. 

Impurities have a marked influence on the /3 -» a> transformation. For example, the 
presence of 1200 ppm of oxygen can lower the w-start temperature by about 600 K, 
presumably because oxygen stiffens the matrix and depresses the transformation start 
temperature. Apparently the oxygen atoms somehow interact with the pretransformation 
linear defects along <11 1>^ and impede their ordering. 

There are some outstanding problems with w-phase formation, including the possible 
role of lattice vacancies in promoting linear defects and hence nucleation, the high 
nucleation frequency and low particle growth rate, and the extreme embrittlement of the 
matrix /3-phase after the w-phase forms. 

5. Phase changes and charge density waves 

A charge density wave (CDW) is a static modulation of conduction electrons and is a 
Fermi-surface driven phenomenon usually accompanied by a periodic distortion of the 
lattice. In essence, the electronic energy of the solid is lowered as a consequence of the 
lattice distortion, the attendant strain energy of which is more than compensated by the 
reduction in electronic energy. The present understanding of CDWs in conductors follows 
from the pioneering work of Peierls [1955], Kohn [1959] and Overhauser [1968, 1971]. 
A detailed description of CDW phenomena has been published by Wilson etal. [1975]. 

Peierls [1955] suggested that a simple one-dimensional solid is susceptible to a 
periodic lattice distortion which can lower the total energy of the system by means of a 
distortion which changes the lattice periodicity. The basic concept is illustrated in fig. 34. 
fig. 34a shows the familiar E-k relationship for a one-dimensional monovalent metal. 
The first Brillouin zone with boundary at ±ir/a is half occupied. The upper part of the 
figure shows that the electronic charge density is of periodic form, being maximum in 
the vicinity of the ion cores. In fig. 34b the one-dimensional lattice has been periodically 
distorted as shown in the upper part of the figure and the resulting period 2 a causes the 
formation of a charge density wave of period 2 a associated with the new superlattice. 
Since the lattice has been doubled in real space, a new band gap will appear at ±ir/2a 
in £-space which permits some electrons to “spill down”, thus lowering the Fermi energy. 
Thus the (periodic) lattice distortion has a rather simple driving force which is purely 



Ch. 16, §5 


Phase transformations, nondiffusive 


1549 


r°i 

vwv^ 

Lattice o o o o o 

(normal) I I 


r 2o -i 



0 > -K) 0**0 0 > 


l— ao— | 


— CDW 

Periodically 

distorted 

lattice 



Fig. 34. Schematic drawing showing a periodic lattice distortion, charge density waves, and energy band gaps 
before and after a periodic lattice distortion in a hypothetical one-dimensional metal. Note the overall lowering 
of the Fermi level following the periodic lattice distortion. See text. 


electronic in origin. Numerous examples of a CDW phase change have recently been 
found in quasi-one-dimensional organic conductors and quasi-two-dimensional layered 
compounds (e.g., transition-metal dichalcogenides). 

A favorable Fermi surface geometry is necessary for the formation of a CDW, which 
will most likely occur when the shape of the Fermi surface permits a connection by the 
same wavevector Q, i.e., Q = 2k^. This modulation with wavevector Q will modify the 
Fermi surface by creating gaps at these nested positions. If the nested (i.e., dimpled) 
portion of the Fermi surface is significant, the energy “gain” by creating energy gaps 
may overcome the energy “cost” arising from the strain associated with the periodic 
lattice distortion, thus allowing the formation of a CDW. In other words, a structural 
change will occur when the CDW formation is accompanied by ion displacements which 
stabilize the charge perturbation. Another requirement for forming a CDW is a strong 
electron-phonon coupling, required to permit ionic displacements to reduce the otherwise 
prohibitive Coulomb energy, and precursor phenomena such as a soft phonon mode 
(Chan and Heine [1973]) might occur above the transition temperature to assist the 
CDW instability. The normal crystalline periodicity is altered in the presence of a CDW. 
However, it should be noted that the wavevector of a CDW is determined by the Fermi 
surface and is therefore not necessarily an integral fraction of a reciprocal lattice vector 
of the undistorted parent phase. Consequently an “incommensurate” phase may result, 
which is considered to have lost its translational symmetry (Axe [1977]) and is thus a 
quasi-crystalline phase. 

The incommensurate state described above may not actually correspond to the lowest 
possible energy state and, accordingly, the CDW or the lattice may undergo a further 
distortion which makes the two “commensurate” in which case the CDW wavevector is 
an integral fraction of the underlying lattice. The commensurate state is usually referred 
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to as a “locked-in” state. Thus, there can be two phase changes associated with a CDW 
formation: the normal (parent)-to-incommensurate transition (usually second order) and 
the incommensurate-to-commensurate transformation (first order). Since a CDW is 
accompanied by a lattice distortion, diffraction techniques (electron, neutron, X-ray) can 
be used to reveal satellite reflections appearing near the Bragg reflections of the parent 
phase as a consequence of the new superlattice associated with the formation of the 
CDW. These satellites are separated from the associated Bragg reflections by a reciprocal 
lattice vector determined by the CDW wavevector. 

In lower-dimensional systems (1-d and 2-d materials) the possibility of CDW 
formation is enhanced because the simple structures involved lead to a high probability 
for favorable Fermi surface nesting. But in three-dimensional materials CDW phenomena 
should, in principle, be rather rare because of the unlikelihood of favorable Fermi surface 
nesting. 

Besides the incommensurate state mentioned above, McMillan [1976] introduced 
the concept of discommensurations, which are narrow out-of-phase regions (defects) 
between large in-phase (commensurate) regions. Accordingly, the incommensurate-to- 
commensurate change is viewed as a defect (discommensuration) “melting” transition. 

A recent study (Hwang et al. [1983]) of the so-called “premartensitic effects” in 
Ti-Ni alloys has been carried out using Ti-Ni-Fe alloys containing a few percent of Fe 
in order to suppress the martensitic transformation (lower M s ) while at the same time 
leaving the temperature regime of the “premartensite” phenomena essentially unchanged. 
Space does not permit a detailed presentation of this work, but some of the highlights are 
interesting and these are briefly considered. 

The “pre-martensitic” resistivity anomaly (increase with decreasing temperature) in 
Ti-Ni type alloys has been ascribed (Hwang et al. [1983]) to the formation of a three- 
dimensional charge density wave which appears to evolve in two stages. In the first, the 
B2 (CsCl) parent is gradually (second order) distorted into an incommensurate phase 
with decreasing temperature. The incommensurate phase is most simply described as one 
with “distorted cubic” symmetry. Its gradual transition involves the appearance of new 
j<110>g2 superlattice reflections which intensify with decreasing temperature. At this 
stage, the y<110> superlattice reflections are in slightly irrational (incommensurate) 
positions. As the temperature is further decreased these superlattice reflections “lock in” 
to commensurate positions which are precise multiples of y< 110 >g 2 , and the resulting 
structure is rhombohedral (designated the R-phase) as a consequence of a homogeneous 
distortion which involves an expansion along <11 1> cube diagonals (and a small 
contraction in all directions in the plane normal to <11 1>). This incommensurate-to- 
commensurate transformation is first-order in nature. Thus, the events observed during 
the so-called premartensitic state of Ti-Ni-X alloys identify with the sequence: B2(CsCl) 
— > distorted cubic (incommensurate) — » R, rhombohedral (commensurate). In addition, 
microdomains typical of anti-phase domains in ordered alloys have been identified using 
dark field electron microscope images formed by using the new superlattice reflections. 

Finally, it should be mentioned that the incommensurate-to-commensurate transfor- 
mation in Ti-Ni-Fe alloys is also associated with the appearance of twin-like domains, 
as shown in fig. 35. The twin plane has been identified as a [110] plane. It would 
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Fig. 35. Twin-related domains as a consequence of an incommensurate-to-commensurate phase change in a 
Ti 50 Ni 47 Fe 3 (at%) alloy. See text for explanation. 


appear that the only difference between the “twins” and the “matrix” within which they 
exist is that the sense of the <11 1> expansion (Bain distortion) occurs along different c 
axes in each. In this respect, the formation of twins during the incommensurate-to- 
commensurate change appears to be a result of overall strain compensation, not unlike 
that occurring in a martensitic transformation. But in the case of the Ti-Ni-Fe alloys, 
when the actual martensite is formed at a lower temperature, the {1 10} twins and the 
commensurate R-phase are completely destroyed by the advancing martensite interface, 
and, seemingly, the “premartensitic” commensurate twinned structure presents no 
obstacle to the growth of the martensite plates. 

In the above description of charge density waves and related phenomena, a measure 
of consistency has been attempted by referring to the second order parent -» incommen- 
surate (gradual) change as a transition, as opposed to the first-order incommensurate -» 
commensurate (abrupt) change which is viewed as a transformation. 
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1. Iron and steel 

1.1. Introduction 

Steels are alloys of the element iron with carbon and usually several other elements, 
but the term is also applied to some newer alloys that contain virtually no carbon. There 
are three major groups of alloying elements, those that form interstitial and substitutional 
solutions, and those that are immiscible with the crystal lattices of iron (table 1). We 
shall not deal with the last category in the following but we do not imply that they are 
of no importance. Commercial steels always contain both types of alloying elements, 
some added intentionally, others unintentionally during production and processing of the 
steel. Unwanted elements are, for example, N (from air), S (from coke), P, As (from 
some ores), and H (introduced during pickling). Examples of the compositions of some 
commercial steels are given in table 2. 

Iron and steels have been our most important metallic materials for several millennia 
and will continue to be so for the foreseeable future. There are several reasons for this 
preeminent position: 

a) The crust of the earth contains about 4.2 wt% Fe, which can be rather easily reduced 
to the metallic state. 

b) Iron has a melting temperature (1540°C) that allows thermally activated processes at 
moderate temperatures (T>400°C) while these processes (except those involving 
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Solubility of the elements in the crystal lattices of iron. 
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Table 2 


Important alloying elements in some commercial steels. 


no. 




Composition (wt%) 



Structure 

Application 


C 

Mn 

Cr 

Ni 

Mo 

W 

Si 

V 



i 

0.12 

0.45 

- 

- 

- 

- 

- 

- 

ferrite plus 
pearlite 

construction 

steel 

2 

0.65 

0.70 

- 

- 

- 

“ 

- 

- 

tempered 

martensite 

tool steel 

3 

0.50 

< 1.0 

13.0 

- 

- 

- 

- 

- 

tempered 

martensite 

knife blade steel 
(stainless) 

4 

0.10 

0.40 

5.0 

- 

0.5 

- 

- 

- 

bainite 

high temperature 
steel 
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< 0.15 

0.40 
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6.0 

- 

- 

- 
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ferrite 

low temperature 
steel 
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0.80 

0.30 

4.0 
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18.0 
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2.0 

tempered marten- 
site and tungsten- 
carbide 

high speed steel 
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0.10 

< 1.0 

18.0 

8.0 
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- 

- 

- 

austenite 

stainless steel 

8 

<0.07 

- 

- 

- 

- 

- 

4.0 

- 

ferrite 

transformer steel 


interstitial diffusion) are very slow at room temperature. 

c) There are two phase transformations in solid iron that allow the formation of a wide 
variety of microstructures. As a consequence a large range of physical and mechan- 
ical properties can be obtained. 

d) Iron is ferromagnetic at temperatures below 768°C the Curie temperature, T c , a highly 
important characteristic which leads to several anomalies of other properties. 
Despite the long history of steel there is still room for innovations, even in mass 

production — witness the advent, since about 1975, of micro-alloyed steels, dual-phase 
steels and interstitial-free (IF) steels. Steady progress continues to be made in the 
understanding of the finer details of microstructures in steels, the development of 
crystallographic orientations, the distribution and interactions of solute elements and the 
control of mechanical properties by precipitation of solutes from solid solution. These 
advances have been made possible through the use of new tools that enable us to observe 
internal structure bn an atomic scale. 

1.2. Some properties of pure iron 

The iron atom is situated towards the end of the first transition series with an electron 
configuration Ar3d 6 4s 2 . The neighboring elements with larger atomic number than iron 
(Co, Ni, Cu) crystallize in close-packed structures until the 3d states are filled in Cu. The 
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transition elements with less-filled 3d states (J3- Ti, V, Cr, S-Mn) have a bcc structure. 
The maximum of the binding energy for Cr with its half-filled 3d band indicates that a 
contribution of covalent bonding by 3d-states may stabilize this structure (fig. 1). 

Iron is found between these two groups of elements. It crystallizes in both the fee 
(912° <T y < 1394°C) and the bcc lattices (1394° >T a < 1538°C) and again at T a <912°C. 
The re-occurrence of the bcc structure at high temperatures can be rationalized by 
dividing the difference in free energy of bcc and fee iron, A F a ^, into a magnetic and a 
nonmagnetic term (Zener [1955]). The magnetic term is negligible up to the temperat- 
ures at which magnetization of a-Fe is nearly constant. At higher temperatures (depend- 
ing on T 0 ) the bcc structure is increasingly stabilized due to the entropy of demagnet- 
ization. Without this effect, a-Fe should transform to •y-Fe at about 700°C without 
another transformation at 1394°C. In fig. 2, A F a _ y =F a -F y is plotted as a function of 
temperature. The contribution due to magnetic uncoupling in the a-phase is shown as a 
dashed line. This term produces the curvature of AF a ^(T) as determined by Johannson 
[1937] and the re-occurrence of the bcc structure above 1394°C. 

The increasing value of A F with decreasing temperature indicates that an increas- 
ing pressure is needed to produce the closest packed iron at temperatures below 912°C. 
At 20°C this pressure is 11.3 GPa, determined by both static and dynamic measurements. 
Iron can exist in at least three crystal structures, fee y, bcc a and hep s. The closest 
packed phase, e, exists at pressures in excess of 11.3 GPa and at temperatures from 
ambient to at least 2000K. A triple point at which a, y and e coexist occurs at about 
775K and 10 GPa (fig. 3). There is some evidence that a fourth or even a fifth phase 
may exist (Boehler [1986]) but confirmation is required. 

There is no experimental evidence for the existence of thermal vacancies in quenched 
a-Fe, perhaps because of the high ratio of energy of formation, u P , to energy of motion, 
u M \ u F /u M ~ 4, as compared to about 1 for the fee lattice. Vacancies probably anneal out 
during quenching of a-Fe (Johnson [I960]). It is likely that these results are strongly 
influenced by interstitial elements. 

The Burgers vector of the minimum energy dislocation in a-Fe is (a/2)<lll>. 
Segments of a<100> can form by the reaction (a/2)[lll] + (a/2)[lll] — >a[100]. The 
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Fig. 1. Melting temperature T u , and density, p, versus atomic number Z of the elements of the first transition 
period. 
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Fig. 2. Free energy difference A F resolved into magnetic and nonmagnetic components (after Zener [1955]). 


stacking-fault energy of a-Fe (as of the other bcc transition metals) is very high. 
Stacking faults have therefore not been observed in a-Fe even by subtle weak beam 
electron microscopy. Details of the core structure of dislocations in the bcc lattice and 
their consequences for crystal plasticity are discussed in ch. 20, §3.2. The probability for 
the occurrence of annealing twins is low but twins generated during cooling through the 



Fig. 3. Iron phase relations (Saxena el al. [1993]). 
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y — > a transformation are common. In -y-Fe the Burgers vector of the complete 
dislocations is (a/2)<110>. The stacking-fault energy (at I000°C), determined from the 
frequency of annealing twins, is 75 mJ/m 2 (Nutting and Charnock [1967]). (See ch. 
28, fig. 55). Stacking faults and annealing twins are frequently found in -y-Fe and its 
alloys. 

Many discontinuously during the phase transformations of iron. They are often used 
to determine the transformation temperatures in steels. There are discontinuities of the 
self-diffusion coefficient of iron at the transformation temperatures. The diffusivity of 
iron in a is about 10 2 times higher than in y at the same temperatures (fig. 4). This can 
be explained qualitatively by the closer packing of -y-Fe (Birchenall [1951]). Below T c 
there is an anomalous decrease of the diffusion coefficient of about 60% during the tran- 
sition to complete magnetic order. A change in equilibrium vacancy concentration, not 
magnetostriction, was used as an explanation for this behavior (Birchenall and Borg 
[I960]). Since then, much work has been done on this anomaly (reviewed by JOnsson 
[1992]); it is now attributed to a change in the activation energy for atomic jumps. 

An amorphous structure can be obtained for pure iron neither by vapor deposition nor 
by splat cooling down to temperatures close to 0 K. Such a structure is exclusively found 
in certain alloys of iron. (See also sonochemical approach, ch. 19, §3.6). 
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Fig. 4. Self-diffusion coefficient of paramagnetic (D ap ) and ferromagnetic (D o/ ) a-iron; Q in kJ. (After JOnsson 
[1992]). 
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2. Alloys of iron 

2.1. Interstitial alloys 

The elements H, C, N and O can occupy interstitial sites in the lattices of iron. All 
these elements fulfill the condition r, « r Fe (table 3), The interstitial voids in fee iron are 
larger than those in the bcc lattice (fig. 5). The largest void in the fee lattice, 0.052 nm 
in radius, is at an octahedral site; the next largest is 0.028 nm in radius. The radius of a 
carbon atom is 0.08 nm and for nitrogen is 0.07 nm. These atoms in solution reside in 
the octahedral sites, thereby expanding the lattice of austenite. 

In bcc iron the interstitial voids are smaller than those in austenite. Although the 
octahedral site is only 0.019 nm in radius as compared to 0.036 nm for the tetrahedral 
site, C and N atoms reside in the octahedral sites, presumably because there they have 
only two iron atoms as nearest neighbors rather than four in the tetrahedral sites. It is 
this difference in size of the interstitial sites that permits the higher solubility of C in 
austenite than in ferrite which in turn allows the formation of a great variety of micro- 
structures as C precipitates from supersaturated solid solution in a iron. As shown in fig. 
6, diffusivities in bcc iron depend upon the size of solute atoms. 

The solubilities depend on the phase with which the a -iron interstitial solid solution 
is in equilibrium: stable equilibria would be, for example, those with nitrogen gas or with 
graphite. There is a high activation energy of nucleation of these two elements in iron 
because of large surface energy terms. Metastable phases form instead, with lower 
activation energy and higher solubilities in the iron-rich solid solution. Phases that form 
in Fe-C and Fe— N alloys are listed in table 4. In fig. 7 the solubilities of some of these 
phases are shown. Which phase forms depends on temperature and time of heat treatment 
and on the imperfection structure of the solid solution. The probability of formation of 
nitrogen gas or graphite is so low that these phases rarely precipitate in steels. An 


Table 3 



Maximum solubility of interstitials in 

a-Fe. 


element 

atomic radius (for 
coord, no. 12) 

(nm) 

max. solubility (at%) 

at temperature (°C) 

in equil. 
with 

H 

<0.04 

1-2x10 ' 3 

905 

h 2 

6 

0.098 

0.005“ 

912 

Fe 2 B 

C 

0.091 

0.10 

727 

Fe 3 C 

N 

0.092 

0.38 

585 

Fe 4 N 

O 

-0.092 

0.7-13x10^ b 

906 

FeO 

* Ohtani [19881 
b Swisher [1967] 
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(W 

Fig. 5. Interstitial voids in iron, (a) Interstitial voids in the fee structure, octahedral (1) and tetrahedral (2). (b) 
Interstitial voids in the bec structure; octahedral (1) and tetrahedral (2). 

important exception occurs when carbon or low-alloy steels are held in the range 450°- 
650°C for hundreds or thousands of hours. The pearlite in the steels decomposes to 
ferrite and graphite. Normally Fe 3 C (cementite,) or less stable carbides form instead of 
graphite. Recently, Cottrell [1993] has published an electronic interpretation of the 
high stability and curious crystal structure of cementite. In fig. 8 the metastable Fe-Fe 3 C 
diagram is shown in addition to the stable Fe-graphite diagram. 

Chipman [1972] gives the following equations for the solubility of carbon in a-iron 
in equilibrium with graphite, cementite or e carbide (Fe 24 C): 
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Temperature pc] 



hop time 


Fig. 6. Average diffusion coefficients D of substitutional (s) and interstitial elements in iron. (2, = 250 ± 40 
kJ/mol; in a-Fe: (2 C =83 kJ/mol; Q N = 1A kJ/mol; Q 0 = 95 kJ/mol. 


graphite: log [C] ppra = 7.81 - 

4040 

cementite: log [C] ppm = 6.38 - -y- 

1335 

^-carbide: log [C] ppm = 4.06 — (approximate) 

Hassebe etal. [1985] agree with these solubilities for cementite and graphite if the slight 
deviation from the log [C] and \/T linear relation caused by the magnetically induced 
anomaly below the Curie temperature is ignored. 

There are several important composition ranges in the Fe-Fe 3 C diagram (fig. 8): 

a. The maximum solubility of carbon in y-iron (austenite) 2.11 wt% (9.12 at%) is the 
limiting composition separating steel and cast iron. 

b. Cast iron has usually a hypo-eutectic or eutectic composition: C E = 4.3 wt% = 17.3 at%. 


Table 4 

Metastable compounds that precipitate in Fe-C and Fe-N alloys 



Composition 

Crystal 

structure 

Nucleation sites 

Formation 

temperatures 

cementite 

Fe,C 

orthorhombic 

dislocations, grain b. 

> 200°C 

e-carbide 

Fe 24 C 

hep 

matrix, dislocations 

< 250°C 

y'-nitride 

Fe 4 N 

fee 

dislocations, grain b. 

> 250°C 

a'-nitride 

Fe s N 

fig. 14 

matrix 

< 300°C 
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Fig. 7. Solubilities of C and N in a-Fe in equilibrium with different phases (average values from several 
investigations, after Leslie and Keh [1965]). 

Le — > -y-Fe(C) + Fe 3 C. 

c. Steels can be subdivided in hypo-eutectoid and hypereutectoid depending on whether 
their carbon content is smaller or larger than 0.77 wt% (3.48 at%). The 
microstructure that originates from the eutectoid reaction 

-y-Fe(C) — > a-Fe(C) + Fe 3 C 

is known as pearlite, if it is formed by lamellar growth. 

d. The maximum solubility of carbon in a-Fe is 0.0218 wt% (0.10 at%). At lower 
carbon contents, carbides can only form by precipitation from ferrite. 

The lower the carbon content, the lower the temperature below which precipitation 
can start. Below 250°C the less stable e-carbide can form. The smallest carbon contents 
that can be obtained currently (l-2)x 10" 4 wt%, correspond to the solubility at about 
50°C. 

Nitrogen has a relatively large solubility in a-Fe as compared to carbon. The 
solubility of oxygen is very small (Swisher and Turkdogan [1967]). The difference 
between the solubilities of N, C and O in a-iron cannot be explained on the basis of 
atomic-size ratios alone (table 3). 

Solid solubilities are determined on annealed specimens of the highest obtainable 
purity. Interstitial solute atoms tend to segregate to lattice defects because they offer sites 
that are energetically more favorable than those in the undisturbed lattice. These can be 
point, linear and planar defects such as vacancies, some substitutional solute atoms, 
dislocations, sub-boundaries, grain boundaries and free surfaces. However, in annealed 
specimens the concentration of such defects is insufficient to cause significant variation 
in the measured solubility, except perhaps of B and 0 whose solubilities in a-iron is 
extremely low. The important effects of segregation of solutes in steels are treated in 
detail later. 

The precipitation of interstitial solutes from solution in a-iron at low temperatures is 
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Vifeight Percentage Carbon 


Fig. 8. The Fe-C equilibrium diagram up to 7% carbon. Solid lines indicate Fe-Fe 3 C diagram; dashed lines 
indicate Fe-graphite diagram. 


strongly affected by lattice defects, especially dislocations (table 4). The presence of the 
defect serves to minimize the strain energy term in the activation energy for nucleation 
of the precipitates, so that the distribution of precipitates is determined by the distribution 
of defects. 
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2.2. Substitutional alloys 

The elements marked 2 (table 1) are completely or partially soluble as substitutional 
elements in the iron lattices. The Hume-Rothery rules for solubility are well fulfilled for 
alloys with transition elements: large solubility of the elements in the neighborhood of Fe 
(Ni, Co, Mn, Cr and V), limited solubility of Ti, and practically no solubility of Ca and 
K. The alloying elements have different energies of solution in a- and y-iron. The 
A F ay (T) curve (fig. 2) is shifted in different directions depending on whether A is 
increased or decreased by the alloying element (fig. 9). This leads to an increase or 
decrease of the temperature range in which austenite is stable. For simplicity the curves 
are drawn for a- and y-phase of the same composition (i.e., T 0 -temperatures are shown 
for A = 0). We can divide the alloying elements into two groups, depending on 
whether they close or open the y-field. Typical phase diagrams are shown in figs. 8, 10 
and 11. The following rules tell how the alloying elements behave in this respect: 

a) the y-field is opened by all interstitial elements, because the fee lattice provides sites 
of smaller strain energy than does the bcc lattice; 

b) the y-field is opened by elements with a fee or hep structure (Cu, Au, Ni, Pt, y-Mn); 

c) the y-field is closed by elements that form Hume-Rothery phases with Cu, Ag and 
Au, even if they form a fee lattice (Al, Si, P, Zn, Ga, Ge, As etc.); 

d) the y-field is closed by the bcc transition metals (V, Ti, Mo, W, Cr etc.); The phase 
diagram of Fe-Cr alloys shows that small additions of chromium lower the y — » a 
transformation temperature while the loop is closed by larger Cr-additions. Zener 
[1955] has pointed out that this behavior can be explained by the magnetic term of 
the free energy of a-Fe (fig. 2). The addition of Cr to Fe has two effects: It shifts the 
A F ay (T) curve as shown in fig. 9 and it lowers the Curie temperature T c . Con- 
sequently the term A F ay magn (fig. 2) appears at a lower temperature and changes the 
shape of the curve to that shown in fig. lib. This ^-effect should affect transform- 



Fig. 9. The effect of solute elements with larger heat of solution in y-Fe than in a-Fe (e.g. Mo) and vice versa 
(e.g. Ni) on the transformation temperatures = 0), assuming constant concentrations of a and y-solid 
solution and no change of T c in a-Fe (schematic). 
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Fig. 10. Proposed Fe-Ni phase diagram based on electron microscopy, electron microprobe studies, and 
Mossbauer spectroscopy of iron meteorites (Albertsen et al. [1983]). 


ation temperatures of all alloys of iron. In Fe-Cr alloys it is evident because the 
difference in heat of solution of Cr in a-Fe and y-Fe is relatively small. 

In many alloys the a- and y-field is bounded by a miscibility gap (fig. 11c). The a- 
or y-solid solutions are then in equilibrium with an intermetallic compound or a solid 
solution. Special attention is drawn to the ordered bcc structures Fe 3 Al and Fe 3 Si that can 
form as stable coherent precipitates in a-iron. In addition metastable equilibria occur 



Fig. 11. Closed y-fields in Fe-Mo, Fe-Cr, and Fe-Nb phase diagrams. 
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frequently in substitutional alloys of Fe which form the matrix of microstructures. Such 
coherent phases have been found in binary bee solid solutions with Cu, Au, Al, Mo, W, 
Be (Hornbogen et al. [1966]). They play an important role in maraging steels. 

The crystal structure of the solid solution of Fe at room temperatures, as it follows 
from the phase diagrams (figs. 10, 11), is used to classify steels. If they consist predom- 
inantly of a-Fe solid solutions they are known as ferritic steels; solid solutions of y-Fe 
are the base of austenitic steels. A metastable fee structure can be obtained, for example, 
from the ternary alloy indicated in fig. 12. This metastable phase is the well-known 
austenitic stainless steel. Precipitation of stable or metastable coherent, ordered phases 
(yO is very common in the fee iron alloys. Such (y + y') microstructures provide the base 
for creep resistance of precipitation-hardened stainless steels and Ni-base superalloys. 

2.3. Interstitial plus substitutional alloys 

Even plain carbon steels contain substitutional elements, especially Mn, present either 
intentionally or inadvertently. Therefore interstitials are always in interaction with atoms 
substituted in the iron lattice. The interaction energy as compared to that with iron, Au, 
can be positive or negative. Au = ( u Fe _,) - (« H ). The lattice of iron with interstitial (i) and 
substitutional (s) atom in solution can be expected to show a distribution of the i-atoms 
that depends on that of the s-atoms (fig. 13). A qualitative idea of the sign of Au can be 
obtained from the stability of the carbides of the s-atoms but the situation is complicated 
by the effects of substitutional solutes on carbon activity in cementite and in e carbide. 
For example, aluminum decreases the activity coefficient of carbon in ferrite and 
increases solubility; silicon increases the activity coefficient of carbon in ferrite and 
decreases solubility (Leslie and Rauch [1978]). Manganese decreases the activity 
coefficient of carbon in ferrite but also decreases carbon activity in cementite, the result 
being no change in the solubility of carbon in ferrite, at least up to 1% Mn (Saitoh and 
Ushioda [1989]). 



Fig. 12. Section of the Fe-Cr-Ni diagram showing the composition and heat treatment of austenitic stainless 
steel. Below M, the metastable fee alloys become unstable and transform to bcc martensite. 
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a) 

Fig. 13. Schematic drawing of the structure of interstitial (x) plus substitutional (o) solid solutions, in iron (•): 
(a) attraction, (b) repulsion between interstitial and substitutional solute. 

A very large number of carbides can form in Fe-s-C alloys in stable or metastable 
equilibria. In table 5 the crystal structures of carbides of the elements of the first 
transition series are listed. There are the following possibilities for the composition of 
carbides that form in Fe-s-C alloys. 

a) Fe x C y solute-free carbide; 

b) Fe(s) x C y carbide with substitutional composition of the alloy; 

c) [Fe^C y carbide with Fe replaced by a definite amount of s; 

d) s x C y iron-free carbide. 

Many [Fe^JC^-type carbides occur in alloy steels, still more in rapidly solidified 
alloys of iron, in addition to those listed in tables 4 and 5. 

The M 6 C 7j-carbide is cubic with 96 metal and 16 carbon atoms in the elementary 
cell. The metal atoms are iron and group V, VI transition metals. A typical repres- 
entative, [Fe 4 W 2 ]C, forms in high-speed steels. W, Mo and Cr can be substituted in 
small, Mn in large amounts in Fe 3 C. If a higher concentration of group V or VI elements 
is present a mixed carbide of the type Cr 23 C 6 forms (table 5). Fe, Mn, V, Nb, Mo and W 
can replace Cr in this carbide in a wide range of concentrations. In tungsten steels it has 
a composition [Fe 21 W 2 ]C 6 . Another important group of mixed carbides are the ^-phases. 
Their structure can be derived from substitutional compounds with the /3-Mn structure. 
They form in Cr-Mo and Cr-W steels. C dissolves in the /3-Mn structure and stabilizes 
it. A pure ^-phase has, for example, a composition Fe^Cr^W, which is shifted to higher 
W-contents by C additions. 

Metastable carbides that form in ferrite are characterized by greater similarity to 
a-iron as compared to the more stable carbides. This allows them to form semicoherent 
or coherent interfaces with a-Fe. Figure 14a shows the structure which fulfills the 
requirements for full coherency if the particles are small. An ordered arrangement of 
interstitials that only distorts the bcc lattice (a') has been found by Jack [1951] in Fe-N 
alloys. This structure forms during ageing of Fe-N alloys at a low temperature. The iron- 
free carbides TiC, VC and NbC have face-centered-cubic structures with lattice para- 
meters 20-25% greater than those of the austenite from which they precipitate (fig. 14b). 
They can form as ultra-fine dispersions, which raise the yield stress of micro-alloyed 
steel (HSLA-steel, i.e., high-strength low-alloy steel). 
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Table 5 

Composition and melting temperatures [°C] of carbides that form in equilibrium with transition metals 

(after Goldschmidt [1967]). 


TiC 

V 2 C 



(Fe 3 C) 

(Co 3 C) 

(Ni 3 C) 

3140 

2200 

1580 

1010 

*g 

*g 

*g 

ZrC 

Nb 2 C 

Mo 2 C 

Tc 

Ru 

Rh 

Pd 

3550 

3100 

2410 

g? 

g 

g 

g 

HfiC 

Ta,C 

W 2 C 

Re 

Os 

It 

Pt 

3890 

3400 

2800 

g 

g 

g 

0 


TC cubic (B 1) * carbide not stable in liquid state 

T 2 C hep (L'3) 

T 23 C cubic (D8 4 ) 

T 3 C orthorhombic (fig. 4.8, DO,,) 


3. Transformation reactions 

3.1. Pearlite 

The iron-rich side of the Fe-Fe 3 C phase diagram (fig. 8) provides the background for 
a large number of solid-state reactions. In alloy steels, reactions can be controlled by 
substitutional or interstitial diffusion. The microstructure of a steel can only be under- 
stood if all the individual reactions and their mutual effects are known. 

If an alloy of eutectoid composition is cooled below the eutectoid temperature T p the 
pearlite reaction can occur. Nucleation of two new types of crystals inside the austenite 
crystal is unlikely because of the high surface- and strain-energy required. Instead, 
transformation begins at austenite grain boundaries and moves into grains with a reaction 
front that leaves an aggregate of lamellar particles behind. Austenite grain comers are 
favored nucleation sites, followed by grain edges and planar grain boundaries. In 
commercial steels nucleation can occur at inclusions. In hypereutectoid steels cementite 
forms first during transformation of austenite, then pearlite can nucleate at the austenite- 
cementite boundaries. Similarly, in hypoeutectoid steels ferrite forms first and nucleation 
can occur at ferrite-austenite boundaries. 

Each of these so-called pearlite nodules consists of two interlocking single crystals, 
one of cementite, the other of ferrite (Hillert [1962]). Each nodule continues to grow 
(at velocity G) within an austenite grain until it impinges on another in the same grain. The 
pearlite reaction is completed at a time t { that depends upon the austenite grain diameter, d: 



where d/G is the time required for one nodule to occupy one grain. It is now generally considered 
that the rate-controlling step in the growth of pearlite is the rate of diffusion of carbon in the 
pearlite-austenite boundary modified by some volume diffusion in the austenite ahead of the 
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Fig. 14. (a) Structure of the metastable compound Fe I6 N 2 (a'), crosses indicate N atoms, a = 5.72 A, c = 6.29 A 
(after Jack [1951]). (b) Structure of the carbides TiC, VC, NbC and carbonitride as Ti(C,N)- 


moving interface to provide the variation in carbon content of austenite at the interface. 
Recent studies (Zhou and Shiflet [1991]) have shown that pearlite growth is accomplished 
by the lateral migration of steps across the growth front of cementite and ferrite. 

The addition of a third element to the Fe-C system inevitably decreases the rate of 
formation of pearlite. These effects occur in two ways: 

a) elements that segregate to boundaries can reduce the frequency of nucleation of 
ferrite. Boron is spectacularly effective in this regard. 

b) Substitutional solutes can influence both the rate of nucleation and the rate of growth of 
pearlite. They can do this by changing the activities of C in the three phases involved or 
by partitioning between ferrite and cementite at the pearlite-austenite interface. Since the 
latter process requires diffusion of substitutional atoms it is most effective at elevated 
transformation temperatures. Molybdenum is particularly effective in decreasing the rate 
of growth of pearlite nodules whereas cobalt substantially increases the rate. 

When a hypoeutectoid steel is slowly cooled from the austenite region, ferrite forms 
below the upper critical temperature, which is strongly dependent upon alloy content, and 
continues to form until the eutectoid temperature, A cl , is reached. With more rapid cooling 
ferrite can form well below A el . The ferrite takes on four distinct morphologies (Aaronson 
[1962]). Appearing roughly in order of decreasing temperatures, these are termed grain- 
boundary allotriomorphs, Widmannstatten side-plates, intragranular idiomorphs and 
intragranular plates. The Kurdjumov-Sachs relationship between austenite and ferrite: 

[lll]y // [110}a 
<110>y // <11 l>of 

is observed. The Widmannstatten ferrite is often called acicular ferrite. 
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The formation of cementite plates from hypereutectoid steels follows a similar pattern 
but with the Pitsch orientation: 

(100) cem II (554)y 

(0i0) cem // (H0)r 

(001) wm n (225)7 

Figure 15 shows some of the typical microstructures which are observed in Fe-C steels 
of various compositions. 

3.2. Martensite 

If austenite is cooled fast enough so that no diffusion-controlled transformation 
reaction can begin until an extent of under-cooling can be reached at which the fee 
structure becomes highly unstable, it transforms by shear to a distorted bcc lattice, 
supersaturated with carbon, called martensite. The minimum cooling rate required for the 
formation of martensite varies widely depending upon the composition of the steel, which 
in turn controls the temperatures and rates at which non-martensitic transformation 
products form. The temperature at which diffusionless transformation starts is designated 
M s . The M s -temperature is always lower than the temperature T 0 at which a- and -y-solid 
solutions of the same composition are in equilibrium (T 0 , AF oy =0, fig. 9). The M s - 
temperature can be above or below room temperature, and steels can be classified 
accordingly. For austenitic steels M s « room temperature, martensitic steels M s > room 
temperature; both must of course be cooled faster than the critical rate to avoid formation 
of other transformation products. Between T 0 and M, metastable austenite can transform 
partially into martensite under external stress. During the cooling below M s the marten- 
sitic transformation continues until it stops at the martensite finish temperature, M f . 
Frequently, some small amount of austenite is retained. 

Unalloyed iron or substitutional solutions of -y-iron can transform on cooling to a bcc structure. 

As noted before, interstitial atoms occupy octahedral sites in both a and y iron. Only 
one set of such sites, Z, is common to both lattices. Since the carbon atoms have no time 
to move during the shear (displacive) transformation, the number of carbon atoms at 
these Z sites in bcc iron is increased, causing a tetragonal distortion of the bcc lattice. 
The tetragonality is minimal below about 0.20% carbon because of the association of 
carbon atoms with lattice defects produced by the displacive transformation (fig. 16). 

In addition to the shear (Bain strain) that leads to the change in crystal structure, a 
second deformation, a lattice invariant strain, is required to allow the martensite to 
correspond to the original crystal. The amount of shear for the y — » a transformation is 
about 0.12 (Wayman [1964]). The internal plastic deformation takes place by twinning 
or slip (Kelly and Nutting [1961], It leads to martensite crystals that contain dis- 
locations and thin twins. The M s of a steel depends upon a) the equilibrium temperature, 
T 0 and b) the strength of the austenite. Alloying elements affect both factors, as shown 
by one of several empirical equations used to calculate M s : 

M S (°C) = 539 - 423 C - 30.4 Mn - 12.1 Cr - 17.7 Ni - 7.5 Mo 
(Andrews [1965]) where the chemical symbols represent wt% of the alloying elements. 
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Fig. 15. (a) Single-phase microstructures; ferrite. 100 x. (b) Austenite, 1.2 wt% C, 12 wt% Mn, 1100°C water 
quenched. 200 x. (c) Lamellar structure: pearlite, 0,8 wt% C. 2500 x. (d) Ferrite-pearlite structure, 0.4 wt% C. 
500 x. (e) Anisotropic ferrite-pearlite structure (banding in rolled steel, dark areas: pearlite). 100 x. (f) ferrite- 
martensite structure, dual-phase, 0.11 C, 0. 15 Mo, 0.24 P wt%, intercritical annealing: 60 min, 725°C. 150 x. 
(g) Dispersed FejC-particles, 1 wt% C. 1000 x. (h) Hard-facing Fe-base alloy, high volume fraction of NbC and 
M,Cj, M = Cr, Fe, 5.5 C, 30 Cr, 7 Nb wt%. 1000 x. Figs. 15a,c,e,g. J.R. Vilella, U.S. Steel Corp. Fig. 15b, 
h. H. Berns, Ruhr-University. 
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Fig. 16. c/a ratio and M s for body-centered tetragonal Fe-C martensite. 

All elements save Co lower M s . All the others in the equation produce solid solution 
strengthening of austenite and lower M s , regardless of whether they raise or lower T 0 . 
Also, austenite can be stablilized against transformation by work hardening, by grain 
refinement, by deformation twins, and by prior precipitation of coherent second-phase 
particles, in brief, by anything that increases strength. Both nucleation and growth of 
martensite are inhibited by increased strength (Kajiwara [1986], Leslie and Miller 
[1964], Meyer and Hornbogen [1968]). 

The very large effect on M s of the prior precipitation of coherent particles in the 
austenite of an Fe-Ni-Al alloy is shown in fig. 17. The resistance to displacive transfor- 
mation is maximized with maximum precipitation hardening. With over-aging the M s 
rises as A1 and Ni are removed from solution. Figure 18 illustrates the microstructure 
after partial and complete transformation to martensite. 



Fig. 17. Variation of the M s -temperature of an Fe, 27.4 at% Ni, 12.5 at% Al, 0.06 at% C alloy by aging the 
austenite at different temperatures, to form coherent precipitates. 
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Fig. 18. Incomplete (40%) and complete transformation of austenite to martensite of an 0.06 at% C, 27.4 at% 
Ni, 12.5 at% A1 steel (M, = -69°C). The transformation starts at austenite grain boundaries. Austenite grain size 
determines the maximum dimension of martensite crystals. Former austenite grains can be revealed even in the 
completely transformed alloy. Carbide precipitates are visible at the grain-boundary nodes. 


In order to obtain complete transformation to martensite a steel has to be cooled 
considerably below M s (fig. 19). Retained austenite is found in the spaces between the 
martensite. This austenite becomes work-hardened by the martensite crystals as they 
form, and is thus stabilized against transformation. If cooling is interrupted at a temperat- 
ure below M,, then resumed after a period of time, the transformation does not continue 
until an additional undercooling is reached ( stabilization of austenite). This can be due 
to strain-aging of deformed austenite and the thermally activated reduction of transfor- 
mation stresses that would have aided further transformation (e.g., Glover [1956]). 

The size of martensite crystals is limited by the austenite grain size. In steels, small 
martensite crystals are wanted because isotropic properties and small internal stresses are 
wanted. Heat-treatments that lead to grain growth of austenite before cooling below M s 



Fig. 19. Martensite that has formed below M, in a steel with 1.1 at% C and 2.8 wt% Cr after austenitizing at 
1040°C. M, = 230°C, the transformation is almost complete at 20°C. (After Cohen [1949].) 
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are therefore avoided. 

It has become conventional to classify the morphology of martensite as being either 
the lath type or the plate type. The former is typical of low to medium carbon steels. 
When the carbon content is below about 0.20% the structure is often body-centered 
cubic. The lattice invariant shear is mainly accomplished by dislocations. The units of 
martensite are in the shape of laths containing a high density of dislocations. The laths 
are grouped into larger packets. Plate-type martensite is found in alloys with M s 
temperatures below ambient. This structure is composed of individual lenticular plates 
with internal very fine twins. In many instances the structure consists of mixtures of lath 
and plate martensite. The conventional distinction between the two types has been 
questioned by Tan etal. [1992], who found twins in lath martensite in 0.20% C steel by 
careful transmission electron microscopy and demonstrated that it is easy to miss these. 

3.3. Bainite 

Although bainite was detected in a study of the isothermal transformation of austenite 
in 1929 (Davenport and Bain [1930]) the modes of formation of the various admix- 
tures of ferrite and carbides comprising bainite are still matters of controversy; see the 
proceedings of the International Conference on Bainite, 1990. The controversy is hardly 
surprising, for bainite forms in a wide variety of steels, over a wide range of temperat- 
ures during the transformation of austenite isothermally or during continuous cooling and 
the transformation products are finely divided and only resolvable by transmission 
electron microscopy. The disagreements center on the relative extent of transformation 
by shear or by diffusional mechanisms. The answer may lie in the observation by Jung 
et al. [1994] that bainite can nucleate by diffusion and grow by a displacive process. 
Regardless of the outcome, the importance of bainite lies in the desirable combination of 
mechanical properties it imparts to commercial steels transformed isothermally or by 
continuous cooling. 

Conventionally, the morphology of bainite is divided into two types, upper bainite 
and lower bainite. These terms arose from structural differences noted after isothermal 
transformation and may have little relevance to structures developed during continuous 
cooling. Bainite forms isothermally at temperatures above the M s and below that required 
for formation of ferrite and pearlite, or at cooling rates too rapid for nucleation of these 
other transformation products but not rapid enough to form martensite. 

The structure of upper bainite is generally agreed to consist of aggregates of ferrite 
platelets called sheaves with Fe 3 C particles between the sheaves. In lower bainite the 
carbides, probably e, Fe^C, are contained within the ferrite platelets. In hypoeutectoid 
steels the ferrite platelets form first. The controversy arises over whether the iron atoms 
cross the ferrite-austenite interface by shear or by diffusive jumps. The structure of lower 
bainite is shown in figs. 20a and b. 

There is a fourth type of reaction in iron alloys with M s temperatures above 400°C. 
Even at extremely high cooling rates transformation occurs above M s by a mechanism in 
which the rapid motion of the y-a interface produces ferrite with the same composition 
as the austenite. The morphology of the transformation product differs from that of 
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Fig. 20. (a) and (b) Microstructure of lower bainite. 3.23 wt% Cr; 30 min at 350°C. Partially transformed 0.66 
wt% C (Speich [1962]). (a) Optical micrograph, (b) Electron micrograph (replica). 

martensite because thermally activated motion of the interface occurs by individual jumps 
across the reaction front in a direction opposite to that of its motion. The transformed 
structure is almost free of the defects present in martensite. This reaction is termed a 
massive transformation (Gilbert and Owen [1962]) (see also chap. 15). 

3.4. Transformation diagrams and hardenability 

The evolution of microstructures in steels can be understood best by reference to 
isothermal (IT) or to continuous cooling transformation diagrams. These diagrams 
indicate the beginning and end of each reaction in the transformation of austenite at 
constant sub-critical temperatures or the cooling rates at which various reactions begin 
and end. In the making of an IT diagram, small thin specimens of a selected steel are 
heated into the austenite range, quickly transferred to a liquid medium held at a constant 
subcritical temperature, held for a pre-determined period and then quenched into iced 
brine and examined metallographically to determine the extent of phase transformation. 
The process is illustrated schematically in fig. 21. The result of many such deter- 
minations is a diagram such as fig. 22 for a typical high-strength alloy steel, AISI 4340. 
Strictly speaking, each diagram is representative only of reactions in one sample of steel 
having one austenite grain size. Even within one grade of steel variations in chemical 
composition and austenite grain size will result in small changes in the times for 
beginning and ending of transformation. Nevertheless, such diagrams are indispensable 
for the planning of certain heat treatments to obtain desired microstructures and 
properties, as shown in figs. 23 a, b, c, and d. 

The IT diagram of fig. 22 shows two “C-curves”, the upper one for the formation of 
ferrite and pearlite and the lower for formation of bainite, with a deep “bay” between 
them (bainite is not labelled in the figure). Bainite in this steel forms at constant 
temperatures from just above 500°C down to below M s at about 280°C. At temperatures 
between 500° and about 425°C the reaction does not go to completion; much austenite 
remains. The upper and lower critical temperatures and M s are shown. 

In production and heat treating of steels it is often desirable to have some knowledge 
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Fig. 21. Diagram showing how measurements of isothermal transformation are summarized by the isothermal 
transformation diagram. 


of microstructures following continuous cooling from austenite and this information is 
not readily obtained from IT diagrams. For these estimations we turn to CCT diagrams. 
These are developed through a combination of dilatometry and metallography. In 
dilatometry, temperature, time and specimen length are recorded for a variety of cooling 
rates. The initiation and termination of transformation are marked by an increase in 
specimen length and the cessation of growth, respectively. Microstructures are deter- 
mined after quenching. From these data a CCT diagram can be constructed, fig. 24 
(Eldis [1978]), with the linear temperature-log time scale as used in IT diagrams. 
Alternatively, instead of plotting cooling curves, individual bar diameters are plotted on 
the abscissa instead of transformation times. The diagrams indicate the structures that can 
be obtained at the centers of the bars of the appropriate size, fig. 25 (Atkins [1977]). 
Thus a bar 20 mm in diameter, air cooled, will have a structure of approximately 70% 
bainite, 30% martensite. A bar of 50 mm diameter, water cooled, will have a desirable 
structure of about 100% martensite at the center. 

Hardenability is commonly defined as the depth to which steel of a given com- 
position can be quenched to yield a structure of 50% martensite — 50% other transfer- 
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Fig. 22. IT diagram for AISI 4340 steel. 0.42 C, 0.78 Mn, 1.79 Ni, 0.80 Cr, 0.33 Mo. Austenitized at 845°C. 
Grain size ASTM 7-8. A = austenite, C = cementite, F = ferrite. 


mation products. A criterion of 90% martensite also may be used. Hardenability is 
affected by austenite grain size, carbon content and alloy content. It is measured by the 
Jominy end-quench test (Jominy and Boegehold [1938]) or by a more recent test better 
suited for steels of lower hardenability (Grange [1973]). There are various methods of 
predicting hardenability of steels from austenite grain size, carbon content and alloy 
content, reviewed by Doane [1978], The goal of steel producers is to attain the 
maximum hardenability with the minimum use of expensive alloying elements. Steels can 
be ordered within specified limits of hardenability (hardenability bands) and the producer 
can use whatever composition will yield that result. 

3.5. Tempering of martensite 

Ferrous martensites are highly supersaturated solid solutions of C in iron, obtained by 
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Fig, 23. Four illustrations of the use of IT diagrams in planning practical heat-treatments. 


rapid cooling from temperatures at which the alloy is single-phase fee austenite. The 
transformation to bcc or bet iron produces a very high density of defects — dislocations, 
twins, former austenite grain boundaries, vacant lattice sites and any substitutional solute 
atoms initially present. Generally speaking, such structures impart high strength but poor 
toughness. To improve toughness with some reduction in strength the steel is reheated to 
temperatures above ambient. The result is the removal of C from solid solution and the 
precipitation of carbides. The operation is simple in practice but the processes occurring 
are highly complicated. Despite decades of study there is still no complete agreement on 
all the details. These vary depending upon the carbon content and alloy content, which 
determine the M s temperature. If this is above ambient the tempering process (auto- 
tempering) can begin during quenching. In steels with very low M s temperatures, 
subsequently held at temperatures below 100°C, processes termed “aging” can occur. 
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Fig. 24. Schematic illustration of the construction of a CCT diagram. F-ferrite, P-pearlite, B-bainite, 
M-martensite, HV 10-hardness. 


Uwakweh et al. [1991] state that there are two steps of aging, first clustering of C 
atoms, then the ordering of these clusters into a carbide of composition Fe 6 C. In contrast, 
Speich and Taylor [1992], consider reactions during aging to be one process of 
spinodal decomposition. 

There is a continuing tendency towards lower carbon contents in steels, for eliminat- 
ion of quench-cracking, improved weldability and greater toughness. There is general 
agreement on the processes occurring when these lower-carbon steels with high M s 
temperatures are quenched and tempered: 

1 . Segregation of C to dislocations, former austenite 
boundaries and retained austenite 

2. Precipitation of e -carbide, Fe^C, in the temperature range 50-200°C 

3. Between about 200 to 300°C, transformation of retained austenite to ferrite and 
carbide 

4. Solution of e-carbide and precipitation of x carbide and/or Fe 3 C. 

5. Between 400 and 700°C, recovery of the dislocation structure, spheroidization of 
cementite and recrystallization of the ferrite. 

6. If carbide-forming elements are present in substantial amounts alloy carbides such as 
VC, Cr 7 C 3 or Mo 2 C can replace Fe 3 C during tempering at above about 500°C. 

The hardness of a steel after tempering at any given temperature for the customary time 
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C Si Mn P S Cr Mo Ni Al Nb V 


0.36 0.25 0.80 0.020 0.020 1.00 0.20 



Fig. 25. CCT diagram of a Si-Mn-Cr-Mo steel. A = austenite, B =bainite, F = ferrite, M = martensite, 
P = pearlite. 


of one hour can be calculated from a knowledge of the chemical composition (Grange 
et al. [1977]). Alloying elements increase resistance to tempering, the most effective 
being V, P, Mo and Mn. 

When high-strength medium-C steels are tempered between 250 and 400°C an 
annoying phenomenon called tempered martensite embrittlement appears. Although the 
hardness after tempering decreases continuously throughout this range the toughness, as 
measured by Charpy V-notch tests, reaches a minimum, fig. 26 (Briant and Banerji 
[1978]). During austenitizing, S and P atoms segregate to austenite grain boundaries (see 
chapter 13). The segregration remains during the transformation to martensite and during 
tempering, reducing cohesion at the prior austenite boundaries. Cementite precipitates at 
these same boundaries and serves as nucleation sites for cracks during impact loading. 
These cracks propagate along the weakened boundaries, leading to intergranular fracture. 
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Fig. 26. Schematic illustration of tempered martensite embrittlement of a quenched and tempered steel. 


Tempered martensite embrittlement can be prevented in steels of very high hardenability 
and very high strength by the addition of 1 to 2% Si. Silicon inhibits the growth of e 
carbide and cementite (Owen [1954], Keh and Leslie [1963]), reducing their effective- 
ness as crack starters and allowing such steels to be tempered in the range 260-3 15°C, 
where both yield strength and notch toughness reach a maximum. 

4. Deformation and recrystallization 

4.1. Microstructure of deformed steel 

Iron and ferritic steels, in common with other bcc metals, are characterized by a 
pronounced dependence of the yield and flow stresses upon temperature and strain rate 
(Christian [1983]). As shown in fig. 27 the yield stress of polycrystalline iron increases 
sharply at temperatures below about 325 K, -0.18 T m , where T m is the melting point in 
kelvins. Above that temperature a long plateau appears and the yield stress does not 
begin to drop until a temperature of about 800 K, -0.45 T m , is reached. Although it is not 
clearly shown in fig. 27, a concave-downward region appears in the yield stress- 
temperature plot between about 180 and 280 K (Matsui et al. [1978]). Similarly, the 
yield stress increases markedly increasing with strain rate, fig. 28 (Leslie et al. [1969]). 
These are intrinsic characteristics of the bcc metal lattice and are intimately connected 
with the decreasing toughness of ferritic steels with decreasing temperature and/or 
increasing strain rate. The cause of these phenomena is the slight dissociation of screw 
dislocations in bcc metals which renders them relatively immobile (sessile) at low 
temperatures. The dissociation must be constricted by thermal activation before it can 
glide. In contrast, $dge dislocations can glide at any temperature. 

In bcc metals slip can occur on any plane containing a <11 1> slip direction, so-called 
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Fig. 28. The effect of strain rate on the strength of polycrystalline bcc iron at room temperature. 


pencil glide. The result is wavy slip lines on a polished surface of ferrite deformed at 
ambient temperature (fig. 29a). Deformation at temperatures well below ambient leaves 
a uniform distribution of the sessile screw dislocations, the mobile edge components 
having moved out of the crystal. During deformation of ferrite at room temperature, 
dense tangles of dislocations form a cell structure. The spacing between cell walls 
decreases with increasing strain (Keh [1961]). 

In fee metals (austenitic steels) slip occurs by glide of a/2<110> dislocations on 
{ 111 } planes, leading to a total of 12 possible slip systems. The dislocations are confined 
to their slip planes because of low stacking-fault energy, the result being well-defined, 
straight slip lines (fig. 29b). In the presence of a fine dispersion of shearable coherent 
particles slip steps are well defined and widely spaced (fig. 29c). 

Dislocations can be generated at grain boundaries and at the interfaces of inclusions 
and carbide particles. The density of dislocations increases with deformation (fig. 29d), 
Keh and Weissmann [1962]. The curve shown there does not apply when a dispersion 
of any strong second phases is present; in those instances the density of dislocations for 
a given strain, i.e., the rate of work hardening, is greater than indicated by fig. 29d. 

Cementite, whether present in pearlite or as discrete particles, resists deformation, 
with the important exception of wiredrawing. Beginning with a eutectoid carbon content 
in the form of fine pearlite and upper bainite, a true strain of 4 reduces the thickness of 
Fe 3 C lamellae to about 2 nm, or just a few unit cells. 

When stressed, annealed low-carbon steels usually yield discontinuously, i.e., with an 
abrupt drop in stress, followed by yielding at constant stress and then continuous work- 
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Fig. 29. (a) Wavy slip bands in ferrite after 3% strain at 20°C, 200 x. (b) Traces of (111) slip planes in 
austenitic steel after 2% strain, replica, (c) High slip steps in austenitic steels containing y' precipitates after 2% 
strain, (d) Relation between plastic strain and dislocation density in polycrystalline a-Fe. 


hardening. The strain at constant stress is called the yield point elongation or Liiders 
strain. For this to occur, three requirements must be met: 

1. A low density of mobile dislocations present before yielding. (Most or all of the 
dislocations present are normally pinned by interstitial “atmospheres”, see ch. 20). 

2. Rapid generation and multiplication of dislocations. 

3. A low to moderate dependence of dislocation velocity on stress. 

Under these conditions, when dislocations are generated heterogeneously at some spot of 
stress concentration they multiply rapidly by a multiple cross-slip mechanism. Edge 
dislocations in bcc iron move about 20 times faster than screws at a given stress, but the 
motion of screws is required for macroscopic yielding. The flow initiated at the upper 
yield stress coincides with the motion, cross slip and multiplication of screw dislocations. 
From the point of generation of dislocations a zone of high dislocation density (localized 
strain) sweeps through the specimen. This is called a Liiders band. The cross-section of 
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the specimen is reduced, raising the stress and inducing other such bands until uniform 
work hardening begins. 

Deformation of ferritic steels also can proceed by twinning. This is favored by low 
temperatures, high strain rates, large grain size, lack of mobile dislocations and solid 
solution hardening, especially by Si. As compared to deformation by dislocation generat- 
ion and motion it is of minor importance. Mechanical twins can be produced in austenitic 
steels but only at very high strain rates and/or low temperatures (Hornbogen [1964b]). 

4.2. Recovery and recrystallization 

Recrystallization can be defined as crystallographic reorientation by passage of a 
high-angle boundary. Recovery is the name given to all other processes that reduce the 
energy stored by deformation (see ch. 26). Both are thermally activated and both can be 
driven by stored energy of deformation but recrystallization can also be driven by the 
reduction of surface energy, whether the surfaces are external or internal, i.e., grain 
boundaries. Both can be static or dynamic. We will consider static processes first, those 
that occur at constant temperature after cold work. 

There can be several measures of recovery, such as decreases in electrical resistivity, 
decreases in X-ray line broadening, decreases in hardness, decreases in dislocation 
density and rearrangement of dislocations. These all proceed rapidly initially, then reach 
a plateau. Unless recrystallization intervenes they will go to completion in a range of 
temperature from below room temperature to about 500°C, fig. 30 (Michalak and 
Paxton [1961]). The motion of lattice defects can be hindered by factors such as solute 
atoms — especially interstitials — pre-precipitation clusters and nucleation of precipitates 
on such defects, thereby retarding or stopping recovery. 

Recrystallization at constant temperatures after cold working can begin by migration 
of existing high-angle grain boundaries, migration of sub-boundaries and by sub-grain 



Fig. 30. Recovery of a - Fe after 5% strain at 0°C (Michalak and Paxton [1961]). 
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coalescence (Hu [1962]). These processes will begin where energy density is greatest, as 
at grain-boundary edges, inclusions and intersections of mechanical twins. As stated 
previously, recovery and recrystallization are driven by the stored energy of cold work 
so they can be concomitant, competing processes. After slight cold working recovery can 
be complete before recrystallization begins, and the rate of growth of recrystallized grains 
will be constant until the growing grains impinge. After heavy cold work the reduction 
of stored energy by recovery causes the rate of growth of new, recrystallized grains to 
slow markedly with increasing time at constant temperature (fig. 31). This iron was 
highly purified by zone-refining, which greatly enhances the rate of recovery. 

The rates of formation and growth of recrystallized grains can be changed by several 
orders of magnitude by even small additions of second elements in solid solution 
(Venturello etal. [1963], Leslie etal. [1963], fig. 32). However, the inhibition by fine 
precipitates or pre-precipitation clusters is even greater. If ferritic steels are deformed in 
the austenite range, dynamic recrystallization can occur immediately after working, 
before or during cooling. Control of the process by temperature, degree of reduction and 
the presence of fine precipitates is important in the production of high-strength low-alloy 
(HSLA) steels. The fine precipitates of AIN, Nb(C,N), V(C,N) or Ti(C,N) are used to 
refine austenite grain size and hence ferrite grain size; AIN precipitated during annealing 
after cold work provides a desirable texture in sheets for cold forming. Copper precipit- 
ated before cold rolling and annealing can produce a similar effect (Era etal. [1988]). 
MnS, MnSe, AIN or BN can be employed in grain-oriented silicon steels to provide a 
very pronounced cube-on-edge, {110}<001> texture. 



Fig. 31. Time dependence of rate of grain growth in zone-refined iron, cold rolled 60%, annealed at 480°C. 
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Fig. 32. The effect of small Mo additions on the velocity of the recrystallization front of iron at 5% 
recrystallization, 60% deformation, at% Mo as indicated (After Leslie et al. [1963]). 

Second-phase particles have two different effects on the kinetics of recrystallization. 
Particles larger than about 1 fim, such as inclusions or pearlite nodules promote 
nucleation of recrystallization. If particles are fine, <0.1 /tm and closely spaced, dis- 
locations after cold working are uniformly distributed, without regions of high lattice 
curvature required for nucleation and recrystallization is inhibited (fig. 33). The effect of 
second-phase particles of different sizes is further treated in ch. 26. 

5. Mechanical properties 

5.1. Strength of ferrite 

The ideal strength of iron crystals can be measured by straining iron whiskers. 
Depending on crystallographic orientation, the tensile strength lies between 5 and 13 GPa 
and the shear strength between 3.5 and 8 GPa (Brenner [1956]). In contrast, the critical 
resolved shear stress of high-purity iron single crystals at room temperature is about 10 
MPa and of polycrystals about 30 to 40 MPa. The approximate ranges of strength of 
iron-base materials is shown in fig. 34. 

Single-phase ferrite can be strengthened by 

1. grain refinement; 

2. cold work; 

3. solid solution additions; 

4. shear-type transformation; 

5. radiation. 

The relation between the lower yield stress and the grain size in ferrite is customarily 
expressed by the Hall-Petch equation. 

<r LYS =<T o+ kd ^ 
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log h.time for start of recrystallization 

Fig. 33. Recrystallization of a steel with 0.22 wt% C and different distributions of Fe 3 C particles: P: 
Ferrite-Pearlite; M: Martensite tempered at 300, 550, 650°C. (a) 7"-/-diagram for complete recrystallization, (b) 
F-fi-diagram for start of recrystallization. 

where d is the average intercept grain size. The value of the exponent is now generally 
accepted as Although empirical, this equation is highly useful; fig. 35 is a plot 
covering a wide range of grain sizes. 

After small plastic strains the dislocation density in bcc iron is related to the flow 
stress, as shown in fig. 36 (Keh and Weismann [1963]). The relation can be expressed 
by 

a = <t 0 + aGb^N 
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Fig. 36. Relationship between the square root of average dislocation density and flow stress of a -iron (Keh and 
Weissmann [1963]). 


where a is a proportionality constant, G is the shear modulus, b is the Burgers vector, 
and N is the dislocation density. 

Dislocation density at a given strain is greater in fine-grained than in coarse-grained 
iron. 

For uniform plastic strain the true stress-true strain relation is expressed by the 
empirical equation 

a = Ks n 

where a is true flow stress, e is true strain and K and n are constants (Ludwik [1909]). 
K is the true stress at a strain of unity and n, the strain hardening exponent, is the slope 
of a log true stress-log true strain plot. The strain hardening exponent is numerically 
equal to the limit of uniform plastic strain and is an indication of the stretchability of 
low-carbon steel sheets. 

The work-hardening of iron at large strains depends upon the mode of deformation. 
Titanium-gettered iron containing essentially no interstitial solutes can be cold drawn to 
a true strain of at least 10, to a strength of 1650 MPa (Langford et al. [1972]). In 
torsion, work hardening saturates at a true strain of just under 4.8 because of dynamic 
recovery (Young etal. [1974]). High strains are more difficult to obtain by cold rolling 
because of less severe geometrical restraints, but at a true strain of 2.8, work hardening 
was still increasing (Lake [1977]). 
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True solid solution hardening of ferrite refers to the interaction of the strain fields of 
solute atoms with dislocations when the solute atoms are immobile. It is difficult to 
separate dynamic and static effects when interstitial solutes are involved, but tests done 
at 0°C and at a fairly high strain rate indicate that the change in shear stress with change 
in carbon content, dr/dC is about G/2. 

When substitutional solutes are added to bcc iron, the athermal behavior of the yield 
stress, shown in fig. 27, is eliminated. The yield stress is always a function of temperat- 
ure. Even more interesting is the phenomenon of alloy softening. In restricted regimes of 
temperature and composition, solutes, whether substitutional or interstitial, can reduce the 
yield stress of iron, fig. 37. The magnitude of the softening, the yield stress of the alloy 
less the yield stress of the iron, is comparable to the maximum strengthening. Softening 
can be produced in unalloyed iron by knocking iron atoms into interstitial positions by 
high-energy radiation (Satoh etal. [1970]). 

Outside the composition limits in which alloy softening occurs, solutes strengthen bcc 
iron. The atom size misfit is a rough guide to the magnitude of strengthening but it fails 
when the solute atoms are smaller than iron atoms, fig. 38. 

Elements of limited solubility in bcc iron tend to segregate in grain boundaries, fig. 
39, (Hondros and Seah [1977]) with significant effects on mechanical properties, as 
described later (see also ch. 13). Grain boundaries are sources of dislocations and the 
stress required to activate these sources can vary with solute type and concentration at 
the boundaries. As a consequence, k in the Hall-Petch equation, <r LYS =a 0 +kd m , also 


TEMPERATURE, *C 



Fig. 37. Solid solution hardening and softening in iron-base alloys (1.5 at%), e = 2.5 X 10~ 4 s '. 
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Fig. 38. Correlation of solid solution strengthening of iron-base binary alloys at 298 K with size misfit 
parameter, e a . 


varies, C, N and P producing the largest increases. These segregation effects, equilibrium 
and non-equilibrium, further complicate the studies of solid solution hardening. 

Although the process is not technologically important, high-purity bcc iron can be 
hardened by shear transformations induced at ambient temperatures by passage of a 
shock wave generating a pressure greater than about 13 GPa. The reaction involves the 
shear transformation of bcc iron to e at high pressure then the reverse after the shock 
wave passes. The defect structure remaining resembles that of lath type martensite, with- 
out the presence of carbon. The yield strength is increased by about 540 MPa (Leslie 
and Sober [1967]). A martensite-like structure of high hardness can also be generated in 
pure iron by ultrarapid quenching from the melt. The hardness is increased more than 
sevenfold compared to metal quenched in the solid state (Duflos and Cantor [1978]). 

Iron can be hardened and embrittled by bombardment with high-energy neutrons or 
electrons which knock atoms from their normal lattice positions into interstitial sites 
(Klueh [1990]). 

5.2. Properties of structural steels 

We are concerned here with steels that are not heat treated after manufacture. The 
newest of these are interstitial-free (IF) steels (Takechi [1994]). They are made possible 
by commercially practicable steel-making techniques that can reduce C, N, O and S to 
very low concentrations; C contents can be as low as 0.002%. Sufficient Ti or Ti + Nb 
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Fig. 39. Correlation of grain-boundary enrichment ratio fi b with the inverse of solid solubility for several 
systems. 


is added to combine with C, N and S, reducing C in solution to less than 0.0001% after 
annealing. The steels are continuously cast, hot rolled, cold-rolled and continuously 
annealed. 

The impetus for the rapid growth in production of these steels came from the 
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realization that continuously cast Al-killed sheet steel did not have satisfactory deep 
drawing and non-aging properties, especially after hot-dip galvanizing which is now in 
great demand for automobile construction. For good forming properties, high n (in the 
stress/strain relationship a = K^ n ) values and a well-developed { 1 1 1 }<1 10> texture are 
required. The extent of the latter governs the r value, which is defined as the ratio of 
width true strain to thickness true strain in tension tests of sheet specimens. With the 
assumption of constant volume: 



where W, = initial width 
W f = final width 
L = initial length 
L f = final length 

For deep-drawing steels r should be at least 1.5; some IF steels reach values of 2 to 3. 

Interstitial-free steels were developed initially to have a low yield strength and high 
elongation in a tension test and excellent ductility, but it is now obvious that the IF 
technique can be expanded to higher strength steels. P and Mn can be added for solid 
solution strengthening, B can be added to improve grain boundary cohesion and Cr can 
be added to make IF ferritic stainless steel. Many current grades of cold-rolled sheet 
steels may be superseded by these C-free steels. 

Formerly, many low-C sheet steels were not deoxidized in the furnace. As a result, 
C and O in the liquid steel reacted during cooling of the ingot, leaving an outer rim of 
solid steel lower in carbon than the interior. After cold-rolling and annealing the steel 
surface tended to be free of defects. Now the great majority of low-carbon steels are 
continuously cast and the rimming action cannot be tolerated, so the steel is “killed” with 
an addition of 0.03 to 0.04% Al. Any aluminum that does not combine with oxygen is 
free to form AIN, thereby minimizing subsequent strain-aging. Strain-aging is the term 
applied to various changes in properties of a metal that occur by the interaction of 
interstitial solute atoms with dislocations during or after plastic deformation. Such 
interactions appear in many different alloy systems. In steels the significant interstitials 
are C and N atoms. When property changes occur after plastic deformation the process 
is called static strain-aging; when they are concurrent with plastic strain it is called 
dynamic strain-aging. The effects of static strain-aging on the tensile properties of a low- 
carbon steel are shown schematically in fig. 40. When a specimen is strained, allowed to 
stand for a period of time at temperatures as low as ambient then re-strained the 
following effects are noted: 

1. an increase in yield stress, AT 

2. a return of the Liiders strain, e L 

3. an increase in ultimate tensile stress, Af/ 

4. a decrease in total elongation, Ae 

5. embrittlement. 
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Fig. 40. Effects of static aging on the load-elongation curve of iron containing interstitial solutes: Ay = change 
in yield stress due to strain-aging, e, = LUders strain after strain-aging, A U = increase in ultimate tensile strength 
due to strain-aging, A e = decrease in elongation due to strain-aging. 


There is an unfortunate consequence of the return of the Luders strain. If cold-rolled and 
annealed sheets are temper rolled to flatten, then allowed to age, during subsequent 
forming “stretcher strains” appear. These are streaks in which the thickness of the sheet 
is very slightly reduced, a consequence of inhomogeneous yielding, and the appearance 
of the sheet is spoiled. The principal cause of strain aging in C steel is N in solid 
solution in ferrite. It is more effective than C because of its greater solubility. However, 
if aging after plastic strain is done at temperatures of about 100°C or above, C becomes 
effective. The conventional method of eliminating strain aging is to use aluminum-killed 
steels, but this is not completely effective because some C remains in solid solution. To 
bring the interstitial solutes to the necessary concentration of about one part per million 
requires addition of elements such as titanium, as in IF steels. 

Figure 41 shows the effects of dynamic strain aging on the tensile properties of a 
common carbon steel. When the steel is strained at temperatures between 100 and 250°C, 
serrations appear on the stress-strain curves and the rate of work-hardening is greatly 
increased. Dislocations generated during plastic straining are quickly pinned by mobile 
interstitial solutes and more dislocations must be generated before the strain can continue. 
The results are an increased dislocation density for a given strain and a high rate of 
work-hardening. 

Most effects of strain-aging are detrimental but in at least one instance, bake- 
hardening, the process proves useful. As automobile body sheets have been made thinner 
to reduce weight they become more susceptible to denting. To counteract that, strength 
must be increased, but that causes difficulties in forming. An answer is found with steels 
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Fig. 41. Stress-elongation curves of a carbon steel strained in tension, e = 1.75 x KT* s '. 
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containing 0.005 to 0.01% C with A1 plus Nb or Ti, cold-rolled, continuous-annealed at 
temperatures near 900°C, then cooled rapidly, leaving about 0.001% C in solid solution, 
fig. 42 (Satoh et al. [1992]). When the sheet is formed into a fender, painted, then 
baked at 170 to 200°C, strain aging adds an increment of strength of about 50 MPa. 

The latest innovation in the production of low-carbon sheet steels is the continuous 
casting of slabs 50 mm thick. Without being cooled the slab is reheated to rolling 
temperature, then passed through a five-stand hot rolling mill, rolled to desired thickness 
and coiled. 

A process somewhat further removed from commercial production is the continuous 
casting between water-cooled copper rolls of sheet about 2 mm or less in thickness. 
These require only subsequent cold-rolling and annealing (Shang and Wray [1989]). 
Questions arise as to whether current techniques of obtaining desirable microstructures, 
textures and surface quality will be applicable to these new products. 

The properties of low-C steels are determined primarily by ferrite grain size and the 
type and distribution of carbide particles (Rickett and Kristufek [1949]). Ferrite grain 
size is determined by prior austenite grain size which in turn is controlled by the 
deoxidation of liquid steel, or lack thereof. As discussed before, grain growth in austenite 
is inhibited by precipitation of AIN, or less commonly by Nb, V or Ti carbonitrides. 
Grain size is also affected by the degree of cold work and the annealing temperature and 
time. Grain size can be reduced or made more uniform by re-austenitizing at a low 




Fig. 42. Processing of bake-hardening and extra-deep drawing sheet steel (Satoh et a!. [1992]). 
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temperature and air cooling (normalizing). Carbides can take the form of massive 
particles of cementite or volumes of pearlite, depending upon cooling rate from above A, 
temperature. 

As long as ferrite is the continuous phase, the yield strength of the steel is determined 
by the grain size and solute content of the ferrite. Banded ferrite-pearlite structures are 
common in hot-rolled steels, resulting in anisotropy of mechanical properties. The 
banding results from the segregation of Mn to austenite grain boundaries during heating. 
Hot rolling spreads the Mn-rich regions into sheets. During cooling, pearlite nucleates 
preferentially in the Mn-rich portions, giving rise to sheets of pearlite islands. 

As C content increases, pearlite becomes the continuous phase and we must consider 
the properties of pearlite particularly when the structure is entirely pearlite, as in 
eutectoid carbon steels used for rails. The yield strength of pearlite is dependent upon the 
interlamellar spacing; Gensamer etal. [1942] determined a proportionality between yield 
strength and the logarithm of interlamellar spacing. Pearlite lamellae do not have a 
constant true spacing; there is a distribution of spacings about a mean value, A~. A mean 
intercept spacing J is measured by drawing random straight lines on a micrograph. The 
total length of line divided by the number of lamellae intersected is defined as l It can 
be shown that 

1 = 2A 0 

(Ridley [1984]). 

The yielding of pearlitic is presumed to occur by motion of dislocations in the ferrite 
lamellae (Alexander and Bernstein [1989]). The ductility and impact toughness of 
fully pearlitic steels are more dependent upon prior austenite grain size than upon inter- 
lamellar spacing, large grains being detrimental (Lewandowski and Thompson [1986]). 

5.2.1. High-strength low-alloy steels 

The users of steels are greatly interested in reducing costs by eliminating heat 
treatments and purchasing steels that have properties equal to those of heat-treated alloy 
steels. These newer steels should have minimal carbon content to improve toughness and 
weldability and minimal expensive alloy additions. To answer this demand steel 
producers have developed a wide variety of high-strength low-alloy (HSLA) steels. These 
are intended to meet certain mechanical properties rather than specified chemical 
compositions. They have certain features in common, one being refinement of ferrite 
grains. Grain refinement is the only strengthening mechanism that simultaneously 
increases strength and toughness. Strengthening by precipitation of very fine particles 
reduces toughness so it must be offset by grain refinement. 

Precipitation and grain refinement are managed by control of hot-rolling temperatures 
and deformations. Inclusions of MnS are elongated during hot rolling thereby reducing 
toughness in the transverse direction so elements are added to make the inclusions 
resistant to hot deformation. This is termed inclusion shape control. To control the 
structure formed during cooling from hot-forming processes, (equiaxed ferrite-pearlite, 
or acicular ferrite, sometimes called low-carbon bainite), alloying elements are added to 
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lower the transformation temperature and/or the cooling rate from the hot-rolling 
temperature can be accelerated. 

The fine precipitates used to strengthen HSLA steels and to control grain size are 
carbo-nitrides of Ti, Nb and V. Titanium has the advantage of also serving to change 
elongated sulfide inclusions into discrete particles. Additions of calcium to the liquid 
steel before casting also serve to control inclusion shape. The carbo-nitrides noted are 
chosen because of their very low solubility in austenite, Ti(C,N) being the least soluble 
and V(C,N) the most. By precipitating as fine dispersions in austenite they inhibit 
austenite recrystallization and grain growth. They are much less soluble in ferrite than in 
austenite; in consequence, any C, Nb, or V remaining in solution in austenite tend to 
precipitate at successive positions of the intermittently moving y-a boundaries leaving 
behind in the ferrite planar arrays of fine, closely spaced particles only resolvable by 
transmission electron microscopy. This is termed interphase precipitation. These 
microalloy additions are often combined, Ti or Nb for grain refinement, V for precipi- 
tation in ferrite. 

Copper occupies a special place in HSLA steels. When added in concentrations 
between 0.5 and 1.0% it can remain in solution even during slow cooling from the 
austenite range. An object can be formed when strength is low, then aged by holding in 
the range 500 to 600°C. Coherent bcc particles form, containing iron as well as copper. 
The maximum strengthening, fig. 43, is attained just before the particles transform in situ 
to fee copper (Hornbogen [1964], Hornbogen etal. [1966], Goodman etal. [1973]). 
Copper, in combination with phosphorus, is used in the so-called weathering steels that 
are resistant to atmospheric corrosion. 

The most common types of HSLA steel have a ferrite-pearlite structure after hot 
rolling. They can have yield strengths up to 700 MPa, as compared to 200 MPa in plain 
carbon steels with the same type of microstructure. The C content of most HSLA steels 
is less than 0.20%. 

To obtain an acicular ferrite structure after hot-rolling additions of B, Mo or Mn are 
used to decrease the transformation temperature. For improved toughness and weldability 
at high-strength levels, C must be kept below about 0.08%. 

Not all HSLA steels are shaped in the mills. With microalloying additions, medium 
carbon steels can; be hot forged and with controlled cooling can eliminate the need for 
quenching, tempering, straightening and stress relieving. 

Dual-phase steels are a special category of HSLA steels. Their microstructure consists 
of a ferrite matrix with 10 to 20% martensite (or lower bainite) islands. The virtue of this 
structure lies in the fact that the formation of martensite produces stresses around each 
island. Yielding then occurs at low applied stress and the yielding is continuous, but 
because of the presence of the martensite islands, work-hardening is rapid. These 
features, together with the soft, low-carbon, <0.10%, ferrite provide good formability and 
strength near 600 MPa after 10% strain. Dual-phase steels generally contain about 1.5% 
Mn and 0.50% Si. After hot rolling or cold rolling the steels are intercritically annealed 
(heated into the a + y region). The amount of austenite formed varies with the initial 
carbon content and the annealing temperature. During rapid cooling the austenite pools 
transform to martbnsite, with perhaps some retained austenite. The intercritical annealing 
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Fig. 43. Precipitation hardening of Fe-0.91% Cu alloy, quenched from 840°C and aged at temperatures 
indicated (Hornbogen etal. [1966]). 

is done in continuous annealing lines, if they are designed for sufficiently high temperat- 
ures (see, for instance, Ohashi [1988]). During forming operations the ductility can be 
enhanced by strain-induced transformation of retained austenite. Plastic deformation takes 
place almost exclusively in the ferrite component of the microstructure. Additions of 
strength can be gained by strain aging during paint baking. The principal market is the 
automobile industry. 

5.3. Strength of martensite 

The effect of carbon content on the hardness of untempered martensite is shown in 
fig. 44 (Krauss [1978]). Yield strength is a linear function of carbon content up to about 
0.40% (fig. 45). The yield strength of shock-transformed Ti-gettered iron containing 
virtually no carbon in solid solution establishes the zero point. The departure from 
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WEIGHT PERCENT CARBON 

Fig. 44. Hardness of martensite as a function of C content in Fe-C alloys and steels (Krauss [1978]). 


linearity above 0.40% C shown in fig. 44 is caused by the retention of austenite. As this 
phase increases in amount the hardness reaches a maximum at about 0.80% C, then 
decreases rapidly. The simple relation shown in fig. 45 results from a combination of 
several factors. The relative importance of these factors remains a matter of dispute 
(Leslie and Sober [1967], Owen [1992]). 

Current interest centers on low-carbon martensites because of concern with toughness 
and weldability. In martensite with carbon contents of about 0.15% or less the defect 
structure, including prior austenite grain boundaries, provides the largest contribution to 
strength. At or above carbon contents of 0.30% rearrangement of carbon atoms and 
dislocations during the quench can dominate the strength, unless the M s temperature is 
below about -60°C, when true solid-solution hardening by carbon dominates (Winchell 
and Cohen [1962]). As a further complication it should be noted that iron-carbon 
martensites do not have a unique strength, the yield and flow stresses are greater in 
compression than in tension, by about 15% (Rauch and Leslie [1972]). 
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Fig. 45. The effect of carbon content on the yield strength of untempered martensites. 



5.4. Strength and ductility of tempered martensite 

Tempering is the process of heating martensitic steels to obtain a desired combination 
of strength and ductility (§ 3.5). High strength is easily obtained; to achieve a combina- 
tion of strength, sufficient ductility to resist fracture at points of stress concentration or 
during impact loading, retention of ductility at low temperatures, weldability and low cost 
is a major challenge for designers of steels. For a given carbon or low-alloy steel the 
hardness (strength) can be calculated by the method of Grange etal. [1977], Figure 46 
is a plot of the hardness of iron-carbon alloys, quenched to martensite, then tempered for 
one hour at the temperatures indicated. Additions of Mn, P, Si, Ni, Cr, Mo or V do not 
affect the hardness of as-quenched martensite but each of them retards softening during 
tempering. To estimate the hardness of a given steel the hardness associated with a given 
carbon content and tempering temperature is read from fig. 46. To this is added an 
increment of hardness attributed to each of the alloying elements present, taken from 
plots provided. The temperature range in which tempered martensite embrittlement occurs 
must be kept in mind. 

Within the range of austenite grain sizes normally present in commercial steels, i.e., 
ASTM grain sizes 5 to 10, there is little effect of grain size on yield strength after 
tempering. However, if the grain size is reduced beyond that range there is a significant 
gain in both yield strength and toughness. Grange [1966] attained ultra-fine austenite 
grains by four cycles of brief austenitizing at low temperature, quenching, then finally 
tempering at 205°C. Results are shown in fig. 47 for three kinds of steel. They can be 
represented by a Hall-Petch type equation 
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Fig. 46. Hardness of tempered martensite in Fe-C alloys (Grange et al. [1977]). 


^ = cr 0 + ttf* (7) 

where cr 0 is the intercept at zero on the abscissa and d is grain diameter. The increase in 
yield strength was not accompanied by a decrease in ductility. Later, Tomita and Oka- 
bayashi [1985] followed this procedure but after the last austenitizing step they transformed 
isothermally at 320°C to form 15 to 25% bainite, then quenched to martensite and tempered 
at 200°C. The result was an increase in the yield strength found by Grange in a similar 
steel, with no loss in ductility. The bainite is constrained by the enveloping martensite and 
presents another set of boundaries to stop the growth of martensite crystals. 

In instances when the steel contains elements such as V, Mo, and W, the results of 
tempering can be quite different. When present in concentrations between 1% and 18%, 
together with 0.80 to 1.50% C, these elements form the basis of high-speed tool steels. 
These must be austenitized at a high temperature, sufficient to dissolve most of the stable 
carbides of V, Mo and W. The undissolved carbides provide wear resistance to the final 
product. After quenching, the steels are tempered between 400 and 500°C to precipitate 
fine carbides. As shown in fig. 48, strength is preserved at these temperatures, which can 
be encountered during high-speed machining of steels. 
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Fig. 47. Effect of prior austenite grain size on the yield strength of martensite in three commercial steels 
tempered at 205°C (Grange [1966]). 



Fig. 48. Strengths of a high-carbon steel and a high-speed tool steel (table 2, no. 6) after tempering, illustrating 
secondary hardening of the tool steel. 
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5.5. Ultra-high-strength steels 

5.5.1. Maraging steels 

According to the definition of steel in the first sentence of this chapter, maraging 
steels are not steels at all, for they contain no deliberately added carbon; carbon is an 
undesirable impurity. They are iron-nickel alloys, commonly containing 18 wt% Ni. They 
also contain 3 to 5 wt% Mo, 0.2 to 1.8 wt% Ti and 0. 1 to 0.2 wt% Al. Some contain Co, 
others have none. The heat-treatments to develop strength are simple. The alloys are hot- 
worked at above about 920°C, austenitized at about 850°C and cooled. The M s temper- 
ature is between 200 and 300°C and the alloy is completely martensitic before reaching 
room temperature. The structure is iron-nickel bcc lath-type martensite, which unlike 
Fe-C martensite is soft, ductile and machinable. However, the rate of work hardening is 
low since there are no interstitial atoms available to pin dislocations. The uniform 
elongation in tension is only about 1 to 3%. The function of the martensite is to provide 
a high density of nucleation sites for the precipitation of intermetallic compounds and 
probably to provide paths for accelerated diffusion of substitutional solutes. The alloys 
are aged at about 480°C for three hours. 

The result is the precipitation of intermetallics of the type Ni 3 X, where X may be Ti, 
Mo or V. These particles are of the order of 10 nm in size with a number density of 
10 23 itT 3 (Sha et al. [1993]). Titanium is the element that precipitates most rapidly and 
is the one used to control strength, fig. 49 (Speich and Floreen [1963]). When present, 
Co raises the activity of Mo, thus hastening its precipitation as Fe^o,; or Fe 2 Mo. 
Overaging results in reversion to the stable austenite + ferrite, agglomeration of particles 
and softening (Meyer and Hornbogen [1967]). 

Maraging steels can be provided with yield strengths ranging from 1400 to 2400 
MPa. The most remarkable feature of these alloys is their combination of high yield 
stress with high fracture toughness over a wide range of temperatures, fig. 50 (Bruch 



Fig. 49. Effect of Ti content on hardness of an Fe-20% Ni martensite after aging at 500°C (Speich and 
Floreen [1963]). 


References: p. 1618. 



1608 


W.C. Leslie and E. Hornbogen 


Ch. 17, §5 



500 1000 1500 2000 


Yield stress, o" y MPa 

Fig. 50. Fracture toughness of an 1 8 Ni, 8 Co, 5 Mo, 0. 1 C maraging steel as a function of test temperature and 
yield stress (Bruch and Hornbogen [1978]). 

and Hornbogen [1978]). Fracture toughness is not greatly sensitive to the aging 
treatment so at maximum yield stress the alloy is still tough. 

In a drastic modification of maraging steels, the nickel content is reduced from 18% 
to 11%, molybdenum is reduced to 1.2%, 3% Cr is added and Co is increased to 13.4% 
and about 0.25% C is added. The presence of Co increases the activity of C and raises 
the temperature. The alloy is austenitized at 885°C, leaving some undissolved 
carbides. After air cooling the alloy is refrigerated at -73°C to transform retained 
austenite and finally aged at 480°C. Precipitates formed during aging appear to be M 2 C 
carbides. Some martensite reverts to austenite in the form of thin films. The yield strength can 
be 1650 MPa and the plane strain fracture toughness 130 MPay^n • Charpy V-notch 
energy at -100°C can be about 32 J (Ayer and Machmeier [1993]). 

5.5.2. Modified standard steels 

An arbitrary definition of an ultra-high strength steel is any steel having a yield 
strength greater than 1380 MPa. The standard against which modifications are judged is 
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AISI 4340 and equivalent medium-carbon Ni, Cr, Mo steels produced in other nations. 
These can be oil quenched and tempered to any desired hardness. Properties can be 
improved by vacuum arc or electroslag remelting to lower the H, N and O contents and 
to reduce the number of non-metallic inclusions. 

An improvement is obtained by increasing the Si content of 4340 from 0.15-0.30% 
to 1.45-1.80% to take advantage of the well-known effect of Si in inhibiting the growth 
of carbide particles. This steel, called 300 M, can be tempered at a higher temperature 
than 4340 to develop the same hardness, thereby reducing quenching stresses. The 
addition of Si also moves the temper embrittlement range to higher temperatures. 
Tempering at 315°C produces a maximum in notch impact toughness and near a 
maximum yield strength. The additional Si also increases hardenability and adds a 
component of solid solution strengthening. A small addition of vanadium may provide 
fine V(C,N) particles during tempering. 

Crucible Hy-Tuf is a modification of AISI 4130 with increased Mn and Si and an ad- 
dition of Ni. In Ladish D6AC the Cr and Mo contents are increased over those in 4340. 

5.5.3. Thermomechanically-treated steels 

Ultra-high-strength steels can be produced by thermomechanical treatments. For our 
purposes we can divide such treatments into three groups: 

1. Supercritical TMT (above Ae 3 ) 

2. Intercritical TMT (between Ae 3 and Ae 3 ) 

3. Subcritical TMT (below Ae,) 

Considering the first group, if austenite is heavily deformed without recrystallizing it 
takes up a very sharp texture, {124} <112>. During transformation the martensite also 
takes up a sharp texture, (11 2)+( 111) which is retained during tempering (Hu et al. 
[1976]). The texture provides a high modulus of elasticity in a direction normal to the 
surface and an increased yield strength in compression in the through-thickness direction, 
thereby improving the ballistic properties (Ghosh and Paton [1978]). 

Forming in the ferrite-plus-austenite range, followed by quenching before recrystal- 
lization, yields a structure of martensite fibers in ferrite, with very high toughness in a 
direction normal to the fibers (Grange [1970]). 

Deformation of austenite below A e , is the process called ausforming. It requires a 
steel having a large bay between pearlite and bainite in an isothermal transformation dia- 
gram. The steel is cooled — avoiding pearlite — heavily worked, then cooled below M s . 
Yield strengths of about 2600 MPa can be developed. The strengthening is attributed to 
refinement of martensite platelets formed in flattened austenite grains, strain-induced pre- 
cipitation of carbides in austenite and development of a strong texture in the martensite. 

In the third category of thermomechanical treatments, a steel can be deformed as it 
is transforming to pearlite or bainite. 

As an example of the strength that can be developed in thermomechanically-treated 
steels, consider the following: a steel containing 0.6% C and 5% Ni was ausformed, 
quenched, tempered, strained 3% then aged at 120°C, a combination of ausforming and 
strain aging. Yield strength was 3590 MPa, tensile strength 3630 MPa (Matas et al. 
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[1964]) This compares with attainable tensile strengths in cold-drawn eutectoid carbon 
steel wire of 4800 MPa. 

With the exception of super-critical treatment (controlled hot-rolling) these thermo- 
mechanical processes have not been applied in industry for the following reasons: 

1 . Large reductions in cross section must be made, so large sections cannot be treated. 

2. At the temperatures involved, loads on equipment are severe. 

3. The steel must be used in the simple shape in which it is formed. 

4. Joining is difficult; welding is impossible. 

5.6. Tool steels 

Tool steels are used for the shaping of other metals or for the cutting of other hard 
or abrasive materials. There are seven categories of tool steels but lack of space forbids 
discussion of more than one, the most complex and important, high-speed steels. These 
are of two types, containing Mo or W. A typical composition of a W type is given in 
table 2. 

The heat treatment of high-speed steels consists of austenitizing at a very high 
temperature, 1250 to 1300°C to dissolve M 23 C 6 carbides but leaving about 10% of the 
volume consisting of MC and M 6 C carbides. Hardenability is very high with a deep bay 
in the IT diagram. Transformation is accomplished by quenching to about 500°C to 
relieve thermal stresses, then air cooling. The structure is composed of martensite, about 
15 to 25% retained austenite and about 10 to 20% undissolved carbides. The heat 
treatment is distinguished by the necessity for multiple tempering to transform the 
retained austenite. The first temper at 550 to 595°C “conditions” the austenite (presum- 
ably austenite loses carbon to martensite in which carbides are precipitating). The second 
(or successive) temper at the same temperature transforms the austenite and completes 
the secondary hardening process in which the precipitation of carbides raises the hardness 
to 63 to 65 Rockwell C, equivalent to the hardness after quenching, with greater 
ductility. The volume of carbides in the structure can be as great as 40%. These high- 
speed steels can be used at temperatures of 500°C or more. 

5.7. Austenitic steels 

Some alloying elements, principally Ni and Mn, expand the y loop in the Fe-C 
system and other alloying elements can retard the y -a transformation so that the fee 
structure can be metastable to ambient temperatures and below. Therefore, we must 
consider the differences between the mechanical properties of fee and bcc iron. These 
differences are inherent in the crystal structures. Diffusivities of solutes are lower in fee 
than in bcc at the same temperatures; the solubility of interstitial solutes is greater in fee 
than in bcc. The stacking fault energy in bcc is very high, in fee it is much lower but 
varies with solute additions. 

In fee iron alloys dislocations tend to dissociate readily and cross slip is difficult. 
Dislocations are largely restricted to {111} slip planes, leading to planar arrays, narrow 
slip lines (fig. 29), a high rate of work hardening and large uniform elongations in tensile 
tests. There is only a small dependence of strength on temperature or strain rate. In 
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contrast, in bcc iron cross-slip is easy; slip occurs on any plane that contains the <11 1> 
slip direction, leading to cellular arrays of dislocations. Macroscopic deformation depends 
upon motion of screw dislocations which are slightly dissociated and immobile and must 
be constricted by thermal activation. The result is a pronounced dependence of strength 
on temperature and strain rate and a tendency toward brittleness at low temperatures 
and/or high strain rates. 

Because of the relatively high solubility of interstitial elements in fee iron, strain 
aging is infrequent. A notable exception occurs in Hadfield manganese steel, our oldest 
alloy steel. This contains 10 to 14 wt% Mn and 1 to 1.4 wt% C. As a result, the fee 
structure is stable at ambient temperatures. The rate of work-hardening in gouging wear 
is exceptionally high. Early on this was attributed to a strain-induced transformation to 
martensite but when the carbon content is maintained at the proper level this does not 
occur. Later, the high work-hardening and wear resistance were attributed to profuse 
mechanical twinning but when twinning is eliminated by a small addition of an element 
that increases the stacking fault energy, the high work-hardening remains (Zuidema et al. 
[1987]). Because of the high concentrations of Mn and C and their mutual attraction in 
solution, it was concluded that dynamic strain-aging brought about by the reorientation 
of C members of Mn-C couples in the cores of dislocations is the principal cause of the 
rapid work-hardening (Dastur and Leslie [1981]). 

The most important class of austenitic steels is that of the stainless steels containing 
17 to 25% Cr and 8-35% Ni. The Cr content is the basis of all stainless steels, whether 
ferritic, martensitic, austenitic or duplex. It provides corrosion resistance. Nickel, as 
stated previously, expands the y loop so that the steels remain austenitic at low temperat- 
ures. They have excellent cryogenic properties as a consequence of their fee structure. 
They can be hardened by cold work, by elements in solid solution, especially N, or by 
precipitation strengthening. Murata etal. [1993] have reviewed these mechanisms. The 
solubility of C in the standard 18-8 stainless steel declines sharply with decreasing 
temperature; at 600°C it is less than 0.01%. Since the diffusion coefficient of interstitial 
atoms in austenite is much lower than in ferrite precipitation of carbides is delayed. 
However, if heated as during welding, Cr 23 C 6 carbides can form at grain boundaries, 
thereby causing a local depletion of Cr and providing paths for corrosive attack. This 
problem can be alleviated by reducing the C content to less than 0.03% or by adding Ti 
or Nb in sufficient quantity to combine with C and precipitate as stable carbides. 

5.8. Steels for low-temperature applications 

Austenitic Ni-Cr steels would be ideal for low-temperature service, except that cost 
is prohibitive and the strength is often inadequate. Therefore, ferritic steels must be 
modified in composition and microstructure to counteract their inherent tendency to 
fracture in a brittle manner at low temperature. In brittle fracture plastic strain and energy 
absorbed are negligible. Cracks propagate along grain boundaries or along [100] planes of 
ferrite. A steel to be used at low temperature must fulfill the requirement that its ductile-to- 
brittle transition must be below the temperature of use. Questions still remain regarding the 
relation between methods of determining this transition temperature and behavior in service. 


References: p. 1618. 



1612 


W.C. Leslie and E. Hornbogen 


Ch. 17, § 5 


In designing ferritic steels for use at low temperatures certain rules must be followed; 
grain size must be minimized, C content must be kept below 0.2% and the lower the 
better, Ni must be added and strength should be obtained by a structure of acicular ferrite 
or low-carbon bainite. 

When Ni is added in amounts of 5 or 9% the notch toughness of ferritic steel at low 
temperatures can rival that of austenitic steels, at a much lower cost, fig. 5 1 . It has been 
known for some time that the elements Pt, Ni, Ru, Rh, Ir and Re in solid solution both 
strengthen and toughen bcc Fe (Floreen and Hayden [1967]). All other additions 
decrease toughness. The toughening effect is on the bcc lattice, not on interactions with 
interstitials. There is no theory pertaining to this toughening effect, but we can speculate 
that these metals increase cross slip of screw dislocations by reducing their dissociation 
and regret that Pt is not more abundant. 

5.9. Segregation of solutes and steel purity 

Segregation of solutes to grain boundaries in steel is important because it is closely 
connected to decreasing ductility. Tempered martensite embrittlement is described in 
section 3.5. It occurs when as-quenched martensite is tempered between 250 and 400°C. 
In contrast, temper embrittlement occurs in steels that are quenched, tempered at a high 
temperature, 600 to 700°C, then placed in service at temperatures from 350 to 575°C. 
After slow cooling, as in the center of heavy sections, or after long exposure to those 
temperatures in service, the steel becomes embrittled. This is of particular concern in the 
generation of electricity because of embrittlement of steam turbine rotors. 

As indicated in fig. 39, the grain boundary segregation of a solute is inversely 
proportional to its bulk solubility. However, the phenomenon of co-segregation also must 
be considered (Guttmann etal. [1982]). When two solutes in Fe are mutually attracted, 
as are Mn and S or Ni and Sb, they will tend to segregate together at equilibrium. A 
third factor is site competition at boundaries. Carbon and P, or C and S, or B and P 
compete for these sites. Carbon and B have the desirable characteristic of enhancing 
grain boundary cohesion. Because of co-segregation between (Mn + Si) and (P + Sn + 



Fig. 51. Low-temperature toughness of three Ni steels, with varied microstructures. 
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As + Sb) it is desirable to reduce the concentrations of these elements to the lowest 
possible level. 


The culmination of 70 years of reserach was the casting of 120 ton ingots of 3.5% Ni 
Cr Mo V steel for turbine rotor forgings (Jaffee etal. [1989]). The chemical composition 
in weight % was 


c 

1 

Mn 

Si 

S 

Ni 

Cr 

Mo 

V 

P 

Al 

Sn 

As 

Sb 

0.25 

0.02 

0.03 

0.001 

3.65 

1.75 

0.42 

0.13 

0.002 

<0.005, 

0.003 

0.003 

0.0012 


This super-clean steel was made by electric-arc furnace melting under oxidizing 
conditions, ladle furnace refining, vacuum carbon deoxidation with argon bubbling and 
casting with vacuum stream degassing. It is free of temper embrittlement over the entire 
temperature range and safe operation at 480 to 500°C is possible, thereby increasing the 
efficiency of power generation. 

Removal of P and S from austenitic stainless steels can improve their resistance to 
intergranular corrosion and intergranular stress corrosion cracking (Briant and Andresen 
[1988]). This topic is also treated in ch. 13. 

6. Other physical properties 

6.1. Steels for nuclear applications 

Steels used in the construction of nuclear reactors, for example as canning materials, 
must not contain elements with isotopes that have a high absorption cross-section for 
neutrons. B, Co, Ta, Nb, Ti, Al, N are such elements that are normally contained in 
steels. 

The opposite is true for steels that are used as absorber materials or for shielding. 
Among alloying elements B has the best ability to absorb neutrons over a wide energy 
range. The macroscopic neutron-absorption coefficient depends on the B concentration 
in the steel. Because B is hardly soluble in a-Fe it is either present segregated or as 
particles (Fe 2 B). Above 2 wt% B ferritic steels become so brittle that they cannot be 
worked (grain-boundary brittleness). Therefore austenitic steels are often used that 
contain up to 5 wt% B as a dispersion of (Fe,Cr) 2 B. The absorption coefficient of a steel 
is structure-sensitive, because self-shielding leads to a decrease of the macroscopic 
absorption coefficient above a certain particle size. 

Irradiation with neutrons produces a variety of effects on a very fine scale. In the 
C-Mn-Mo low-alloy plate steels used in the containment vessels of nuclear fission 
reactors the flux of neutrons can produce embrittlement (Gelles [1990]). The cause is 
nanoscale precipitation processes that would not be expected to occur at the temperature 
of use. Copper is present as a residual element in low-alloy steels, normally in con- 
centrations of about 0.15% or less. Although solubility at temperatures below 500°C is 
very low the diffusivity of copper at those temperatures is so slight that precipitation 
does not occur. However, under irradiation the diffusivity is enhanced and the irradiation 
kinetics at 300°C is about equal to thermal kinetics at 500°C. As a result, e Cu-enriched 
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precipitate particles about 5 nm or less in size have formed at 288°C. These particles 
strengthen the steel and raise the ductile-brittle transformation temperature (Simonen 
[ 1993]). Steels for containment vessels are now required to have extra low Cu contents. 

Another unexpected result of radiation is Cr depletion at grain boundaries in 
austenitic stainless steels, not connected with precipitation of Cr carbides. The depletion 
amounts of about 5% Cr over a width of about 5 nm. The depletion correlates with 
increased intergranular stress-corrosion cracking. 

A major problem with austenitic stainless steels under neutron irradiation is develop- 
ment of cavities and volumetric expansion, called swelling. Ferritic/martensitic steels are 
less susceptible and can replace the stainless steels. 

Steels used in fusion reactors have problems in addition to swelling. They are exposed to very 
high energy neutrons. The alloying elements Cu, Ni, Mo, Nb and N are converted to radioactive 
isotopes with long half lives, intensifying the problem of waste disposal. New alloys, called 
low activation materials have been developed to replace the standard stainless steels with Mn 
substituting for Ni. Martensitic Cr steels with W added for strengthening appear to have 
promise. This issues are treated in great detail in a recent monograph (Frost [1994]). 

6.2. Steels for electrical applications 

Steels for electrical and magnetic applications are sold on the basis of core loss, 
which is the amount of electrical energy dissipated as heat when the steel is magnetized 
by an alternating current. The reduction of core loss leads to greater efficiency in the 
generation, transmission and use of electrical power. Silicon is the most effective element 
for increasing the electrical resistivity of Fe which in turn reduces eddy current losses in 
alternating current magnetization. 

There are two types of Si steels, non-oriented and oriented. Non-oriented grades are 
low-C alloys containing 0.8 to 3.5% Si plus Al. They are used in less critical applications 
such as small motors and radio or television transformers. They are hot-rolled, cold- 
rolled, annealed and decarburized to develop large, clean ferrite grains with no particular 
effort made to develop a favorable texture except perhaps by the addition of a solute 
such as Sb which can inhibit growth of certain grains during recrystallization. 

The easy direction of magnetization in bcc metals is <100>; to take advantage of this, 
grain oriented steels are made with <100> in the rolling direction of the sheet and [110] 
planes in the plane of the sheet. When transformer cores are made, the rolling direction 
is put in the main flux path. Grain oriented silicon steels are the most sophisticated mass 
products of the steel industry. They are essentially high-purity Fe-Si alloys with a 
constant Si content of about 3.25%. Higher Si contents are not used because of difficul- 
ties in cold rolling. Production is begun by reheating slabs to the unusually high 
temperatures of 1315-1375°C, the purposes being to dissolve various compounds (MnS, 
AIN, BN, TiC, MnSe, etc.) that are subsequently precipitated to serve as grain growth 
inhibitors and also to ensure that hot rolling is completed in the ferrite region. After cold 
rolling the steel is annealed to recrystallize and to remove C then annealed at about 
1150°C in dry H 2 . The purposes of the second anneal are to develop the desired 
<100>[ 1 10} texture by secondary recrystallization in which most primary recrystallized 
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grains are pinned by the inhibitors so that the favorably oriented grains grow at their 
expense and to remove the S, N, C or Se in the inhibitors by reaction with H 2 . Finally, 
the magnetic flux density and core loss can be improved by coatings that provide a smooth 
surface and a residual tensile stress to the very thin steel sheet (Inokuti el al. [1993]). Si 
steels for transformer laminations are also treated in ch. 28, § 4.4.2 and ch. 29, § 4.2. 

7. Solidification 

7.1. Rimming steel, killed steel 

The solubility of oxygen in liquid steel decreases rapidly with decreasing temperature 
(fig. 52): Fe, + 0— > FeO. The product FeO reacts with C to form CO which evolves as 
bubbles in the liquid. When poured into an ingot mold this evolution of gas causes the 
steel to rise in the mold. The boiling action can be stopped by putting a steel cap on top 
of the mold — thus the term capped steel. This procedure is often used for steels 
containing 0.15% C or more. If the evolution of gas, the “rimming action” is allowed to 
continue, the formation of honeycomb blowholes in the solidifying steel is sufficient to 
offset the shrinkage that occurs during solidification. The amount of gas evolved must be 
controlled so that a skin of solid metal forms close to the mold wall. This metal that 
solidifies first has about the same composition as the steel being poured but as solidif- 
ication continues the remaining liquid becomes enriched in solute elements, especially S, 
P, C, Si and Mn. Rimmed steel has certain advantages for the manufacture of sheet 
products; the rim zone contains few non-metallic inclusions leading to high surface 
quality and ingot-to-product yield is higher than for killed steel. 

The term killed steel is applied to steels that have been deoxidized by addition of Al 
or Al + Si to the ladle of liquid steel before the ingots are cast. Because there is little or 
no oxygen free to combine with C to form bubbles a shrinkage cavity called “pipe” 
forms at top of the ingot which must be cropped before hot rolling, thus the ingot-to- 
product yield is reduced. Also, there is a greater number of non-metallic inclusions 
present in killed steel than in rimmed steel but less segregation of solutes. In rimmed 
steel the turbulence caused by gas evolution promotes segregation, so alloy steels are 
killed to obtain greater homogeneity. 

With the increased dominance of continuous casting in the production of steels and 
especially for sheet steels, rimmed and capped steels may soon fade into history. Gas 
evolution cannot be tolerated in continuous casting, so vacuum carbon deoxidation or 
deoxidation with a minimal addition of Al are employed. 

Vacuum carbon deoxidation is necessary for the production of ultra-low-carbon steels 
and for the removal of H 2 . Carbon is removed as CO. The H 2 content of the liquid steel 
is proportional to the square root of the partial pressure of H 2 in the vacuum chamber, 

%[H] = kJP~ 

The removal of hydrogen prevents the formation of internal cracks, called flakes, that 
occur during cooling from the forging or hot rolling temperature. 
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Fig. 52. Partial Fe-0 phase diagram. 


7.2. Cast irons 

Cast irons are alloys of Fe with more than 2 wt% and usually 2.6 to 3.6 wt% C. By 
varying the composition, cooling rate and minor additions to the melt, the solidification 
can follow either the thermodynamically stable eutectic reaction liquid — » y + graphite or 
the metastable reaction, liquid — » y + Fe 3 C. During cooling through the eutectoid interval 
the structure can be the stable one, y + graphite —>a + graphite, or the metastable y + 
graphite — > pearlite + graphite. These yield the common gray iron, so called because of the 
appearance of its fracture. If the eutectoid reaction follows the metastable path y + Fe 3 C 
— » pearlite + Fe 3 C, the result is white cast iron. Microstructures are shown in fig. 53. 

Which of these paths is followed depends upon the graphitization potential of the 
melt which in turn is determined by the C and alloy content of the melt, principally Si. 
Gray irons usually contain 1 to 3% Si. Silicon increases the activity of C, as commonly 
expressed by the carbon equivalent, CE= %C + 0.3(%Si) + 0.33(%P) - 0.027(%Mn) 
+ 0.4(%S). Gray iron is widely used because of its low cost, strength in compression and 
high damping capacity. 

White irons contain from 1.0 to 2.0% Si. They can be annealed after casting at tem- 
peratures between 900-970°C to produce irregularly shaped particles of graphite called 
temper carbon, resulting in malleable iron. If slowly cooled the matrix is ferrite. More 
rapid cooling or subsequent heat treatments yield matrix structures of pearlite or marten- 
site. Thus, a wide range of mechanical properties is available in these malleable irons. 

Ductile cast iron, also called nodular cast iron, differs from malleable iron in that the 
graphite particles form as spheroids during cooling from the liquid (fig. 53). The matrix 
in the as-cast condition can be ferrite or pearlite or can be altered by subsequent heat- 
treatment. Ductile iron has advantages over other cast irons in that the graphite spheroids 
form directly from the melt and no annealing is required as in the production of 
malleable iron. Strength and toughness are greater than in malleable iron and can be 
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Fig. 53. Microstructures of grey, while, and nodular cast iron: (a) graphite and ferrite; (b) graphite and pearlite; 
(c) cementite and pearlite; (d) nodular graphite and pearlite. (Courtesy J. Motz). 


varied over wide ranges by heat treatment. The section size limitation on malleable iron 
does not apply to ductile iron. 
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In the production of ductile iron, two additions are made to the melt (Skaland et al. 
[1993]). The first is the spheroidizing agent, Mg, added as ferrosilicon with 3 to 12% 
Mg. The Mg vaporizes and rises through the liquid iron, removing S from solution. The 
second is the inoculant, which provides nucleation sites for graphite spheroids. The 
inoculant is Ca, Sr or Ba, added in ferrosilicon containing Al. These elements form 
hexagonal silicates whose (001) basal planes provide interfaces with graphite that are 
coherent or semi-coherent with low interfacial energy. 

There are special grades of cast iron for particular applications; 15% Si provides 
resistance to corrosion, 11 to 28% Cr gives oxidation resistance, Ni provides a stable 
austenite matrix. 
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1. Introduction 

Point defects are lattice defects of zero dimensionality, i.e., they do not possess lattice 
structure in any dimension. Topical point defects are impurity atoms in a pure metal, 
vacancies and self -interstitials. This chapter covers the properties of vacancies and self- 
interstitials, their interaction with other lattice defects, the production mechanisms and 
their importance for radiation damage of materials. Properties of atomic solutes per se 
are of interest mainly with respect to thermodynamics of alloys which are treated in chs. 
5 and 6. 

Vacancies are produced simply by heating, in concentrations sufficiently high for 
quantitative investigations. To obtain similar concentrations of interstitials, they must be 
produced by doing external work on the crystal. Such work is done on an atomic scale 
by energetic particle irradiation. Collisions between the projectiles and lattice atoms 
cause displacements of atoms from substitutional sites to interstitial sites. Thus, vacancies 
and interstitials are produced in equal numbers. As one vacancy and one interstitial 
together form a Frenkel defect, irradiation is essentially a Frenkel defect production 
process. This is disadvantageous with respect to experimental research on interstitial 
properties, because radiation induced property changes of crystals always involve 
vacancy contributions. Vacancy formation is treated in §2.2.2 and Frenkel defect 
production in §3.1. 

Plastic deformation also produces vacancies and interstitials. Although its application 
is much less expensive than particle irradiation, it has not become a common procedure 
for point defect production because it does not allow a controlled defect production 
independent of complex dislocation networks. The significance of point defects in plastic 
deformation is treated in chs. 21 and 22, and their recovery behavior after plastic 
deformation is touched upon in ch. 28. 

Anomalously high point defect concentrations occur in some non-stoichiometric 
intermetallic compounds. There, vacancies and interstitials clearly play the role of 
additional alloying elements and are of thermodynamic significance in this sense. A brief 
insight is given here in § 2.3. 

Other methods such as rapid quenching, evaporation on cold substrates or laser 
annealing depend on thermally activated production, which is treated in the present 
chapter for controlled conditions which allow successful vacancy investigation. 

In pure metals and in the majority of alloys, vacancies provide thermally activated 
atom transport and, hence, vacancy properties directly influence the atomic transport (ch. 
7). Vacancy properties give information on interatomic forces by means of specific 
perturbations due to the vacant lattice site. 

The interstitial is a very interesting defect because of the large lattice perturbation it 
causes. This large lattice perturbation and the related dynamic and static properties play 
a key role in the evolution of typical damage structures. These involve not only the 
microstructure but also the microchemistry of alloys. 

The properties of vacancies and interstitials are described in §§ 2 and 3, respectively. 
In the late 1960s, a controversy arose on the interpretation of the generally observed 
radiation damage recovery. The question was whether a satisfactory explanation requires 
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the existence of two interstitial species annihilating at quite different temperatures or 
needs only one interstitial species. Accordingly, quite different migration energies were 
assumed for vacancies and interstitials, respectively, by the advocates of the two different 
models. The reader is referred to the report on the panel discussion held at the Jiilich 
Conference on Vacancies and Interstitials in Metals (see further reading: Seeger et al. 
[1970]) in order to get the flavour of the atmosphere of that period. Many papers in the 
literature give a lively impression. In the meantime, a large number of experimental 
results were reported and even completely new methods were applied. They certainly 
narrowed the scope of the controversy considerably. It would be impossible to outline 
within this article the controversy in terms of all experimental results which were claimed 
at the time to be of relevance. Instead of this, the scientific content and today’s state of 
the controversy is briefly outlined in a dedicated section (§ 33.2.5). 

The application of alloy materials in nuclear reactors, particularly in the core region 
with its high neutron flux density, causes radiation damage to be a technological 
problem. This is, in particular, true for the fast breeder reactor and the future fusion 
reactor. Therefore, §4 is dedicated to the basic mechanisms of point defect production 
and reactions under reactor irradiation conditions. As reactor irradiations are frequently 
simulated by heavy ion irradiation, the characteristics of this are treated as well. The 
interaction of swift heavy ions with metals has been included (§ 4.3) in order to complete 
the description of the interaction of heavy ions with metallic materials although this topic 
is not directly related to reactor technology. Important property changes such as swelling, 
radiation-induced creep and radiation-induced atom redistribution are outlined in §§4.4, 
4.5 and 4.7. Irradiations under reactor conditions also demonstrated the pattern formation 
of vacancy clusters. This topic is treated in § 4.6. 

2. Vacancy properties 

2.1. Theoretical background 

The entropy of a defect-containing crystal is larger than that of a perfect crystal. 
Therefore the Gibbs free energy change resulting from changing the atomic-defect 
concentration c by Sc is given by: 

SG = (A H l - TAS 1 + k B T In c)&, (1) 

where A If is the activation enthalpy of formation, k B In c is the ideal entropy of mixing 
and AS* is that entropy change which arises in addition to k B Inc (excess entropy of 
mixing), and k B is the Boltzmann constant. In the close-packed metals, formation of 
vacancies requires the smallest amount of energy when compared with other lattice 
defects. Hence, they are dominant in thermal equilibrium. 

The entropy change AS 1 is mainly due to the change of the phonon spectrum of the 
crystal by the introduced defects. For the high-temperature harmonic approximation one 
obtains: 
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A5' v f = ^X ln ( w o,/ w i) (2) 

I 

where and a), are the eigenfrequencies of the crystal without and with vacancies, 
respectively. The phonon spectrum is changed by the change of the atomic coupling for 
the nearest neighbors of the vacancy and by the overall lattice volume change. The 
entropy change AS* is positive if on average the a are lowered and negative if they are 
enhanced. 

The simplest model assumes nearest-neighbor interaction by a “spiral spring" with force 
constant/ and calculates the change of the Einstein frequencies of the nearest neighbors of the 
vacancy by removing one of its twelve coupling springs (in the fee lattice). For the 
vibration towards the vacancy, a> 1 0E =4f/M (M= atomic mass) is changed to a> 2 0E =3f/M. 
Vibrations perpendicular to the removed spring are unchanged. For all other atoms all 
eigenfrequencies are unchanged. Hence we obtain AS/ = j(12 k B In 4/3) = 1.73 k B . 

Refinement of the calculation must take into account the static atomic relaxations 
around the vacancy (Dederichs and Zeller [1980]). These change the force constants 
for many more atoms than the nearest neighbors. Correct consideration of this effect 
seems to be difficult. A negative sign of the relaxation contribution to the formation 
entropy change was obtained by Schottky etal. [1964] and a positive one by Burton 
[1971]. In total, A was obtained by Schottky etal. [1964] to be 0.49 kg for Cu, 0.92 
k B for Ag and 1.22 k B for Au, whereas Burton [1971] obtained 1.8 k B -2.0 k B for fee 
and 2.2 k B - 2.4 k B for bcc metals. 

The sensitivity of the result to the number of atomic shells considered as contributing 
to the frequency redistribution has been investigated systematically for different pair- 
interaction potentials by Hatcher etal. [1979]. In the model crystal about 5 x 10 3 atoms 
around the vacancy were allowed to relax statically and for 102 to 5x 10 2 atoms in the 
central region dynamical displacements (vibrations) were allowed. The pair interaction 
was simulated by the Bom-Mayer potential introduced by Gibson et al. [1960] for 
computer simulation of Cu (see also § 3.3.1) and by the Morse potential introduced by 
Cotterill and Doyama [1966]. This last potential was smoothly cut-off at r=1.2a, 
where a is the lattice-constant by Hatcher etal. [1979]. It then fits the observed lattice 
constant, bulk modulus and enthalpy of vacancy formation for Cu. The entropy of 
formation was found to depend approximately linearly on the inverse of the number of 
atoms allowed to vibrate. The relationship was followed up to the 19th atomic shell 
around the vacancy. By extrapolation, AS{ V =2.3 k B was obtained for the Morse potential 
and 1 .6 k B for the Bom-Mayer potential. The influence of the static lattice relaxation on 
the magnitude of A s{ v is demonstrated by comparing the above with 2.28 k B for the 
Morse potential and 2.52 k B for the Bom-Mayer potential; both values are for the 
unrelaxed atoms (perfect lattice around the vacancy). Hatcher etal. [1979] have also 
found A Sf v = 2. 1 k B for a potential simulating a-Fe. The cut-off Morse potential describes 
the formation and migration properties of mono- and di-vacancies in Cu reasonably well. 
The results are unsatisfactory for the Bom-Mayer and the cited long-range Morse 
potential (Cotterill and Doyama [1966]). For the hep Mg, A =1.5 k B -2 k B was 
found by using an empirical potential (Monti and Savino [1981]). 
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Discrepancies between the results from different potentials are particularly large for 
the relaxation volume i.e., the volume change of a crystal caused by removing one 

atom from the interior and withdrawing it from the crystal. The quantity is measured by 
the lattice-parameter change. The Bom-Mayer potential for Cu yields -0.47 atomic 
volume, whereas the Morse potential yields -0.02 atomic volume for both the extended 
and the cut-off version (Dederichs et al. [1978]). With Ah{ v =-0.41 eV, the 
Bom-Mayer potential fails to give a reasonable enthalpy of formation. With the 
corresponding values +1.17 eV and +1.29 eV for the extended and the cut-off Morse 
potential, respectively, satisfactory agreement with experimental values is obtained. 

Ab-initio calculations of vacancy formation enthalpies are obviously problematic. 
Reviews are by Friedel [1970], Evans [1977], Heald [1977], and Stott [1978]. The 
problem consists of calculating the energetics of the electron system for a vacancy 
containing crystal. Only so-called simple metals have been treated. It had to be assumed 
that the core electrons are rigidly confined to the nucleus and any charge redistribution 
caused by the missing atom core only affects the conduction electrons. For transition 
metals the vacancy effect on the outermost d electrons has not been described successful- 
ly as yet. For the simple metals two entirely different approaches have been applied. 

The first consists in replacing the vacancy by a repulsive impurity potential acting in 
a free electron gas (jellium). The second approach is based on pseudo-potential theory 
and assumes the vacancy effect to be weak enough to follow a linear response formal- 
ism. The results of both approaches are unsatisfactory in terms of agreement with 
experimental data. Reasons are the improper treatment of the effect of the local charge 
redistribution as well as of that of the lattice relaxation around a vacancy on the electron 
system. More promising seems to be the combination of selfconsistent cluster calcula- 
tions (quantum-chemical approach) with lattice defect calculations (see Adams and 
Foiles [1990]). 

Vacancy migration has been treated by analytical methods (Flynn [1968]) as well as 
by computer simulation (Beeler [1983]). The model developed by Flynn [1968] 
predicts activation enthalpies for the vacancy migration A H™ in remarkably good 
agreement with experimental data for quite a number of metals. The model relates AH" 
to the elastic moduli. According to the author, the jumping atom passes the saddle point 
by means of a fluctuation of the kinetic energy which was picked up near the atom’s 
equilibrium position where the kinetic energy takes maximum values. There, however, the 
atomic movement can be described by the harmonic approximation. 

2.2. Experimental methods and results 

2.2.1. Introductory remarks 

Vacancies are of atomic size and therefore difficult to observe directly. However, 
agglomerates of numerous vacancies are visible even with a naked eye. Such agglomer- 
ates are formed in suitable diffusion samples owing to the Kirkendall effect (Smigelskas 
and Kirkendall [1947]). Two metals A and B are brazed to each other and then 
thermally annealed at high temperatures. After sufficient atom transport, one observes 
that the two elements diffuse into each other, but with different velocities. As a conse- 
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quence, more atoms leave the fast diffuser crystal than slow diffusing atoms arrive there. 
The crystal responses to this migrational loss of matter by forming voids as shown in fig. 
1 . The quantitative evaluation of such experiments moreover prove that the migration of 
single vacancies is the origin of the atom diffusion (ch. 7, §5. 3. 1.1). 

Field ion microscopy does resolve the atomic structure of crystals and single 
vacancies have indeed been imaged (Seidman [1973]). But these are vacancies at the 
surface or just beneath a surface and might not prove the existence of this defect within 
the bulk of a crystal. 

The volume change of a crystal due to vacancy formation (§ 2.2. 2.2) seems to be the 
most direct measurable change caused by single vacancies. Other properties are treated 
in §§ 2.2.2. 3 and 2.2.2.4. 

2.2.2. Enthalpy and entropy of formation 

2.2.2.I. Single vacancies and di-vacancies. For thermal equilibrium one obtains from 
eq. (1): 

c v ° = exp(A5 v f //: a )exp(-A// v f //: a r). (3) 

The determination of enthalpy and entropy of formation according to this equation 
requires the determination of equilibrium vacancy concentrations c v °. Two different 
procedures are applied: either measurement of c v at high temperatures such that c v is 
indeed in thermal equilibrium during the measurement, or quenching of the sample from 
high temperatures to low temperatures such that the defects are immobile. The latter 
method avoids difficult measurements at high temperatures but requires considerable 
efforts to avoid vacancy losses during quenching. Review articles are by Hoch [1970], 
Balluffi etal. [1970] and Siegel [1978], 

Even the most careful measurement of c v ° at high temperatures cannot yield the 
wanted formation enthalpy A H f u and entropy A S{ y of the monovacancy because c v ° does 



Fig. 1. Cross section of a diffusion couple Cu (above) against Ni (below). Thin black line represents the Cu-Ni 
interface before the diffusion annealing. Black dots near the interface line are cavity images. 
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not necessarily involve only monovacancies. Actually it is given by 
c v = c lv + 2 c 2v +... = nc„y, where c„° is the equilibrium concentration of clusters 
consisting of n vacancies. For equilibrium concentration measurements, higher aggregates 
than di-vacancies need not be considered owing to their negligibly small concentration 
in the normal cubic metals. We then have in the dilute solution approximation: 

c v ° = c° + 2 c 2 ° v = exp( AS^ As ) exp(-AHl/k B T) 

(4) 

+ 2 g 2v exp[(2A5, f v - A5 2 b v )A a ]exp[(A// 2 h v - 2A Hl)/k B T\ 

where A S 2v and A H%, are entropy and enthalpy of binding of the di-vacancy, respective- 
ly. For the di-vacancy formed by two vacancies at nearest-neighbor sites in the fee 
structure, the ageometry factor becomes g 2v = 6. In bcc and hep structures different di- 
vacancy configurations may exist simultaneously with non-degenerate values of g 2v , A S 2v 
and A// 2v . Up to now, there is no reliable method of measurement which separates c 2 ° v 
and c ,y. Hence, the formation enthalpy as derived from the temperature dependence ofc v ° 
is an effective one: 


A H 


r 

V 


«?lnc v °(r) 

4i/k B T) 


= AHl + 2g 2v 


(AHl -AHl), 

exp[ 

(pi - Gl)/k B T 

1 + g 2v exp 


, ~ Gl)/k B T 



(5) 


where G denotes the Gibbs free energy. For bcc and hep metals, eq. (5) becomes much 
more complicated due to the possible occurrence of different di-vacancy configurations 
(Seeger [1973a]). Equation (5) introduces a temperature dependence of A H{ which must 
be considered whenever A H{ is determined over a large temperature range. Binding 
enthalpies of the di-vacancy have been calculated for Cu, Ni, Au, and Pt to amount to 
0.08, 0.07, 0.08, and 0.11 eV, respectively (Klemradt et al. [1991], §2.2.5). Such 
values would not lead to measurable deviations from the Arrhenius behavior. 

2.2.2 .2. Differential dilatometry. The classical method of direct c v ° determination is 
differential dilatometry (Wagner and Beyer [1936], Simmons and Balluffi [1960a, b, 
1962, 1963]). It is based on a theory by Eshelby [1956], which relates the microscopic 
volume change AV rW of randomly distributed dilatation centers (point defects) to the total 
volume change of the defectcontaining crystal as compared to the perfect crystal. The 
total crystal volume change consists of the Eshelby contribution plus that caused by 
adding to the total crystal volume the atomic volumes of those atoms which are taken out 
of the interior of the crystal in order to produce vacancies. The crystal volume change 
can be measured by the length change, A///, of the sample for cubic crystals and the 
average microscopic (lattice cell) change, A a/a, by the lattice parameter change of the 
same sample. One obtains: 

c v = 3(Al/l - A a/a), (6) 
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which is correct for cubic crystals and the small vacancy concentrations, usual in such 
metals. 

Precise measurements of this type are extremely difficult. For illustration of the 
orders of magnitude to be measured, the A1 data by Simmons and Balluffi [1960a] are 
shown in fig. 2. From these data the authors derived A H{ =0.76 eV. Since then A H* has 
frequently been determined also by means of the methods of measurement described 
below with the result of values ranging from 0.60 eV to 0.77 eV with 0.67 ± 0.03 eV as 
recommended by Balluffi [1978]. Recommended values for the other metals are given 
in table 1. A complete collection of data was given by Ehrhart [1991] and Schultz 
[1991a], 

The relaxation volume is given by AV*‘/(l = 3A a/c° (where ft is the atomic 
volume). The diffuse X-ray scattering cross section (DXS, §3.3.2.2.) is given by the 
product of c v and the square of the scattering amplitude caused per vacancy (Dederichs 
[1973]) for low vacancy concentrations and random spatial distribution. The scattering 
amplitude is essentially given by the elastic moduli and the dipole force tensor which 
describes the atomic displacements resulting from the insertion of the vacancy. AsAV™ 1 
is determined by the same quantities and can be derived from A a/a measurements as 
shown above, the combination of DXS and A a/ a measurements allows the evaluation of 
both, c v and AV' el (Ehrhart et al. [1979]). The DXS — A a/ a measurements yielded 
AV' e ' =-0.15 ft for gold when measured at 4 K. The DD method yielded -0.5 ft instead 
(Simmons and Balluffi [1962]). Ehrhart et al. [1979] trace this difference back to a 
potential temperature dependence of AV' el . For other metals, AV™ 1 data are given in 
table 1. (See also ch. 12, §3.3). 

The third method determines the volume of vacancy formation AV* = AVj^+ft by 
measuring the quenched resistivity increment (as a measure for c v ° at the quenching 
temperature T q ) as a function of temperature and hydrostatic pressure. The order of 
magnitude of the pressure dependence is such that for gold, a 6 kbar pressure increase 
corresponds to a temperature decrease of about 30 K around 900 K for constant c v °. 

The way of determining AV* 1 from (A a/a) q and A p q measurements on quenched 



Fig. 2. Differential dilatometry in vacancy equilibrium conditions for aluminium. AL/L is the relative length 
change of a sample, Aa/a is the relative lattice-constant change. (After Simmons and Balluffi [1960a].) 
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Table 1 


Properties of vacancies in pure metal. 


Glossary: 

Property symbol 

Definition and units 

cJTJ 

Vacancy concentration at the melting temperature (x 10 4 ) 

as; 

Activation entropy of formation (units of kg), referring either to single vacancy 
values or to effective values depending on data evaluation 

A Hi 

Activation enthalpy of formation (in eV) referring to single-vacancy or effective 
values, depending on data evaluation 

AV? 

Relaxation volume of the vacancy (in atomic-volume units) 

P, 

Electric resistivity contribution per unit concentration of vacancies 
(in units of 10“* flcm) 

AHf 

Activation enthalpy of migration (in eV) 

Measurement symbol 

Method of measurement for data evaluation 

DD 

Differential dilatometry 

V,. ^Pim 'V» 

Electrical resistivity of quenched, irradiated and cold worked samples respectively 

TEM 

Transmission electron microscopy 

PAS 

Positron-annihilation spectroscopy 

DXS 

Diffuse X-ray scattering 

NMR 

Nuclear magnetic resonance 

/tSR 

Muon spin rotation experiments 

THDS 

Thermal helium desorption spectroscopy 


Property 

Value 

Measurement 

Reference 

Ag (silver, fee) 

cfT M ) 

1.7 

DD 

Simmons and Balluffi [1960b] 

as; 

1.5 

DD 

Simmons and Balluffi [1960b] 

A 

1.11 ±0.05 


Siegel [1982a] 

Pv 

1.3 

APq 

Doyama and Koehler [1962] 

Atf" 

0.66 ± 0.05 


Balluffi [1978] 

A1 (aluminium, fee) 
cfT u ) 9.4 

DD, A p. 

Siegel [1982b] 

as; 

0.7 

DD, A p. 

Siegel [1978] 

A 

0.67 ± 0.03 


Balluffi [1978] 

AVf 

-0.05 

DXS 

Siegel [1978] 

Schaefer etal. [1987a] 
Ehrhart et al. [1986] 

Pv 

1.0 


Siegel [1978] 

Atf? 

0.61 ± 0.03 


Siegel [1982b] 

Au (gold, fee) 


7.2 

DD 

Simmons and Balluffi [1962] 

a a; 

0.72 


Siegel [1978] 

A 

0.93 ± 0.04 


Siegel [1982a] 

AV? 1 

-0.15 

Ap„, HDS, A a/ a. 

Ehrhart et al. [1979] 

P. 

1.70 

valid at 4 K 

Ap q 

Ehrhart etal. [1979] 

AHf 

0.71 ± 0.05 


Sonnbnberg and Dedbk [1982] 
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Property 

Value 

Measurement 

Reference 

Be (beryllium, hep) 



ah? 

0.8 


Schumacher [1970] 

Bi (bismuth, rhombohedral) 



c,(T») 

6.17 

DD 

Matsuno [1977] 

tel 

0.3 

DD 

Matsuno 11977] 

AH[ 

0.35 

DD 

Matsuno [1977] 

AH? 

0.5 

PAS, A 

Corbel etal. [1987] 

Cd (cadmium, hep) 



cJtTJ 

4.5 

DD 

Janot and George [1975] 


0.5 

DD 

Janot and George [1975] 

AHl 

0.46 ± 0.05 

PAS 

Ehrhart [1991] 

AV?' 

-0.54 

PAS, pressure dependence 

Dickman etal. [1978] 

AH? 

0.4 ± 0.05 


Ehrhart [1991] 

Co (cobalt, hep) 

AV7 1 

-0.05 

DXS, valid at 4 K 

Ehrhart and SchOnfeld [1982] 

Atf v M 

0.72 

Ap* 

Dander and Schaefer [1977] 

Cr (chromium, bcc) 



MJ[ 

2.0 ± 0.2 

PAS 

LOPER etal. [1985] 

AH? 

-0.95 


Schultz [1991] 

Cu (copper, fee) 

c v (T,,) 

2.1 ±0.1 

DD 

DlFFERT etal. [1987] 

tel 

2.8 

APq 

Berger etal. [1979] 


1.6 

DD 

Differt etal. [1987] 

Mil 

1.28 ± 0.05 


Schaefer et al. [1987b] 

AV?' 

-0.25 

DXS, low temperature 
value 

Ehrhart etal. [1986] 

Pv 

0.62 

A p v DD 

Berger etal. [1979] 

AHl' 

0.70 ± 0.02 


Balluffi [1978] 

Fe (iron, bcc) 
AVf 

-0.05 

A a/ a. Ait DXS 

Ehrhart [1985] 

AHl (para) 

1.79 ± 0.1 

PAS 

De Schepper et al. [1983] 


1.85 ±0.1 

PAS 

Shirai etal. [1989] 

AHl (ferro) 

1.59 - 1.73; 

/iSR, ferromagnetic state 

Furderer etal. [1987] 

A H? (ferro) 

0.55 


Takaki etal. [1983] 


1.11 ±0.2 

Q SD -Hl. 

Shirari etal. [1989] 

In (indium, tetragonal) 



A5J 

5 

PAS 

Weiler and Schaefer [1985] 

Mil 

0.52 ± 0.04 


Ehrhart [1991] 

AV? 1 

-0.6 

PAS 

Dickman et al. [1977] 

AH? 

(0.31) 

Qv°-H? 

Weiler and Schaefer [1985] 

K (potassium, bcc) 



A Hi 

0.34 

Ql°-AH? 

Wallner etal. [1986] 

A H? 

0.038 

APi„ 

Wallner etal. [1986] 

Li (lithium, bcc) 
Alii 

0.48 

Ql°~AH? 

Schultz [1991] 

AH? 

0.038 


Wallner [1983] 



Ch. 18, §2 


Point defects 


1631 


Property 

Value 

Measurement 

Reference 

Mg (magnesium, hep) 



cfT M ) 

7.2 

DD 

Janot etal. [1970] 

AS 

0 ± 0.3 

DD 

Janot etal. [1970] 

AH[ 

0.79 

APq 

Tzanetakis et al [1976] 

AH? 

0.5-0.6 

Ap q 

Tzanetakjs et al. [1976] 

Mo (molybdenum, bcc) 



cfTJ 

0.13 


Schultz [1991] 

as; 

1.6 


Schultz [1991] 

A H[ 

3.0 ± 0.2 

PAS 

Schaefer [1987] 

AV? 

-0.1 

DXS 

Ehrhart [1978] 

Pv 

4.3 


Schwirtlich and Schultz [1980] 

AH ? 

1.35 

A P 

Schwirtlich and Schultz [1980] 

Na (sodium, bcc) 




cJTJ 

7.8 ± 0.3 


Adlhart etal. [1975] 

AS[ 

3.9 ± 0.3 


Adlhart et al. [1975] 

A Hi 

0.354 ± 0.035 

A Aa, A 1/1 

Adlhart etal. [1975] 

AH? 

0.03 


Wallner [1983] 

Nb (niobium, bcc) 




3.07 

Qf-AH? 

Schultz [1991] 

AH? 

0.55 


Faber and Schultz [1977] 

Ni (nickel, fee) 




A Hi 

1.79 ± 0.05 


Schaefer [1982] 

AV? 

-0.2 

DXS 

Bender and Ehrhart [1983] 

Pv 

2. 3-3.6 

APq 

Wycisk and Feller-Kniepmeier [ 1978] 

AH? 

1.04 ±0.04 


Seeger and KronmOller [1987] 

Pb (lead, fee) 




cfT M ) 

1.7 

DD 

Feder and Nowick [1967] 

Asvf 

2.6 

Ap q , DD 

Leadbetter etal. [1966] 


0.7 ± 2.0 

DD 

Feder and Nowick [1967] 

A Hi 

0.58 ± 0.04 

PAS 

lTO et al. [1982], 




Hara and Nanao [1982] 

Pv 

2.8 

APq 

Knodle and Koehler [1978] 

AH? 

0.43 ± 0.02 

Ap» 

Schroeder and Schilling [1976] 

Pd (palladium, fee) 



c/r M ) 

0.14 

PAS 

Schaefer [1982] 

A Hi 

1.85 

PAS 

Schaefer [1982] 

AH? 

1.03 ± 0.3 

Ap™ 

KOster and Kehrer [1965] 

Pt (platinum, fee) 



cfT M ) 

9.4 

V, 

Khellaf etal. [1988] 

as; 

0.4 

APq 

Khellaf etal. [1988] 

ah; 

1.35 ± 0.05 


Ehrhart [1991] 

AV? 

-0.28 

APq 

Emrick [1982] 

Pv 

5.75 

F1M, A p v upper limit 

Berger etal. [1973] 

AH? 

1.43 ± 0.05 


Balluffi [1978] 

Re (rhenium, hep) 



AH? 

2.2 

Ap C w 

Vandenborre et al. [1968] 


0.5-0.6 

Vi,, 

VandenbORRE et al. [1972] 
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Property 

Value 

Measurement 

Reference 

Sb (anthnon, rhombohedral) 



CJJ, m) 

10 


de Diego etal. [1987] 

ah[ 

1.4 

e SD -H' , v 

Ehrhart [1991] 


-0.3 

PAS 

de Diego etal. [1987] 

Sn (tin; /3-Sn (for T > 13°C): tetragonal, bet (A5)) 


as: 

1.1 

DD, PAS 

Balzer and Sigvaldason [1979] 


£0.3 

DD 

Balzer and Sigvaldason [1979] 

A Hi 

0.51 

PAS 

Shah and Catz [1984] 

A 

0.3-0.4 


McIlwain et al. [1975] 

Ta (tantalum, bee) 



A H[ 

3.1 

Qf-AH? 

Schultz [1991] 

AH? 

0.7 


Faber etal. [1974] 

Th (thorium, fee) 



A Hi 

1.28 + 0.2 

PAS 

Kim et al. [1984] 

AH? 

2.04 

Q SD -Hl. 

Kim et al, [1984] 

V (vanadium, bee) 



A Hi 

2.2 ± 0.4 

PAS, life time 

Janot etal. [1982] 

AH? 

0.7 


Schultz [1991] 


0.5 

Ql°-AH?, PAS 

Schultz [1991] 

W (tungsten, bee) 



Pv 

7 


Park etal. [1983] 

c„,(T m ) 

3 


Park etal. [1983] 

A Si 

3.2 


Park etal. [1983] 

A Hi 

3.6 ± 0.2 


Park etal. [1983] 

AH? 

1.70 + 0.1 

valid for low T 

Mundy et al. [1987] 

Zn (zinc, hep) 



cJTvb 

5 ±0.3 

DD 

Balzer and Sigvaldason [1979] 

AS' 

1 + 1 

DD 

Balzer and Sigvaldason [1979] 

A Hi 
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samples must be based upon well known p v values. As these are obtained by use of 
absolute c v ° data from DD measurements this method goes back essentially to DD data 
but does involve all quenching problems. 

For the bcc metals AV™‘ values do not exist because of difficulties in determining 
absolute c v ° values (Schultz [1991]). For hep metals, similar values have been reported 
as for the above-quoted fee metals. 

2.2.2.3. Positron-annihilation spectroscopy. The second common method of 
measuring vacancy concentrations in thermal equilibrium is positron-annihilation 
spectroscopy (PAS). Review articles on PAS application for studies of vacancy properties 
are by Seeger [1973b], Doyama and Hasiguti [1973], Triftshauser [1975], Siegel 
[1978], Munarends [1979], West [1979] and Hautojarvi [1987], High-energy 
positrons injected into metal crystals are rapidly thermalized by electron-hole excitations 
and interactions with phonons. The thermalized positron diffuses through the lattice and 
ends its life by annihilation with an electron. The lifetime depends on the total electron 
density occurring along the diffusional path of the positron. Vacancies obviously trap 
positrons in a bound state, and because of the missing core electrons at the vacant lattice 
site, the local electron density is significantly reduced. This condition causes the lifetime 
of trapped positrons to be enhanced by 20-80% as compared to that of free positrons in 
the perfect lattice. Consequently, positrons in a vacancy-containing crystal end their lifes 
by annihilation either as free positrons or as trapped positrons. Critical discussion on 
non-thermal trapping was given by Kluin and Hehenkamp [1991]. The lifetimes for 
both fates are different and the probability of trapping is proportional to the vacancy 
concentration. Lifetime measurements are possible as y -quanta are emitted at the birth of 
a positron as well as at its decay. Fortunately, thermalization happens within about one 
picosecond whereas the average lifetime in the metal crystal is in the order of 200 ps. 

With PAS, information is extracted not only from the positron lifetimes but also from 
the angular correlation between the directions of the emitted annihilation y-rays and from 
the Doppler broadening of these y-rays. These quantities yield information on the net 
momenta of the annihilating electron-positron pairs. They allow a distinction to be made 
between annihilatipns with the higher-momentum core electrons and the lower-momen- 
tum valence or conduction electrons. 

Vacancy concentrations are determined by means of lifetime spectrum measurements 
as well as momeijtum techniques (angular correlation and Doppler broadening). The 
former avoids additional assumptions on positron-annihilation parameters but requires 
high-resolution measuring techniques and expanded data deconvolution. Although the 
deconvolution of momentum-distribution data requires critical assumptions on the 
temperature dependence of positron-annihilation parameters, these methods have become 
more popular for A H{ determinations. In fig. 3, vacancy concentrations derived from 
angular correlation measurements are shown for Cu and Au. The range of measurement 
by differential dilgtometry is indicated for comparison. The PAS data extend to about 
two orders of magnitude lower vacancy concentration than the differential dilatometry 
data. As monovacancies certainly predominate in this concentration range, PAS studies 
are of great importance as a complement to differential dilatometry on the one side, and 
to resistivity measurements in quenched samples on the other side. 
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Fig. 3. Arrhenius plot of the vacancy concentration as derived from positron annihilation spectroscopy for gold 
(full symbols) and copper (open symbols) according to Triftshauser and McGervey [1975]. The left hand 
side of the arrows represents the range covered by the differential dilatometry. 


2.2.2.4. Resistivity measurements after quenching. All problems of measurements 
at temperatures near the melting point can be avoided if equilibrium vacancy concentra- 
tions can quantitatively be quenched from such temperatures 7 q to temperatures which 
immobilize the vacancies. Properties which are proportional to the vacancy concentration 
could then be measured on quenched and unquenched samples for direct comparison. 
This idea appeared so challenging that large efforts were put into the development of 
quenching methods following the pioneering work by Kauffman and Koehler [1952], 
Methods and problems are reviewed by Balluffi et al. [1970] and Siegel [1978]. There 
are indeed problems inherent to the quenching process. During quenching, the vacancies 
are still highly mobile in a significant part of the total temperature interval being passed 
through. The migrating vacancies are able to react with other defects or with one another, 
with the following consequences: (i) Vacancy losses to sinks such as dislocations, grain 
boundaries and surfaces. The quenched concentration c y will therefore be smaller than 
c v °(7 q ). The inhomogeneous distributions of sinks will cause locally varying concentra- 
tions. (ii) Vacancy clustering, which causes repartitioning of the cluster size distribution 
existing at 7 q . High-order clusters are favored in comparison to the equilibrium distribu- 
tion at 7 q . 

By modelling the vacancy reaction scheme for the conditions of quenching, vacancy 
loss and repartitioning of cluster sizes were studied in great detail (Balluffi et al. 
[1970]). With the aid of such calculations, quenching results obtained for systematically 
varied quenching rates could be corrected and extrapolated to infinite quenching rates in 
the case of Au (Flynn et al. [1965]). Systematic studies of this kind were performed 
also on Al (Bass [1967] and Berger et al. [1978]). 

An entirely different approach was successfully followed by Lengeler [1976] and 
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Lengeler and Bourassa [1976]. They were able to rapidly quench single crystals of Au 
and Cu with dislocation densities so low that vacancy losses are negligible even for high 
T q values. The effect is demonstrated in fig. 4. Quenching data were reviewed by 
Balluffi [1978]. 

2.2.3. Activation enthalpy of migration 

2.2.3.I. Problems of methods of determination. The common way of investigating 
the migration properties of vacancies consists of quenching a sample from high tempera- 
tures and subsequent annealing at increasing temperatures in order to induce the 
diffusion-controlled annihilation of the excess vacancies at the annealing temperature 
(further reading: Seeger et al. [1970]). The residual resistivity has mostly been taken as 
a measure of the vacancy concentration in the lattice. This property is certainly preferable 
to all others in view of its convenient application and high sensitivity. The isochronal 
recovery behavior of a pure metal containing point defects is sketched in fig. 5. The 
resistivity increase recovers in distinct steps which are labelled according to Van 
Bueren [1955]. Stage I is observed in irradiated samples only. Stage II occurs in 
irradiated and plastically deformed samples. Quenched samples show resistivity recovery 
at temperatures varying from stage III to stage IV, depending on the individual metal. 
Stage V is caused by recrystallization (ch. 28) and occurs in all samples which contain 
larger defect agglomerates. As the first recovery stage of quenched samples does not 


t i° c 

1000 900 800 700 600 



Fig. 4. Arrhenius plot of the quenched-in residual resistivity of gold. The influence of the dislocation density 
N d is shown by the curve for quenched polycrystals. (After Lengbler [1976].) 
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Fig. 5. Isochronal recovery curve of the electrical resistivity of Cu electron irradiated at 4 K. 

generally coincide with stage III of irradiated samples, a straightforward classification of 
this stage was not possible. This situation created the stage III controversy (§ 33.2.5). 

The resistivity method is not appropriate for deducing information on details of the 
underlying defect reactions. Its inability to indicate secondary vacancy reactions like 
clustering and trapping at impurities beside annihilation is one of the origins of the stage 
IE controversy. It was not before the mid seventies that other methods such as PAS or 
perturbed y-y angular correlation (PAC) came up, which allowed a quantitative 
separation of different vacancy reactions. 

A breakthrough was achieved by applying PAS in order to determine the temperature 
at which vacancies become mobile. Another significant step was taken when the ability 
of nuclear probe atom methods such as Mossbauer spectroscopy and PAC to distinguish 
between interstitials and vacancies was discovered. 

As many as 21 experimental results are listed by Balluffi [1978] which favor stage 
III interpretation by vacancy migration. Since then methods such as the perturbed y-y 
angular correlation measurements have been applied fruitfully. Their results also favor 
the vacancy interpretation (see, for example, Pleiter and Hohenemser [1982]). 

2.23.2. Two selected pieces of evidence for vacancy migration in stage III. Among 
the earlier results, the positron-annihilation behavior in stage HI seems especially 
conclusive with respect to vacancy migration iri that stage. The important quantity is the 
so-called lineshape parameter /? = !(/„' - ll)/(l‘ c - //)! The energy of the positron- 
annihilation y quanta is Doppler-broadened because of the non-vanishing momenta of the 
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annihilating electron-positron pairs. The center of the curve (intensity versus energy) is 
caused by low-momentum pairs, i.e., involves conduction and valence (v) electrons, 
whereas the branches arise from high-momentum pairs, i.e., involve core (c) electrons. 
The quantity I is simply a suitably broad section of the Doppler line integral, one below 
the maximum (/ v ) and the other below the branches (/ c ). The upper subscripts t and / 
refer to trapped and free positrons, respectively, as obtained from the irradiated and 
unirradiated state. 

The lineshape parameter R was found to be considerably larger for positrons trapped 
by voids than for those trapped by single vacancies (Mantl and Triftshauser [1978]). 
For trapping by dislocation loops its magnitude is close to that for single vacancy 
trapping. The result from measurements in electron-irradiated Cu through stage III 
recovery is shown in fig. 6. The corresponding resistivity-decrease between 200 K and 
300 K amounted to about 85% of its value at 200 K. The increase of R indicates a 
further fractional decrease of core electron density as compared to the presence of merely 
single vacancies below 200 K. This can only be visualized by vacancy agglomeration in 
three-dimensional arrangements. The effect of interstitial-type dislocation loops which 
could be formed in stage HI according to the two-interstitial model was found to produce 
R = 0.62 ±0.03 in neutron-irradiated aluminium (Gauster et al. [1975]). Interstitial 
agglomeration can therefore be excluded as an origin of the observed R enhancement. 
Mantl and Triftshauser [1978] assume the three-dimensional agglomerates to be the 
nuclei of the vacancy loops observed at higher temperatures by electron microscopy. The 
same behavior of R as in Cu has been observed in the bcc Mo whereas in Al the 
observed increase in R did not significantly exceed the error limit (Mantl and Trifts- 
hauser [1978]). 

Fortunately, the PAC technique indicated the arrival of a vacancy type defect in stage 
III at the probe atom Cd in Al (Rinneberg etal. [1978], Rinneberg and Haas [1978]). 
The PAC technique essentially measures the hyperfine interaction of the probe-atom 
nucleus with the electric crystal field. Interstitials or vacancies situated at the nearest- 
neighbor sites of such probe atoms obviously cause sufficiently large field gradients to 
produce measurable signals. Review articles on this type of application of PAC are by 
Pleiter and Hohenemser [1982], Wichert [1982] and Niesen [1981]. The PAC signals 
are determined by the electric field gradient at the site of the probe-atom nucleus and are 



Fig. 6. Lineshape parameter R of low-temperature electron-irradiated Cu upon isochronal annealing (after 
Mantl and Triftshauser [1978]). 
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in that sense specific for the type of defect which is placed at a nearest-neighbor site. If 
it happens that the probe atom in a given host metal traps the vacancy, a quenched 
sample containing probe atoms will exhibit the PAC signal caused by the vacancy. 
Similarly one obtains the signal caused by the interstitial when one irradiates the sample 
at low temperatures and anneals through stage I or irradiates in stage II. In this way one 
obtains the “fingerprints” of vacancy and interstitial. Clear evidence for the arrival of 
vacancies at the probe atoms in stage III has been obtained for Al, Ag (Butt et al. 
[1979], Deicher et al. [1981a]), Au (Deicher et al. [1981b]), Cu (Wichert et al. 
[1978]), Pt (Muller [1979]), Cd and Zn (Seeboeck etal. [1982]). 

The PAC signal also gives information on the deviation of the electric field-gradient 
tensor from axial symmetry. When measurements are performed in single crystals, the 
tensor orientation with respect to the crystal orientation can be obtained. Knowledge of 
orientation and deviation from axial symmetry provides valuable criteria for answering 
the question whether mono-, di-, or tri-vacancies decorate the probe atom and cause the 
observed signal. Indeed, the PAC technique has resolved a number of different configur- 
ations for the metals quoted above (review by Recknagel etal. [1983]). The variety of 
defect arrangements trapped at In in Au after different treatments is illustrated in fig. 7. 
From the field-gradient tensor properties, it has been concluded that defect 3 is a 
monovacancy at a nearest-neighbor site, defect 4 is a planar <11 1> vacancy loop and 
defects 1 and 2 are multiple vacancies in different arrangements (Deicher et al. 
[1981b]). It should be kept in mind that the spectrum of visible vacancy arrangements 
does depend on the nature of the probe atom because of the required trap property. 
Nevertheless, the variety of arrangements to be seen in fig. 7 gives a good impression of 
the complexity of the stage III processes which, in the earlier controversy, were often 
identified with either monovacancy or monointerstitial reactions. 



Fig. 7. Fraction of In atoms in gold which emit perturbed y-y angular correlation signals upon isochronal 
annealing after quenching and low-temperature irradiation with the particles indicated. The different types of 
lines indicate different types of signals and hence, different defects trapped by the probe atoms (after Deicher 
el al. [1981b]). 


Ch. 18, §2 


Point defects 


1639 


2.2.33. Experimental determination of A H™ , The primary problem of experiments 
involving quenching and subsequent annealing consists in the large variety of defect 
reactions which are likely to occur simultaneously. In addition, the boundary conditions 
for these diffusion-controlled reactions are significantly influenced by the quenching and 
annealing procedure. The following conditions are important: (i) locally inhomogeneous 
distribution of vacancies, due to losses at sinks or cluster formation (§ 2.2.4), (ii) time- 
dependent sink concentration and population of different configurations during the 
annealing process, e.g., by nucleation and growth of vacancy clusters, (iii) sink effi- 
ciencies which depend on both, vacancy concentration and temperature, (iv) vacancy- 
impurity interactions, (v) drift diffusion effects caused by elastic interaction of vacancies 
and sinks. Computer simulation by means of the rate equation approach shows that the 
main defect parameters can hardly be inferred from a standard analysis of isothermal and 
isochronal data if, for example, the actual clustering kinetics are unknown (Johnson 
[1968]). 

The experimentally observed influence of clustering is illustrated in fig. 8 which 
shows for Au and A1 the observed temperatures T, of annealing stages as function of the 
quenching temperature T q . Within the temperature range shown for vacancies either 
annihilate at fixed sinks or form immobile clusters which dissociate at considerably 
higher temperatures than 450 K. The behavior of T a for A1 is illustrated in more detail in 
fig. 9 which shows the temperature-differentiated isochronal recovery curve for the 7^’s 
indicated. Peak B was quantitatively interpreted by a single diffusion-controlled annihila- 
tion process with an activation enthalpy of 0.65 eV, which was ascribed to monovacancy 
migration. Peak A could not be interpreted by a single process and is characterized by an 
effective activation enthalpy of 0.44-0.5 eV. Obviously, at low vacancy concentrations 
as quenched from low temperatures, monovacancies are the dominant defects. For higher 
vacancy concentrations as quenched from temperatures above 500°C, multiple vacancies 
are formed which migrate faster than monovacancies and accelerate the annihilation 
process. This feature describes, in general, the annihilation behavior of quenched-in 
vacancies in most metals, although the separation between the processes involving 




Fig. 8. Dependence of the temperature T. d , characterizing the temperatures of vacancy-annihilation stages in 
isochronal annealing experiments on quenching temperature T q for Au and A1 (after Balluffi [1978]). 
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Fig. 9. Temperature-differentiated isochronal resistivity recovery curve of A1 for different quenching 
temperatures T q (after Levy etal. [1973]). 


monovacancies and those involving mobile multivacancies is not so obvious as in Al. 

In order to obtain reliable data, the boundary conditions must carefully be controlled 
in the experiments (Lengeler and Bourassa [1976], fig. 4) and/or systematically varied 
solutions of clustering models must be fitted to the data for evaluation (Sahu et al. 
[1978]). The results of such fits are not single sets of enthalpy and entropy changes for 
migration of single vacancies but “fields of existence” for possible parameter sets which 
include also the vacancy-binding parameters up to higher clusters. Enthalpies of 
monovacancy migration are listed in table 1. 

The stage III controversy arose when the recovery stage III of irradiated samples 
always occurred at lower temperatures than the main recovery stage in quenched 
samples. A systematic comparison is by Balluffi [1978], As an example, the tempera- 
tures of recovery T a measured by more than 10 groups of authors in Pt are compared in 
fig. 10. The widespread data-scatter for both types of treatment as well as the systematic 
difference between irradiation and quenching is obvious. It is quite suggestive to take the 
discrepancy of 200 K between the most often observed recovery temperatures in 
irradiated and quenched samples as overwhelming evidence for the migration of two 
entirely different defects, as was done by the advocates of the two-interstitial model. 
According to this model, stage III in irradiated metals is caused by the migration and 
subsequent recombination of the so-called stable interstitial which arises from the 
metastable one by conversion (Seeger [1958, 1975]). Its enthalpy of migration is 
supposed to equal the effective enthalpy of migration found for stage III, whereas the 
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Fig. 10. Temperatures T a (definition see fig. 7) measured in Pt between 400 K and 800 K after quenching or 
irradiation, by different authors. Each circle represents one published isochronal recovery stage. The ordinate 
gives the observation frequency of a given T a . (after Balluffi [1978].) 

vacancy migration enthalpy follows from the respective quantity found in quenched 
samples. The behavior of prequenched and irradiated samples has been studied for A1 
(Bauer [1969]), Au (Lee and Koehler [1968]) and Pt (Duesing and Schilling [1969], 
Sonnenberg et al. [1972]). Typical curves for Pt are shown in fig. 11. The splitting of 
the single stage III in the unquenched state into two stages in the prequenched state has 
been interpreted by Schilling etal. [1970] in terms of a mono-di-vacancy model which 
takes into account the different concentrations and local distributions of sinks in the 
quenched and unquenched samples. Very careful measurements and data evaluation 
revealed a monotonic variation of A//^ when proceeding through the stage IH-IV 
region. The data were quantitatively interpreted by a mono-divacancy model with 
A//" = 1.45 eV, Atf 2 ™ = 1.00 eV and A =0.15 eV (Sonnenberg etal. [1972]). 

Balluffi (1978) compared the reaction processes in stage III for the irradiated and 
quenched state by plotting A against T a , the T a values averaged over all reported 
measurements, as shown in fig. 12. The data for the different metals lie well on straight 
lines through the origin, but quenched and irradiated states are characterized by different 
lines. The equality AH^/k B T a for the different metals indicates the close similarity of the 
parameters determining migration and annihilation, such as frequency factor, sink 
geometry and density, and initial defect concentration. The different slopes for quenched 
and irradiated states indicate deviation of at least one of these parameters. The difference 
can be explained by a number of jumps-per-vacancy until annihilation which is larger by 
a factor of 10 3 in the quenched state than in the irradiated state. This order of magnitude 
had already been derived earlier (Sonnenberg etal. [1972], Johnson [1970], Schilling 
etal. [1970], Lee and Koehler [1968], Lwin etal. [1968]). 
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Fig. 1 1 . Temperature-differentiated isochronal recovery curves of Pt after low-temperature electron-irradiation. 
Shown is the influence of pre-quenching and that of initial defect concentration (the ratio of quenched-in 
resistivity to irradiation-induced resistivity is large for the upper curves, small for the lower curves). The curves 
are normalized by different values of Apm (see ordinate). Merely for that reason a deviation between dashed 
and solid curves occurs below 350 K. (after Schilling etal. [1970].) 


2.2.4. Agglomeration 

Vacancy agglomerates which are large enough to be observable in the electron micro- 
scope have been studied in great detail (ch. 11). The topology of configurations involves 
dislocation loops, stacking-fault tetrahedra and voids. Review articles are by Eyre et al. 
[1977], Kiritani [1982], and Shimomura etal. [1982]. Since the resolution of common 
electron microscopy is limited to 1-2 nm, the observable agglomerates certainly contain 
more than ten vacancies (see also §4.2.2). About the same resolution limit holds for 
diffuse X-ray scattering (Ehrhart etal. [1982] and Larson and Young [1982]). Field 
ion microscopy does allow imaging of agglomerates consisting of less than ten vacancies 
(Wagner [1982]). It has, however, not been applied yet to questions like shape and size 
distribution of small vacancy agglomerates as formed by the encounter of migrating 
vacancies. 
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Fig. 12. Effective enthalpies of defect migration versus T, (definition see fig. 7) averaged over the results by 
different authors for quenched and irradiated samples (after Balluffi [1978]). 


While experimental information on configuration and stability of di-, tri- and tetra- 
vacancies is lacking, computer simulations give some information by using a special 
nearest-neighbor interaction potential constructed to similate the bcc a-Fe (Johnson 
[1964]). This potential was also used for simulating a fee model crystal which behaves 
similar to Ni with respect to the elastic properties. This model yielded for the single 
vacancy: A H{ v = 1.49 eV, A H” v = 1.32 eV and AV I ™ ( =-0.15 atomic volume. The stable 
di-vacancy consisting of the vacancies at nearest-neighbor sites is characterized by 
A//, fc v =0.25 eV and AH™ = 0.9 eV. The most stable tri-vacancy, shown in fig. 13a, 
yielded A// 3 * =0.75 eV and AH£ = 1.02 eV. The migration includes an intermediate 
dissociation step. Reorientation of the trivacancy with a lower activation enthalpy than AH£ 
occurs by jump of the atom which forms a tetrahedron together with the tri-vacancy (fig. 
13b). The stable tetravacancy is that shown in fig. 13d with A // 4 * = 1.51 eV. The stability 
of the three-dimensional tetrahedron as compared to the two-dimensional rhombic 
configuration (fig. 13c) might be a direct consequence of the nearest-neighbor interaction 
potential (Dederichs etal. [1978]). 

For the a - Fe model ciystal A H™ =0.68 eV was found for the nearest-neighbor jump. 
The most stable di-vacancy configuration places the vacancies at second-neighbor 
distances A H* v =0.2 eV. Migration happens either via the metastable nearest-neighbor 
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Fig. 13. Tri- and tetra-vacancy configurations in the fee structure (see text). 


configuration or via a fourth-neighbor configuration, both with an activation enthalpy 
being about the same as for the mono-vacancy migration. The most stable tri-vacancy has 
two nearest-neighbor and one second-neighbor spacings with =0.49 eV (Johnson and 
Beeler [1977]). The first immobile aggregate was found to be the tetravacancy. The 
local vibration densities of states of the neigbouring atoms of vacancy clusters were 
calculated for Cu, a-Fe and a-Ti by Yamamoto [1982]. Formation and migration 
enthalpies of di-vacancies have frequently been derived from experiments as described 
in §§2.2.2 and 2.2.3 (Ehrhart [1991]). But the derived numbers result from fitting 
models to measured data by assuming that eq. (4) holds. However, direct evidence for 
divacancies causing the observed deviation of c v °(l/7) from linearity has not been 
reported. Another reason would be a temperature dependence of the activation enthalpies 
of vacancy formation (Audit [1982], Ganne and Quere [1982], Varotsos and 
Alexopoulos [1982]). 


2.2.5. Interaction with solutes 

The interaction of vacancies with solutes controls solute diffusion, also known as 
impurity diffusion, and the self-diffusion of the solvent atoms. This topic is treated in ch. 
7. Here, we briefly look at the interaction energies and their influence on c v °. In theory, 
a number of different approaches have been followed in the past in order to obtain 
quantitative information about the vacancy solute interaction (review by Doyama 
[1978]). Nowadays ab initio calculations are performed by means of the KKR Green’s 
function method based upon the density functional theory in the local spin density 
approximation (Klemradt etal. [1991]). The method allows one to calculate the nearest 
neighbor interaction of vacancies with 3 d and 4 sp solute atoms in Cu and Ni as well 
as with 4 d and 5 sp solute atoms in Ag and Pd. As a result the data shown in fig. 14 
were obtained. A positive sign of the interaction energy means repulsion and a negative 
sign, attraction between vacancy and solute. For the sp solute atoms in Ag and Pd, the 
attraction is approximately proportional to the valence difference. Such proportionality 
was discussed for a long time in the literature for experimental data and was often 
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Fig. 14. Calculated interaction energies of a vacancy with a solute atom of the element given by the abscissa 
for the solvents silver ( — ) and palladium ( — ) in (a) and copper ( — ) and nickel ( — ) in (b) according to 
Klemradt et al. [1991]. 


compared with a proportionality to the lattice parameter according to a different approach 
of modelling (review by Benedek [1978]). The same behavior is seen for the sp solute 
atoms in Cu and Ni. The significantly different behavior of the 3 d solute atoms in Cu 
and Ni as compared to that of the 4 d solute atoms in Ag and Pd is due to their magnetic 
moment. The magnetic exchange energy reduces the repulsive energy to very small 
values. Klemradt et al. [1991] find remarkably good agreement of their data with 
experimental ones as obtained from solute diffusion measurements. The authors also 
report on first results for binding energies of a second solute atom with a vacancy solute 
pair. This configuration and high order clusters are of great interest for the understanding 
of the dependence of solvent and solute diffusion on solute concentration. 

The direct experimental method of determining A H* sa for vacancy solute atom pairs is the 
comparison of DD measurements for dilute alloys and the pure solvent metal. In the dilute 
alloy c° is enhanced by a term being proportional to c sa txp[-(AH^-A.H^ a )/k H T\. 
Further methods are equilibrium PAC measurements and resistivity measurements after 
quenching, both comparing dilute alloys and pure solvent metals (Doyama [1978]). A 
careful comparison of DD measurements and PAS measurements in Cu and dilute Cu-Ge 
alloys performed by Kluin and Hehenkamp [1991] yields good agreement for the 
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results from the two methods after a reassessment of the PAS models applied for the 
evaluation of c v °. 

2.3. Vacancies in ordered alloys 

Long-range atomic order is a common phenomenon in intermetallic phases, (ch. 4). 
Examples are Cu 3 Au and CoGa. For many intermetallic compounds the critical tempera- 
tures for the order-disorder transformation exceed the melting temperature, which 
indicates high ordering energies. For such alloys the point defects show additional 
features to those known from pure metals or disordered solid solutions. Let us inspect the 
already quoted CoGa which belongs to the group of /3-brass electron compounds and 
crystallizes in the CsCl structure. This structure is made up of two cubic primitive 
sublattices a and /3 occupied by A (Co) and B (Ga) atoms, respectively. The latter atoms 
are positioned at the body-centered sites of the first lattice. Vacant lattice sites in the two 
sublattices are not likely to be energetically equivalent, owing to their different nearest 
neighbor atomic shells. The vacancy fractions in the two sublattices may be quite 
different, which can be inferred from the following naive picture. The vacancy in the 
a-sublattice is surrounded by only B atoms. This arrangement might suggest for the 
enthalpies of formation A H“ = A where the right hand side describes the pure B 
metal case. This assumption is indeed confirmed for a number of group VlH-group IHA 
compounds. For Ga we have A ~ 0.5 eV leading to vacancy concentration in the Co 
sublattice of about 10% at 900°C. The more detailed treatment by Miedema [1979] 
yields A //“ = 0.48 eV while equilibrium measurements yielded 0.23 ± 0.06 eV. The same 
reasoning leads to A =1.4 eV and, hence, vacancy fractions in the /3-sublattice being 
many orders of magnitudes smaller than that in the a-sublattice. Certainly, such large 
total amounts of vacancies will significantly determine macroscopic properties of the 
material. 

How are the excess vacancies in the a-sublattices being formed? An excess vacancy 
(excess with respect to the vacancies in the /3 -sublattices) can be formed only by 
transferring an A atom into the /3-sublattice. Such an atom forms an anti-site defect (also 
called antistructure atom), i.e., a point defect specific for ordered alloys. Because of the 
requirement of equal numbers of sites per sublattice the one anti-site defect must be 
accompanied by two vacancies in the a-sublattice. We have a triple defect. 

The creation of anti-site defects introduces the ordering energy as a controlling 
parameter for concentration and mobility of the vacancies in the a-sublattice. The usual 
nearest neighbor jump distance (<lll>/2) must be replaced by the next nearest neighbor 
distance (<100>). In particular, the vacancies are retained from annihilation during 
quenching or even slow cooling. By this means, vacancy concentrations of the 1% order 
of magnitude can easily be observed at room temperature for a number of intermetallics. 
On the other hand, the degree of order does not only depend on the ordering energy but 
also on the formation enthalpy of the a-vacancies. This condition leads to a curved 
Arrhenius plot for the degree of order as being observed. 

For some of the intermetallics significant repulsive interaction of the vacancies has 
been observed and explained by the electrical charge which may be as high as one 
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electron per vacancy (for FeAl, Koch and Koenig [1986]). Repulsive interaction leads 
to dispersed vacancy distribution as observed for FeAl whereas formation of large voids 
(50-100 nm in diameter) is observed for NiAl. For NiAl and for CoAl the vacancy 
concentration is significantly smaller than for FeAl and CoGa. The latter compounds 
show a higher degree of intrinsic disorder than the former ones. Long-range ordering of 
vacancies has been observed for a number of more complex intermetallics (Liu Ping and 
Dunlop [1988]). 

An important feature of many intermetallic compounds is their stability with respect 
to deviation from stoichiometry. The compound CoGa is found to be stable from 45 to 
65 at.% Co for slowly cooled specimens. The vacancy concentration in the Co sublattice 
amounts to about 10% for 45 at.% Co and falls below 0.1% at 65 at.% Co. At the 
stoichiometric composition it amounts to about 2.5%. At first sight, these vacancies 
might be taken as structural ones, i.e., formed to assure the lattice stability for the 
respective composition. On the other hand, the above-mentioned sluggish approach 
towards the thermal equilibrium concentration of vacancies does substantially aggravate 
the discrimination between thermal and structural vacancies. For CsCl structure com- 
pounds the existence of structural vacancies is still discussed controversially in the 
literature (Kim [1986], Wever [1992]). 

For the intermetallic compounds of more complex structures the above-treated 
questions are studied to even lesser extent by systematic means which is mainly due to 
the great difficulties with the preparation of reliable sample material. Nevertheless, for 
a few cases, most interesting properties have been studied in great detail. One example 
is the Zintl phase /3-LiAl which is of great interest as an anode material for lithium 
sulfur batteries. At room temperature this material contains 0.4% and 7% Li vacancies 
for 47 at.% A1 and 52 at.% Al, respectively. The activation energy for Li self-diffusion 
is of the order of 0.1 eV. The Li vacancies are long-range ordered at 80 K (Kim [1986]). 

The intermetallic compounds Nb 3 Sn, V 3 Ga, and Nb 3 Ge (A15 structure) are of great 
interest because of their excellent superconducting properties. These properties are 
closely related to a specific feature of the A15 structure which contains linear chains of 
the transition metal atoms. The transition temperature for the superconducting to the 
normal state was found to depend strongly on the quenching temperature for V 3 Ga. This 
dependence has successfully been interpreted in terms of the anti-site defects (Ga atoms 
within the V atom chains) produced by thermal disordering at the quenching tempera- 
tures (Barker [1987]). Diffusion properties of intermetallic compounds are reviewed by 
Wever [1992]. (See also ch. 7, §5.2). A good recent review is by DeNovion [1995], 

3. Self-interstitials 

3.1. Production of interstitial atoms 

3.1.1. Introduction 

The interstitial formation energy of a few eV is easily provided by irradiating a 
crystal with energetic particles. For example, an electron of 400 keV energy transfers 
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19 eV recoil energy to a Cu nucleus via head-on collision. The maximum energy 
transferred to Cu by fission neutrons of 2 MeV amounts to 125 keV. The fundamental 
problems of radiation damage in materials intensively stimulated research regarding 
particle-lattice-atom interaction and the Frenkel defect production which results. In the 
present section we deal with this matter with regard to the atomic defect concentration 
c d produced by a given fluence $ (time-integrated flux density 0) of particles penetrating 
a crystal. Among the numerous questions on the Frenkel defect production this point is 
particularly important for the evaluation of specific (microscopic) defect properties from 
radiation-induced changes of macroscopic materials properties. 

3.1.2. Atomic displacement cross-section for electron irradiation and the produc- 
tion of stable Frenkel defects 

The atomic fraction of collisions between projectiles and target atoms for a fluence 
is given by <x<b. The term collision ought to be specified in order to render the cross- 
section a a well-defined quantity. Electrons penetrating a crystal interact with the 
electrons as well as with the nuclei of the target atoms. In metals, the electron-electron 
interaction produces heat and, hence, requires cooling of irradiated samples. At the same 
time it causes an angular spread of the electron beam around the direction of incidence. 
This spread is of relevance for the angular resolution of threshold-energy determinations 
(see below) and for the flux-density determination in any kind of electron-irradiation 
experiment except for very thin samples. 

The electron-nucleus interaction causes the displacement of atoms from regular lattice 
sites to interstitial positions. The necessary specification of this collision is given by the 
minimum recoil energy required for permanent displacement of an atom. Only collisions 
with recoil energies T larger than the displacement threshold T d are of relevance for the 
Frenkel defect production. The total cross-section can be written as the recoil-energy 
integral of the differential cross-section d ex, 

, r 7 ^ da(E, T) „ 

a(E) = — dT, (7) 

K 'KdT 

taken from the displacement threshold energy to the maximum transferred energy (head- 
on collision), E is the electron energy. The differential cross-section for the scattering of 
a relativistic (Dirac) electron by a point nucleus was calculated by Mott [1932]. 
Relativistic electron scattering favors mean recoil energies and reduces high ones when 
compared to Rutherford scattering, for which dcr/dT « r 2 = T^. cos" 4 0 holds, where 0 
is the starting angle of the recoiling atom with respect to the incident electron-beam 
direction (further reading: Corbett [1966]). 

The lattice structure causes T d to depend on the recoil impact direction with respect 
to the lattice orientation. This dependence is shown in fig. 15 for Cu (method of 
measurement see further below). 

Irradiation of textureless polycrystalline samples means randomness of the recoil 
impact directions with respect to the lattice orientations. If for T d in eq. (7) the absolute 
minimum T dmin of the threshold-energy surface were taken, a would be calculated much 
larger than the experimental total displacement cross-section. All those collisions with 
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[ 111 ] 



Fig. 15. Displacement threshold energy T d as a function of the atomic recoil impact direction within the 
fundamental orientation triangle in Cu (after King etal. [1981]). 


T> T inikl but with impact directions for which T<T i holds do not produce Frenkel 
defects. This effect is often considered phenomenologically by introducing the displace- 
ment probability, p(T) into eq. (7), leading to the displacement cross section 

<r*(E) = J 7 W) P(T) da( f T T ) dT. (8) 

When the angular dependence of the threshold energy is known, p(T) and hence <r d can 
be calculated when dcr/dT is known. For the determination of the angular dependent 
threshold energy, particle irradiation of a single crystal is desired which causes atomic 
displacements with just one recoil impact direction. By variation of the crystal orientation 
with respect to the recoil impact direction and by variation of the particle energy, the 
angular dependence would thus be scanned. Unfortunately, the statistical nature of 
particle interaction causes a distribution of recoil impact directions which fills the solid 
angle 2tr (maximum angle between incident projectile direction and target-atom recoil 
impact is tt/2) for any given incident particle direction. The distribution is governed by 
the angular dependent differential cross-section of the particular type of particle-atom 
collision. Hence, any scanning of the angular dependent threshold energy requires 
deconvolution of the measured data with respect to the angular dependent differential 
cross-section. An additional problem arises for electron irradiation from the considerable 
beam spread upon passing a sample. It is due to the multiple scattering by target 
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electrons. For electrons of 0.5 MeV, the angular distribution at the reverse side of a 
25 p,m thick Cu sample is such that about 40% of the electrons are scattered out of their 
incident direction by more than 40°. Review articles on earlier threshold-energy surface 
determinations are by Sosin and Bauer [1968], Vajda [1977] and Jung [1981a]. 

The angular resolution for T A has been improved substantially by applying high- 
voltage electron microscopy (HVEM) for in-situ defect production. Two measuring 
methods for defect production rates have been developed: (i) residual resistivity 
measurements in electron-microscope samples, i.e., samples of about 400 nm thickness 
and 0.1 x 0.1 mm 2 irradiated area (King et al. [1981]) and (ii) rate measurements for 
nucleation and growth of interstitial-type dislocation loops (Urban and Yoshida [1981]). 
The first method is applied at irradiation temperatures below 10 K, whereas the second 
one requires mobile interstitials, i.e., irradiation temperatures above 50 K (see § 3.3.2.4). 
Both methods benefit by the small sample thickness, by the ease of sample tilting with 
respect to the beam direction and by the electron flux density which is some orders of 
magnitude larger than at common accelerator irradiations. The orientational dependence 
of threshold energy shown in fig. 15 was obtained by resistivity-change rate measure- 
ments at six different electron energies and about 35 different crystal orientations, 
yielding about 200 data points altogether. 

The displacement probability p(T) derived from the threshold energies in fig. 15 is 
shown in fig. 16 with error bars. Calculations of the total displacement cross-section 
according to eq. (8) with such p(T) data leads to uncertainties of 10-15%. A similarly 
exhaustive determination of the angular dependence of T d as in Cu does not exist for any 
other metal yet. As a consequence, defect production rates can generally not be calcu- 
lated with satisfactory accuracy for given irradiation experiments by applying eq. (8). 
Another method often applied is the measurement of a standard quantity as the electrical 
resistivity under the given irradiation condition and relating all measured property 
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Fig. 16. Displacement probability versus recoil energy T d as resulting from the angular dependence in fig. 15. 
The error bars indicate the sensitivity of the threshold-energy surface fit to the measured damage rate data. 
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changes to that of the standard quantity. This procedure is described in §3.2. The 
resistivity contribution per unit concentration of Frenkel defects, p F enters into the 
determination of the angular dependent T d as the displacement cross-section cr d {E, 9) for 
the electron energy E and the incident beam direction 9 is measured by the resistivity 
damage rate dp/dd> =p F cr d . If certain conditions regarding (i) the angular dependence of 
the displacement probability (not discussed here) and (ii) the magnitude of the available 
recoil energies T relative to T d are fulfilled, the value of p F can be derived from such 
measurements as well as the angular dependent T d (Abromeit [1983]). The results in 
figs. 14 and 15 have been obtained with p F = 2x 10^0 cm. But the optimal fit assuming 
the above-mentioned conditions to be fulfilled yielded p F =2.85x 10^17 cm (King and 
Benedek [1981], see also table 2). 

The data in fig. 15 tell us that the production of stable Frenkel defects is energetically 
more favorable for recoil impacts near <100> and <110> than for those near <11 1> and 
its environment. Analytical calculations (Leibfried [1965]) and computer simulations 
(Gibson etal. [I960]) have shown that impacts along these favorable directions initiate 
replacement collision sequences (fig. 17) which produce the interstitial separated from the 
vacancy by several nearest neighbor distances (nnd). The energy release per nnd along 
the replacement chain amounts to about 0.7 eV. A recoil impact along <11 1>, for 
comparison, leads to simultaneous collisions of the knock-on atom with three nearest 
neighbor atoms which form a equilateral triangular window around the travelling path of 
the primary knocik-on atom. As a result, the recoil energy of this 4-body collision is 
divided by three and accordingly the kinetic energy of the recoiling atoms which remains 
available for defect production. Around 60 eV primary recoil energy is required to 
produce one Frenkel defect while about 20 eV are sufficient for recoil impacts along 
<100> and <110>. 

It is obvious that an interstitial cannot be stable at or close to the nearest neighbor 
position of a vacancy. Spontaneous recombination, occurring even at T= 0, will annihilate 
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Fig. 17. Frenkel defect production via replacement collision sequence by an incident electron in the MeV range 
(schematic). 
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the displacement process. Hence, a minimum distance between vacancy and interstitial 
is required to render the Frenkel defect mechanically stable. This fact and the above- 
discussed energetically unfavorable many-body collisions along higher-indexed lattice 
directions suggest that the direct transfer of the primary knock-on atom into a stable 
interstitial position is unlikely to occur. The displacement of the primary knock-on atom 
to the nearest neighbor position has indeed been proven experimentally by means of PAC 
analysis after neutrino recoil-induced Frenkel pair production (Metzner etal. [1987]). 
The occurrence of replacement collisions is proven by the radiation induced disordering 
of long-range ordered alloys (Kirk etal. [1978]). 

The minimum distance between vacancy and interstitial assuring their stability at 
7’=0K has been studied by computer simulation (Drittler et al. [1969]). The result is 
shown in fig. 18. The volume given by the unstable lattice sites around the interstitial at 
which the vacancy is unstable at T= 0 K is called the spontaneous recombination volume 
v„ (fig. 18a). An irradiated crystal allows stable Frenkel defect production only in that 
volume fraction which is not occupied by the recombination volumes of the preceedingly 
produced Frenkel defects. Due to the spontaneous recombination the defect production 
is reduced according to the factor / sr = (l~2v 0 c d ) to a first order approximation (review by 



z 



Fig. 18. Unstable vacancy positions around the <100> dumbbell in Cu for (a) T = 0 K, • interstitial, 0 
unstable, O stable, O not investigated, (b) 0.1 eV recoil energy given to one of the dumbbell atoms, 0 unstable 
at T = 0,Q dynamically unstable (0.1 eV), U probably unstable, □ stable. 
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Wollenberger [1970]). Under electron irradiation the spontaneous recombination 
volume is apparently enlarged by the so-called subthreshold collisions. The effect is 
demonstrated in fig. 18b showing those lattice sites which are unstable for the vacancy 
when one of the interstitial atoms is recoiling with 0.1 eV. The cross section of the 
lattice atoms for recoil energies larger than 0. 1 eV are following from eq. (7) when the 
lower boundary T d of the integral is replaced by 0. 1 eV. The cross-section for such sub- 
threshold collisions is by two orders of magnitude larger than the displacement cross 
section and is therefore of similar magnitude as the term 2v 0 . The computer simulations 
yield v 0 = 62 H. 

From measurements of the resistivity damage rate as a function of the radiation- 
induced resistivity increase (defect concentration) for electron irradiation, f ST has been 
derived (review by Wollenberger [1970]) with the result v 0 = 125 fl for Cu. The 
subthreshold annihilation rate was indeed found to approximately equal 2v 0 c d . Similar 
values were found for other metals. The difference for v 0 between experiment and 
computer simulation is most probably due to the fact that in reality the Frenkel defect is 
produced with about 50 eV recoil energy in the average accompanied by strong lattice 
exitation while in the model the configuration was initially set without any lattice 
excitation. The cross-section for subthreshold Frenkel defect annihilation was found 
experimentally to be 4 x 10" 25 m 2 , which agrees well with 0.1 eV for the lower limit of 
effective subthreshold recoil energies as assumed in the computer simulations. 

At finite temperatures, the displacement process is influenced by the lattice vibration 
affecting the energy release (phonon production) during the replacement sequence and 
the minimum separation of vacancy and interstitial required for the Frenkel defect 
stability. The minimum displacement threshold energy for Cu was found to decrease with 
increasing irradiation temperature from 19 eV at 4 K to 1 1 eV at 450 K (Urban et al. 
[1982]) indicating a substantial assistance of the energetically most favorable Frenkel 
defect production by lattice vibrations. 

The minimum vacancy-interstitial separation naturally increases when the interstitials 
become mobile. The close pairs (see §3.3.2.4) recombine at temperatures from 15 K 
upwards to about 30 K for Cu. As a result, the recombination volume is larger than v 0 
by the volume shell given by the close pair interstitial sites (when the vacancy is 
assumed to be at the center of v 0 ). At the same time the effective defect production rate 
for c d = 0 is reduced by nearly 7^ = 25% (fractional recovery due to close pair annihila- 
tion). 

At temperatures above 30 K, the correlated recombination of freely migrating 
interstitials (§3.3.2.4) further reduces the production rate. The fraction of freely 
migrating interstitials recombining correlatedly is given by 1 -<r v /r p >, where r p is the 
average vacancy interstitial separation attained by the replacement collision sequence and 
r v the radius of the spherically assumed recombination volume for thermally activated 
migrating interstitials. Both quantities have been determined by means of resistivity 
damage rate measurements. The ratio <r p /r v > = 2 was found for incident electron energies 
above 1 MeV (Becker et al. [1973]) and 80 K irradiation temperature. Near the 
displacement threshold, <ry r w > rises steeply up to about ten. The radius r v was found to 
depend on temperature according to T U1 between 50 K and 105 K (Lennartz et al. 
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[1977]). This temperature dependence is expected when the elastic interaction of vacancy 
and interstitial controls the migration path of the interstitial. In effect, the initial 
displacement rate at temperatures at which interstitials migrate freely (above 32 K for 
Cu) is given by p d (l-/ cp ) (l-cr^r >). The dependence of the defect production rate on 
c d is strongly influenced by the microstructure of the sample and thus provides a useful 
method of analysis for the microstructure (review by Wollenberger [1978]). 

3.2. Determination of Frenkel defect concentrations 

Specific properties of Frenkel defects are determined by measuring a certain property 
change of a macroscopic sample upon irradiation and relating the measured change to the 
underlying atomic defect concentration, c d . Determination of c d from the irradiation 
parameters as treated in the foregoing section is subject to substantial uncertainty. In 
order to avoid such uncertainties it has become usual to measure the electrical residual 
resistivity increase Ap of a sample made of a proper standard material for which the 
constant of proportionality p F is known and to calculate c d = Ap/p F . Resistivity samples 
can generally be irradiated together with the sample under specific investigation. Thus p F 
plays the role of a defect-concentration standard. As has been shown in the foregoing 
section, p F can be derived from resistivity-damage rate measurements in single crystals 
under electron irradiation at low temperatures and under suitable variation of crystal 
orientation and electron energy. 

For the most extensively investigated Cu the uncertainty p F amounts to about ±15 %. 
For other metals the uncertainty is larger. Values are given in table 2. 

An entirely different method of determining c d is the proper evaluation of the X-ray 
scattering intensity in the Huang range and in the range between the Bragg reflections 
(see §3.3.2.2, also ch. 12). The Huang scattering intensity is proportional to 
c d [( A V[‘ l ) 2 + ( A V t re J ) 2 ] (Haubold [1975], Dederichs [1973]). By determining the lattice 
constant change, A a/a, one obtains c d ( AV{‘‘ + AV{f). From both results, AV{f as well 
as c d can be derived once AV™ 1 is known from other measurements. Resistivity 
measurements on the same sample or under the same irradiation conditions then give p F . 
The error limits amount to about ±20% (Schilling [1978], Ehrhart [1991] and 
Schultz [1991]). 

3.3. Interstitial properties 

3.3.1. Results of model calculations 

Methods for calculation of point-defect properties are provided by the theories of 
lattice statics and dynamics (reviews are by Dederichs and Zeller [1980], Leibfried 
and Breuer [1978] Maradudin etal. [1971]) and by computer simulation (reviews are 
by Beeler [1983], Johnson [1973], Gehlen et al. [1972]). Computer simulation was 
introduced into the point-defect research field by Gibson et al. [1960] and Erginsoy et 
al. [1964]. They studied the fundamental displacement processes initiated by recoiling 
atoms and with that gave the first hints for an understanding of the shape of the angular 
dependence of the threshold-energy (§3.1.2) on the basis of replacement-collision 
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Table 2 


Electrical resistivity contribution per unit concentration of Frenkel defects p F [10"’ t Cl cm] for various metals. 


Metal 

Contribution 

Method* 

Reference 

Ag 

2.1 

&Pin 

Lucasson and Walker [1962] 

Roberts et al. [1966] 

Al 

4.0 

APi* 

Jung [1981b] 


3.9 

HS & A a/a 

Bender and Ehrhart [1983] 

Au 

3.0 

Ap in , HS & A a/a 

Segura and Ehrhart [1979] 

Co 

15 

APto 

Vajda [1977] 


16 

HS & A a/a 

Ehrhart and Schonfeld [1982] 

Cr 

37 


Biget et al. [1979] 

Cu 

2.5 

DXS & A a/a 

Ehrhart etal. [1986] 


2.85 

bp* 

King and Benedek [1981] 


2.8 

HS & A a/a 

Bender and Ehrhart [1983] 

a-Fe 

30 

APir 

Vajda [1977] 


20 

DXS & A p ta 

Ehrhart [1985] 

Ga 

5.4 

APir 

Myhra and Gardiner [1975] 

K 

3 

Api* 

Wallner etal. [1986] 

Mo 

15 

HS & A a/a 

Ehrhart and Segura [1975] 


13 

APin 

Vajda [1977] 

Nb 

14.6 

DXS 

Ehrhart [1978] 

Ni 

6 

APi„ 

Lucasson and Walker [1962] 


7.1 

HS & A a/a 

Bender and Ehrhart [1983] 

Pt 

9.5 

APi„ 

Jung etal. [1973] 

Re 

20 

APi„ 

Lucasson [1975] 

Rh 

15 

APin 

Guinan el al. [1973] 


19 

Ap te 

Lucasson [1975] 

Sc 

50 

&Pin 

Daou etal. [1985] 

Sn 

1.1 

tyin 

Dimitrov and Dimitrov [1985] 

Ta 

16 

APta 

Jung el al. [1973] 

Biget et al [1979] 

a-Ti 

14.. 18 

APta 

Shirley and Chaplin [1972] 

Zn 

15 

APin 

Vajda [1977] 


15 

HS & tsa/a 

Ehrhart and Schonfeld [1982] 

Zr 

35 

API* 

Neely [1970] 


40 


Lucasson [1975] 


”Ap m : electrical resistivity of irradiated samples; HS: Huang scattering; A a/a: lattice parameter, DXS: diffuse 
X-ray scattering. 


sequences (Jung [1981b], Urban and Yoshida [1981], Tenenbaum and Doan [1977], 
Vajda [1977], Roth et al. [1975], Becker etal. [1973], Wollenberger and Wurm 
[1965]). 

Such computer simulations are based on the numerical solution of the equations of 
motion for a set of atoms arranged as in the crystal and coupled by spring forces which 
simulate the actual lattice-atom interaction. Critical aspects of the method are the choice 
of the the interaction potentials and the adjustment of the boundary conditions for the 
necessarily size-limited model crystal in a way which represents the correct embedding 
in an elastic continuum. The potentials have generally been chosen so as to generate two- 
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body central forces and to match known physical properties, such as the elastic constants. 
Mostly, a particular functional form of the potential, such as the Morse, Lennard-Jones 
or Bom-Mayer type, is taken to be valid and the respective coefficients are determined 
by matching the model’s properties to various physical properties of the modelled metal. 
In other cases, purely empirical potentials have been developed by the matching 
procedure. Potentials which result from pseudopotential theory have also been used. 
Nowadays, realistic manybody potentials are prepared by means of the embedded atom 
method (Foiles etal. [1986]). Computer simulations have been applied to study not only 
defect production mechanisms but also static and dynamic properties of defects (§2.1). 

For analytical calculations of the interstitial properties, the real space relaxation model 
is often replaced by the normal coordinate expansion model. The latter was introduced 
by Kanzaki [1957] and treats the defect-lattice interaction as a source function ( Kanzaki 
forces) for the displacement in a harmonic lattice. The force equations for equilibrium 
are Fourier-transformed, yielding variables which are Fourier inverses of real space 
displacements. All energy changes are related to the first and second derivatives of the 
interactions, and these derivatives can directly be matched to the force constants as 
following from Born-v. Karman fits to the phonon dispersion curves. The properties of 
primary interest are the activation enthalpies of formation and of migration of the 
interstitial. The latter is determined as the difference between the interstitial formation 
enthalpy in the saddle point configuration and that in the equilibrium configuration, in 
accordance with common use in rate theory. 

3.3.1.1, Activation enthalpies of formation in equilibrium and saddle-point 
configurations. Specific feature of the self-interstitial is the strong lattice distortion with 
its large displacements in the neighboring atom shells (0.14 nnd for the nearest neighbors 
of the two atoms forming a split interstitial configuration in A1 (Haubold [1976])). 
Consequently the lattice relaxation significantly influences the repulsive energy contribu- 
tion. One easily realizes that the large capacity of computers is very helpful for handling 
just this contribution as accurately as possible. 

The electronic contribution consists of the electron-energy change due to the volume 
change of the crystal upon insertion of the interstitial atom. Those results given in table 
3 which were not obtained by computer simulation essentially deviate by different 
treatment of the electronic contribution. 

In order to find the stable interstitial configuration, the formation enthalpy has been 
calculated by a number of authors for all configurations shown in fig. 19. In table 3 only 
minimum and maximum values are given. They indicate that with one exception the 
difference between minimum and maximum of the formation energy is less than 15% of 
the minimum value, and detailed comparison would show that there are always two or 
three different configurations for which the formation energy deviates by less than 5%. 
The origin of these small differences lies in the large relaxation effect (see § 3.3.2. 1). 
Without the lattice relaxation the formation enthalpy would be larger by nearly one order 
of magnitude (Dederichs etal. [1978]). Rearrangement of the interstitial atom or atom 
pair does not cause a significant change of the relaxation volume (see table 3) and, 
hence, changes the formation enthalpy only by a small amount. The data reported by 
Dedericiis et al. [1978] are obtained by computer simulation with a Morse potential 
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Table 3 


Calculated properties of self-interstitials. 


Metal 

Configuration 

<1 A 

a vr‘ 

(at. volume) 

A//" 

(eV) 

Reference 

Cu 

<100>-split 

5.07-5.82 


<0.24 

Huntington [1942, 1953] 


Octahedral 

5.14-6.09 





Octahedral 

2.5-2.6 

1.67-2.01 


Tewordt [1958] 


<100>-split 

3.2 

1.10-1.25 




Octahedral 

2.73-3.42 

1.21-1.78 


Seeger and Mann [1960] 


<100>-split 

2.187 

1.126 

0.103 

Bennemann and Tewordt [1960] 


Octahedral 

2.43-2.44 

1.219-1.441 




<100>-split 

4.351 


0.090 

Bennemann [1961] 


<100>-splil 

4.139 

2.20 

0.05 

Johnson and Brown [1962] a 


Crowdion 

4.840 

2.57 




<100>-split 

2.47-2.84 

1.37-1.44 

0.46-0.57 

Seeger etal. [1962]“ 


<lll>-split 

3.83-4.27 

1.43-1.65 




<100>-split 

3.39 

1.34 

<0.06 

Doyama and Cotterill [1967] ‘ 


Tetrahedral 

3.70 

1.38 




<100>-split 

3.42 

1.5 

0.13 

Dederichs etal. [1978] 


Tetrahedral 

3.S9 

1.48 



Ni 

<100>-split 

4.0S 

1.7 

0.15 

Johnson [1966] 


Crowdion 

4.10 


0.04 



<lll>-split 

4.24 


0.13 


a-Fe 

<100>-split 

4.G 

2.34 

0.21 

Johnson [1965] 


Octahedral 

5.73 



Dederichs etal. [1978]“ 

A1 

<100>-split 

2.S9 


0.15 

Lam etal. [1980] 

Mg 

A-split 

2.36 



Imafuku etal. [1982] 


Hexahedral 

2.66 





‘Gives data on more configurations than listed in this table. 






Fig. 19. Self-interstitial configurations in the free lattice. Symmetries: O — octahedral; T - tetrahedral, C - 
crowdion; H 0 , fi T , H c — dumbbell having axis along <100>, <11 1>, and <110>, respectively. 
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modified in order to soften the strong repulsive core of this potential and to fit (in 
addition to other properties) lattice constant, compression modulus, vacancy formation 
enthalpy) the relaxation volume of the interstitial to the experimental value (see table 4). 
For A1 an inter-ionic potential as derived from pseudo-potential theory was used. 

The activation enthalpy for migration of the Cu interstitial was accordingly obtained 
to be around 0.1 eV, with the exception of 0.5 eV by Seeger et al. [1962], This 
difference was one of the sources which energized the stage III controversy (§ 3.3.2.5). 
The small difference between the formation enthalpies of the saddle point configuration 
and that of the equilibrium position shows again that the activation enthalpy of formation 
depends rather weakly on the interstitial configuration. 

Details of possible migration steps can conveniently be investigated by computer 
simulation. The most probable migrational step obtained for the fee structure (Dederichs 
etal. [1978], Lam etal. [1980]) is shown in fig. 20a. It consists of a translational motion 
of the center of gravity of the dumbbell by one atomic distance and a rotational motion 
of the dumbbell axis by 90°. It should be noted that any other form of motion of the 
<100>-split interstitial requires a considerably higher activation enthalpy. Especially the 
90° axis rotation with fixed center of gravity requires a four times larger activation 
enthalpy. 

For the bcc a-Fe potential, the elementary jump is found again to be a translational 
step of the center of gravity of one atomic distance and rotation of the axis by 60° as 
shown in fig. 20b. The activation enthalpy is 0.21 eV. Contrary to the situation in the fee 
structure, the pure 90° axis rotation requires only a little larger activation enthalpy, 
namely 0.25 eV. 

3.3.1.2.Dynamic properties. Computer simulation furthermore revealed the occur- 
rence of low-frequency resonant modes of the <100> dumbbell in the fee structure 
besides high-frequency localized modes (Scholz and Lehmann [1972], Imafuku etal. 
[1982]). This unusual occurrence of both types of vibrational modes at one defect 


Table 4 

Relaxation volume of self-interstitials*. 


Metal 

AV" 1 

(at. volume) 

Measurement 

Reference 

Al 

1.9 

HS & A a/ a 

Bender and Ehrhart [1983] 

Cd 

11-19 

HS & A a/ a 

Ehrhart and Schonfeld [1982] 

Co 

1.5 

HS & Aa/a 

Ehrhart and Schonnfeld [1982] 

Cu 

1.3 

A a/a 

Dworschak et al. [1972] 


1.45 

DXS & A a/a 

Haubold and Martinsen [1978] 


1.7 

HS & A a/ a 

Bender and Ehrhart [1983] 

Fe 

1.1 

DXS & A a/a, Ap 

Ehrhart [1985] 

Mo 

1.1 

HS & A a/ a 

Ehrhart [1978] 

Nb 

1.11 

DXS & A a/a 

DOnttz et al. [1974] 

Ni 

1.8 

HS & ha/ a 

Bender and Ehrhart [1983] 

Pt 

2.0 

A a/ a 

Hertz and Peisl [1975] 

Zn 

3.5 

HS & A a/a 

Ehrhart and Schonteld [1982] 

Zr 

0.6 

HS & A a/a 

Ehrhart and Schonfeld [1982] 


‘Glossary see tables 1-3. 
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Fig. 20. Migrational steps (a) of the <100>-split interstitial in the fee lattice and (b) the <110>-split in the bcc 
lattice, according to computer simulation results. 


originates from the highly compressed lattice around the interstitial and its special 
configuration (Dederichs etal. [1973]). For the oppositely directed vibrations of the two 
dumbbell atoms along <100> as shown in fig. 21a, the small equilibrium separation 
between the two atoms (0.77 nnd) leads to a very strong force-constant which couples 
the two atoms. This constant leads to a localized mode and for the modified Morse 
potential one obtains the mode A lg shown in fig. 22, which lies well above the maximum 
lattice frequency, co mm . 

For the displacement directions of the two dumbbell atoms as shown in fig. 21c the 
strongly compressed spiral spring between the two atoms exhibits a negative bending 
spring component which acts perpendicular to the spiral spring axes. Quantitative 
evaluation shows that the force-constant of the negative bending spring becomes 
comparable to that of the restoring force of the perfect lattice. The resulting force- 
constant is small so that the librational mode in fig. 21c is a resonant mode with a very 
low frequency, as shown in fig. 22 for E g . Another resonant mode (A 2u) is excited with 
the direction of atomic motion as shown in fig. 21b. As the motions in figs. 21b and c 
also strain the compressed springs with the nearest neighbors of the dumbbell, localized 
modes are excited as well (see A 2u and E g beyond co^ in fig. 22). The low-frequency 
resonances of the dumbbell lead to comparatively large thermal displacements of the 
interstitial atoms. In fig. 23 the mean squares of the atomic displacements are compared 
for dumbbell atoms and those in perfect lattice positions, as calculated for a Morse 



Fig. 21. Localized and resonance modes of the <100>-split interstitial: (a) localized mode /t, s (see fig. 22); (b) 
resonance and localized mode /l 2u ; (c) resonance and localized mode E g . (From Dederichs et al. [1978].) 


References: p. 1710. 
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max 


Fig. 22. Local frequency spectrum of the <100>-split interstitial (averaged over all directions) for a modified 
Morse potential (from Dederichs etal. [1978]). Dashed curve is for perfect lattice. 


potential. It becomes obvious that the resonant modes are thermally populated already 
around 30 K, the beginning of stage I D recovery (§ 3. 3.2.4) in Cu. Indeed, large ampli- 
tudes of the resonant modes lead directly to the saddle point configuration for the 
migration jump. The flatness of the energy contour along the migration jump path is a 
direct consequence of the negative bending spring effect. 

The presence of resonant modes should cause a low temperature maximum in the 
temperature dependence of the specific heat (Zeller and Dederichs [1976]), and some 
experimental indication for its occurrence has been found in electron-irradiated Cu 
(Monsau and Wollenberger [1980]). Experimental difficulties, however, prevented a 
reliable error analysis of the result. 



T IK 

Fig. 23. Averaged square of the thermal displacements of one of the <100>-split interstitial atoms compared 
with that of a regular lattice atom (from Dederichs etal. [1978]). 
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The influence of the resonance modes on the temperature dependence of elastic 
moduli was observed by Holder et al. [1974], They derived <u R =<u max /8 which is in 
good agreement with the calculated modes in fig. 22. The most effective consequence of 
the resonant mode is the considerable elastic polarizability. In fig. 24 the displacement 
directions of the dumbbell atoms and their nearest neighbors are shown for two different 
applied shear stresses and for uniform compression. Obviously the <100>-shear stress 
excites the resonant mode E g , and computer experiments show that the rotation angle of 
the dumbbell axes is by a factor of twenty larger than the shear angle because of the 
bending spring action. The presence of dumbbell interstitials in a crystal therefore must 
decrease the elastic constant c u with respect to a perfect crystal. Large decreases of the 
shear moduli of irradiated polycrystals have indeed been observed many years ago 
(Konig etal. [1964], Wenzl etal. [1971]). This negative sign of the modulus change 
was a long-standing problem in theory and had not been solved earlier than the detection 
of the resonant mode in the computer experiments. 

The existence of a para-elastic polarizability of interstitials was discussed by 
Dederichs etal. [1978]. For anisotropic defects it could be even one order of magnitude 
larger than the dia-elastic polarizability. It can be observed directly, however, only for 
such interstitials as are able to reorient in an external field without performing migration 
steps towards annihilation. On the other hand, migration in an external field will be 
influenced by the para-elastic polarizability. This effect might be of great importance for 
irradiation-induced creep (Bullough and Willis [1975], see also §4.5). 

3.3.I.3. Arrhepius behavior of diffusion. The low activation enthalpy for interstitial 
migration and thb low temperature at which it takes place (stage I D recovery in Cu 
around 35 K, § 3.3.2.4) raised the question whether Arrhenius behavior, as it has always 
been assumed for stage I recovery interpretation, actually exists (Flynn [1975]), Indeed, 
the majority of vibrational lattice modes occupy their ground states of zero-point motion 
at temperatures much lower than the Debye temperature (7’ Deb = 310 K for Cu). Hence 
little justification can be given for a classical description of the migration process from 
this point of view. On the other hand, computer experiments clearly show that the 
migrational step is always a consequence of sufficient excitation of both resonant modes 
of the split interstitial (§§ 3.3. 1.2 and 3.3.2.6). Excitation of these modes is possible only 
by phonons of matching frequency. Since A H™ = 30 tuo R , extremely high excitations 
must be provided by incoming phonons. The fluctuation possibility for this state is 



o b c 


Fig. 24. Homogeneous deformation of a crystal containing a <100>-split interstitial: (a) <100> shear (modulus 
c«); (b) <110> shear (modulus (c u -c 12 )/ 2); (c) compression (modulus (c,, + 2c 12 )/3). 
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essentially classical, i.e., the migration velocity depends on exp-(A/7/Vk7). This is 
indeed the classical Arrhenius behavior because the phonon states corresponding to a> R 
are classically populated already at low temperatures. The motion of the atoms 
neighboring the jumping dumbbell observed in computer simulations suggests a descrip- 
tion in terms of the classical activated state. 

3.3. 1.4. Multiple interstitials. In the foregoing sections it has been shown that 
interstitials are dilatation centers with large relaxation volumes. Two interstitials 
approaching one another interact via these atomic displacement fields. The interaction 
energy can be calculated by means of Kanzaki forces simulating the defect-induced 
displacement field. For distances large compared to the range of the Kanzaki forces (a 
few atomic distances), the energy can be calculated by means of the Green’s function of 
the elastic continuum. From a multipole expansion of the forces one obtains a leading 
term of the direct interaction, the dipole-dipole interaction or the so-called first order size 
interaction. Here, the interaction energy decreases with increasing distance, r, as r" 3 . Its 
sign and magnitude depends on the direction of the defect-connecting line with respect 
to the axes of the elastic anisotropy. For cubic crystals and isotropic defects one obtains 
Eshelby’s formula [1956] as first-order term of a perturbation expansion with respect to 
the anisotropy parameter d=c n -c n ~2c 44 : 


15 / 5 "j AVj rel AV 2 rcl 3 y r* 

87 r l^3c n +2 c i2 +4 cm J r 3 [5 Y r 4 


(9) 


For d> 0 and interstitial-interstitial interaction, i.e., A V{ eI = AV™ 1 >0 we have an attractive 
interaction along <100> and repulsive ones along <110> and <11 1>. For d=0 the dipole 
interaction vanishes but the multipole interaction does not. The influence of higher-order 
terms of the perturbation expansion was studied by Dederichs and Pollmann [1972], A 
numerical solution free of approximations was given by Masamura and Sines [1970]. 

Higher multipole interactions decrease with r“ 5 or even faster (Siems [1968], Hardy 
and Bullough [1967]). Of special interest is the induced interaction based upon the 
polarizability of the defects. It depends on r as r -6 (Siems [1968]). For the interaction of 
two interstitials in Cu or A1 with their high polarizabilities, the induced-interaction 
energy is comparable to the dipole interaction only for r< 3 nnd (Trinkaus [1975]). 

The above described relationships are certainly not applicable to interstitials approach- 
ing one another as close as is necessary to form multiple interstitials. This range of 
interaction has been studied by computer simulation (Ingle etal. [1981]). Quite detailed 
information on static and dynamic properties of di- and tri-interstitials for different Cu 
potentials were obtained. The stability of configurations was studied up to clusters 
consisting of 37 interstitials. The activation enthalpies of binding were found to be larger 
than the activation enthalpies of migration for all cases. Dissociation of multiple 
interstitials must therefore not be expected according to this fee model. The relaxation 
volume change per clustering interstitial amounts to -5% to -15% of AV- d . The 
formation enthalpy per interstitial decreases by about 30% when going from two to ten 
interstitials. 

The stable di-interstitial consists of parallel dumbbells at nearest-neighbor sites, tilted 
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by a small angle (< 10°) in the <110> plane. Stable tri-interstitials are formed by 
mutually orthogonal dumbbells at nearest neighbor sites. Adding one atom in the 
octahedral lattice position to the tri-interstitial configuration yields the stable four- 
interstitial. Larger clusters arise from the four-interstitial cluster by adding further 
dumbbells such that each is equidistant from the central octahedral interstitial and is 
aligned orthogonally to its nearest-neighbor dumbbell. Such three-dimensional structures 
are more stable than two-dimensional ones for fewer than nine interstitials and are less 
stable than these for more than 13 interstitials. The stable two-dimensional clusters are 
platelets of octahedral interstitials on <11 1> planes. 

The activation enthalpy of di-interstitial migration was found to lie below or above 
that of mono-interstitial migration, depending on the details of the potential. For the 
higher-order clusters, the migration enthalpies increase rapidly with the number of 
interstitials in the cluster. 

The dynamic behavior of the di-interstitial is essentially similar to that for the mono- 
interstitial. A variety of resonant and localized modes occur. According to the resonant 
modes the thermal displacements of the dumbbell atom are nearly as large as for the 
single dumbbell. The change of the, elastic constant C u per interstitial is even larger 
than that of the single dumbbell. Although the quantitative results depend significantly 
on the details of the pair-interaction potential used for the simulation, the general 
dynamic behavior makes at least di-interstitials a very interesting object for experimental 
studies. 

For information on interstitial loop formation from MDC studies see § 4.2.2. 

3.3.2. Experimental methods and results 

3 .3.2.1. Relaxation volume. Introduction of an additional atom into a perfect finite 
crystal enhances the volume of this crystal by the relaxation volume of the self-intersti- 
tial, AV,' W . Because of possible relaxation anisotropies, interstitials of concentration c, 
must be introduced randomly with respect to the anisotropy axes in order to produce the 
correct total volume change q AV * 1 . The method of determining AV* 1 is given by the 
appropriate evaluation of the Huang scattering (Ehrhart [1978]) or the diffuse 
scattering between the Bragg peaks (Haubold and Martinsen [1978], §§3.2 and 

3.3.2.2, also ch. 12). As the relaxation volumes of interstitial and vacancy enter quadra- 
tically into the Huang scattering intensity and AV' el is considerably smaller than AV, rel , 
the former needs to be known only approximately. The experimental results are collected 
in table 4. The deviation of AV" 1 from one atomic volume can be imaged as that 
additional volume which must be spent because of the perturbation of the regular packing 
of spheres by the inserted interstitial atom. Perturbation of the fee structure obviously 
requires more additional volume than the less dense bcc structure. 

3.3.2.2. Configuration. Experimental information on interstitial configurations comes 
from diffuse X-ray scattering and mechanical (§ 3.3.2.6) or magnetic relaxation experi- 
ments. The open question of the potential existence of two interstitial configurations in 
one and the same metal as postulated by the two-interstitial model generated a strong 
impact on the development of both the theory of the point-defect-induced diffuse X-ray 
scattering (DedeRichs [1973]) and the measuring technique (Ehrhart etal. [1974]). 


References: p. 1710. 
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As the result of these efforts, the configuration of the interstitial in Al, Cu, Ni, Mo, 
Fe and Zn has been determined. For the fee metals the <100> split interstitial, for the bee 
the <110> split and for the hexagonal Zn the <000 1> split interstitial has been found. 

The analysis of diffuse X-ray scattering makes use of the information which is 
included in the intensity profile scattered outside the Bragg reflections due to the 
imperfect crystal. Here, the relevant imperfections are the atomic displacements around 
interstitial atoms and vacancies. An order-of-magnitude comparison of the different 
scattering contributions to the total diffuse scattering cross-section is shown in fig. 25, 
according to Ehrhart etal. [1974], The interesting intensity profile occurs superposed 
to a background which is one to two orders of magnitude larger than this. Information 
can be obtained only by a careful subtraction of the intensity scattered by an interstitial- 
free crystal from that of an interstitial-containing one. 

Owing to the minor lattice distortions around a vacancy (§§2.1 and 2.2.2.2) the 
diffuse intensity caused by this defect is only a small correction to the interstitial 
contribution. 

The diffuse intensity profile contains the symmetry of the atomic distortions as well 



Fig. 25. Cross-sections for Compton scattering, thermal diffuse scattering (TDS) at 4 K and point-defect X-ray 
scattering by Al as a function of the scattering wave vector (A = 15.4 nm) (after Ehrhart el al. [1974]). Line 
presentations: — Compton scattering, — thermal diffuse scattering at 4.2 K, — • contribution of <100> split 
interstitial and ... by the vacancy, both of concentration 5 x 10 -1 . 
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as the distortion strength, i.e., the spatial extension of the distortion field. The unknown 
configuration is determined by comparing calculated intensity profiles with the measured 
ones. Figure 26 shows calculated scattering profiles for three different interstitial 
configurations in A1 and the comparison with measured data. The agreement between 
measured and calculated data for the <100> split interstitial is convincing. The profiles 
for this configuration have been calculated with a distance between the two dumbbell 
atoms of 0.85 nnd. The four nearest neighbors of the dumbbell were found to be 
displaced outwards by about 0.07 nnd. The long-range atomic displacement field around 
lattice defects determines the scattering at small values of the scattering vector, q. 
Measurements close to the Bragg peaks (Huang scattering) can therefore resolve the 
symmetry of interstitials (Ehrhart [1978]). Although high-angular-resolution techniques 
must be applied, the overall experimental expenditure and the requirements for sample 
quality are less exceptional than in the case of diffuse X-ray measurements between the 
Bragg peaks. This advantage originates from the much higher defect-induced scattering 
intensity. Huang scattering measurements are particularly powerful for point-defect 
clustering studies. 

3.3.2.3. Formation enthalpy. The formation enthalpy of a Frenkel defect can be 
determined by measuring the heat release caused by annihilating Frenkel defects. In 
practice, Frenkel defects are generated in a sample by low-temperature irradiation. In 
order to obtain a reliable measure for the defect concentration generated, a resistivity 
sample is irradiated simultaneously. After sufficient irradiation, the calorimetric sample 
is then heated at a constant heating rate. The power input required for this heating 
condition varies as the stored energy releases. Different methods of measurement for this 
entity have been applied with varying success. One of the most serious problems in 
measuring stored energies in irradiated samples is the required decoupling of the sample 
from good thermal contact with a cooling stage during the irradiation to optimal thermal 




Fig. 26. Comparison of the diffuse X-ray-scattering intensity measured in A1 electron-irradiated at 4 K with 
carves calculated for different interstitial configurations. The numbers 1 to 4 indicate the circles of the Ewald 
sphere shown in the insert (left upper comer) along which the intensity was measured and calculated, (after 
Ehrhart etal. [1974].) 
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isolation during the calorimetric measurements. This procedure must be performed at 
temperatures below 10 K (stage I recovery). Another very inconvenient condition is the 
continuous nonlinear increase of the heat capacity of the sample with increasing 
temperature. The total enhancement of the heat capacity exceeds one order of magnitude 
across the stage I recovery interval. The stored-energy release must be measured on this 
strongly varying background (review by Wenzl [1970]). 

Results of stored-energy measurements have generally been given in terms of the 
ratio stored-energy release per resistivity recovery A<2/Ap. The A/// values in table 5 
were calculated from the reported ratios by multiplying them by the p F values given in 
table 2. The enthalpy of interstitial formation has been calculated according to 
Ah j f — AH J y = Ah{ = ah(, with the vacancy data from table 1. 

3.3.2.4.Activation enthalpy of migration. Most experimental information on the 
migration enthalpy of interstitials comes from electrical resistivity recovery measurements 
performed after low-temperature irradiation. The recovery curve as found for Cu after 
electron irradiation at 4.2 K is shown schematically in fig. 5. The curve can be taken as 
a prototype for many metals, although the detailed structure slightly varies from metal to 
metal. Substantially different curves are observed for Au, Cd and V (Ehrhart [1991] 
and Schultz [1991]). 

The generally agreed interpretation of the stage I recovery consists of the so-called 
close pair recombination in stages I A -I C and the free migration of the interstitial in stages 
I D and I E with interstitial-vacancy recombination upon random diffusion and eventual 
encounter with a vacancy. In close pairs vacancies and interstitials have such a strong 
elastic interaction that the thermally activated jumping interstitial has no chance to escape 
recombination with its correlated vacancy at temperatures not higher than stage I c . For 
an extensive review on earlier experimental work in fee metals and its interpretation, see 
Schilling etal. [1970]. The local correlation of those vacancies and interstitials which 


Table 5 


Activation enthalpy of Frenkel defect and self-interstitial formation 3 . 


Metal 

A 

(eV) 

A/f] 

(eV) 

Measurement 6 

Reference 

A1 

4.3 

3.6 

n 

Schilling and Tischer [1967] 


3.7 

3.0 

e~ 

WOLLENBERGER [1965] 

Cu 

2.9 

1.6 

n 

Blewitt [1962] 


4.1 

2.8 

n 

Losehand etal. [1969] 


4.2 

2.9 

e~ 

Meechan and Sosin [1959] 


5.5 

4.2 

d~ 

Granato and Nil an [1965] 


5.4 

4.1 

e~ 

WOLLENBERGER [1965] 

a-Fe 

6.3 

4.7 

e" 

Moser [1966] 


6.6 

5.0 

e~ 

Bilger etal. [1968] 


13.6 

12.0 

n 

Bilger etal. [1968] 

Pt 

2.9 

1.5 

e~ 

Feese etal. [1970] 


2.4 

1.1 

d~ 

Jackson [1979] 


“Denoted by (Frenkel defect) and A/f] (self interstitial). For evaluation procedure see text. 
b Stored-energy release and resistivity recovery measured after irradiation with electrons (e - ), neutrons (n) and 
deuterons (d**), respectively. 
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were produced by one and the same displacement process (displacement distance r p of a 
correlated Frenkel defect) must cause a recovery stage for which the significant average 
number of migrational jumps per interstitial is determined by r p . For recombination with 
non-correlated vacancies the specific number of interstitial jumps depends on the 
concentration of the Frenkel defects. Hence, we expect two recovery stages which are well 
separated when the average distance of the Frenkel defect is large compared to r p and overlap 
when both quantities are similar in size. These two stages are identified as I D and I E . 

The curve in fig. 5 does not fall to zero in stage I, which means incomplete Frenkel 
defect recombination in stage I E . Among many possible reasons, two are mainly effective 
in those cases represented by fig. 5: (i) formation of immobile di-interstitials, tri- 
interstitials and so forth, (ii) trapping of interstitials by solutes. A complete description 
of the stage I E recovery must consider these reactions. A review on the theory of 
diffusion-controlled defect reactions has been given by Schroeder [1980]. 

The first systematic quantitative study of stage I D -I E recovery was performed by 
Corbett et al. [1959] for electron-irradiated Cu by applying the numerical description 
of a diffusion-controlled model reaction scheme (Waite [1957]) to suitably measured 
electrical resistivity recovery curves. Excellent agreement between measured data and 
theoretical prediction was obtained. The obtained A H" values are collected in table 6. 

Since then a large number of stage I resistivity recovery investigations have been 
performed. A critical review with respect to the experimental verification of interstitial 
configuration and migration models was given by Young [1978]. The corresponding 


Table 6 

Activation enthalpy of interstitial migration*. 


Metal 

AH™ 

Measurement b 

(eV) 

Reference 

Ag 

0.088 

Ap ta 

Rizk et al. [1977] 

Al 

0.115 

MR 

Spiric etal. [1977] 


0.112 

Apu, 

Rizk etal. [1976] 

Co 

0.1 

MA 

Dander and Schaefer [1977] 

Sulpice etal. [1968] 

Cu 

0.117 

Apu, 

Corbett etal. [1959] 

Ga 

0.073 

Ap ta 

Myhra and Gardiner [1975] 

a-Fe 

0.3 

MA 

Schaefer etal. [1975] 

Mo 

0.083 

APirr 

Kugler etal. [1982] 

Ni 

0.15 

Ap^ MA 

Knoll etal. [1974] 

Pb 

0.01 

Ap te 

Birtcher etal. [1974] 

Schroeder and Schilling [1976] 

Pt 

0.063 

APi,r 

Sonnenberg etal. [1972b] 

Dibbert etal. [1972] 

W 

0.054 

APin 

Dausinger [1978] 

Zn 

0.015 

A p to 

Myhra and Gardiner [1973] 

Zr 

0.26 

Ap ta 

Neely [1970] 


0.30 

MR 

Pichon et al. [1973] 


* Denoted by AH™. 

b See glossary of foregoing tables; in addition MR and MA indicate mechanical relaxation and magnetic after- 
effect, respectively. 
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behavior of bcc metals has been reviewed by Schultz [1987]. Complete data collection 
see Ehrhart [1991] and Schultz [1991]. 

As an example of data obtained by means of a rather elaborate measuring technique, 
fig. 27 shows differential isochronal resistivity-recovery curves for electron-irradiated Pt. 
The influence of the initial defect concentration (given by the initial residual resistivity 
Ap 0 and corresponding to about 10 at-ppm to 1000 at-ppm) becomes quite obvious. The 
quality of the fit of appropriate rate equation solutions to the stage I D -I E recovery is 
shown in fig. 28. The series of figures clearly shows the increasing weight of both 
uncorrelated recovery and suppression of recovery by di-interstitial formation with 
increasing initial defect concentration. The influence of interstitial-trapping solutes is 
shown by i +/ in the upper two figures. 

Table 6 gives the migration enthalpies obtained by this type of investigations. The 
table also includes data obtained by elastic after-effect measurements in A1 (Spiric et al. 
[1977]) and magnetic after-effect measurements in Ni (Knoll et al. [1974]) and in Fe 
(Schaefer etal. [1975]). These properties, measured in single crystal samples at a fixed 
temperature after isochronal annealing periods at increasing temperatures, give informa- 
tion not only on the recombination rate but also on the symmetry and a possible 



Fig. 27. Temperature-differentiated isochronal resistivity recovery curves of electron-irradiated Pt for different 
initial irradiation-induced resistivity increments Ap (after Dibbert etal. [1972]). 


References: p. 1710 





Fig. 28. Experimental resistivity-recovery data of electron-irradiated Pt for different initial Frenkel defect concentrations c 0 , compared with calculated curves. The 
dashed curves give the fractional influence of the different point-defect reactions, (after Sonnenberg et al. [1972b].) 
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orientation relaxation of the migrating defect. In A1 the results show that a defect with 
tetragonal symmetry reorients while migrating, until it recombines with a vacancy. This 
behavior is exactly predicted by the computer experiments with the combined 
translational and rotational jump of the <100>-dumbbell (§3.3.1). The annihilation 
kinetics of the after-effect follows closely that of the resistivity. At temperatures below 
stage I D no reorientation relaxation has been observed which could be ascribed to free 
interstitials. Therefore, a pure dumbbell rotation without migration can be excluded, 
which is again in agreement with the theoretical prediction. 

For Ni, reorientation relaxation of a defect with <100>-symmetry in stage I D has been 
derived from magnetic after-effect measurements. 

For or-Fe, magnetic alignment experiments have identified the <1 10>-symmetry of the 
interstitial (Chambron etal. [1975]) and magnetic after-effect measurements have shown 
the reorientation relaxation during annealing in stage I D (Schaefer etal. [1975]). 

For other metals, including Au (Dworschak et al. [1981]), Cd (Coltman et al. 
[1971], O’Neal and Chaplin [1972]), Nb and Ta (Schwirtlich etal. [1987]) and V 
(Schultz [1987]), no recovery stages which could unambiguously be ascribed to 
correlated and uncorrelated recombination of freely migrating interstitials have been 
detected. On the other hand, long-range mobility of the interstitials below 5 K has been 
concluded from Huang scattering experiments which do show interstitial cluster 
formation during electron irradiation at 5 K in Au, Cd, Mg and Nb (Ehrhart [1978]). 
Many other observations give more indirect hints of long-range interstitial mobility. In 
particular, Au has been studied with respect to this problem in great detail (Dworschak 
et al. [1981] and Seeger [1970]). A stage I E , if it exists, must occur below 0.3 K 
(Birtcher etal. [1975], Schroeder and Stritzker [1977], The unexpectedly large 
displacement distances (Dworschak etal. [1981]) at low temperatures could be due to 
the special coupling of interstitials to the vibrational lattice modes (§3. 3. 1.2). 

As for Mo and W, recovery stages observed around 35 K for Mo (Maury et al. 
[1978]) and 27 K for W (Dausinger [1978]) do show a temperature shift as function of 
the initial defect concentration. But the kinetics are completely different from what is 
observed in Cu, for example. According to Schultz [1987], the observed kinetics would 
be in accordance with a model that allows not only for vacancy-interstitial recombination 
but also for interstitial trapping at certain neighboring sites of a vacancy. By this effect 
the amount of recovery in stage I E will be reduced. At some higher temperature the close 
pair formed by the trapping event is allowed to recombine leading to the typical close- 
pair recovery stage, as indeed observed a few degrees above the quasi-I E stage. Direct 
evidence for this interpretation is, still lacking. 

3.3.2.5. The controversy on the interpretation of the recovery-stage HI. The 
controversy resulted from the fact that the main recovery stage of quenched samples did 
not exactly coincide with stage III in irradiated samples (§2.2. 3. 3). Unfortunately, 
initially the difference was particularly large for Cu, the standard research material of the 
metal physicists. The lack of total recovery at the end of stage III also played a 
significant role in the dispute. The latter problem, however, is of less significance than 
the former and is omitted here. 

The primary source of disagreement is the interpretation of the stage III recovery. 
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The advocates of the “one-interstitial-” or (a better expression) vacancy model explain 
stage HI by diffusion-controlled annihilation of vacancies. The temperature difference of 
the recovery-rate maxima of isochronal annealing curves were ascribed to the different 
number of jumps the vacancies had to perform in order to reach the sinks which are of 
different density and configuration after quenching and irradiation, respectively (§ 2.2.3.3). 

The advocates of the two-interstitial model solved the problem by ascribing stage III 
after irradiation to migration and annihilation of the stable interstitial, and by calling the 
first stage after quenching stage IV and ascribing it to migrating vacancies. The 
interstitial migrating in stage I (fig. 5) was thought to be a metastable configuration. 
Throughout a period of 20 years, the open question of the stage Ill-stage IV origin 
energized research in the point defect field. Considerable improvement of experimental 
methods was achieved in three directions: (i) determination of interstitial properties like 
configuration, dynamic behavior, migration, recombination and clustering behavior, (ii) 
improvement and understanding of the quenching procedure, especially with regard to 
impurity effects, (iii) detection of vacancy migration and clustering (PAC and PAS, see 
§ 2.2.3.2). As an introduction to the controversy the reader is referred to Seeger [1975], 
Young [1978] and Balluffi [1978]. The two latter authors conclude that in many 
metals the experimental observations can satisfactorily be explained by the one-interstitial 
model, i.e., by the vacancy migration in stage III. The experimental basis of the papers 
by Schindler et al. [1978] and Frank et al. [1979], which was taken as a strong 
support of the two-interstitial model by the authors just at the time of Young’s [1978] 
and Balluffi’s [1978] reports could not be confirmed later (Laupheimer etal. [1981]). 

In recent years, the intensity of the debate has diminished although the advocates of 
the two-interstitial model report on new deficiencies of the one-interstitial model and 
successful applications of the two-interstitial model (Kienle etal. [1983], Seeger [1982], 
Seeger and Frank [1983]). The general argumentation regarding interstitial properties 
is, as before, based on observations which cannot be explained by the well-known stage 
I interstitial properties directly. The second set of parameters introduced by the second 
interstitial of the two-interstitial model generally allows easy explanation of such 
observations. 

The key question of the controversy is the consistent explanation of stage III by 
vacancy migration. There are many experimental results reported in the literature which 
favor vacancy migration in stage III (§ 2.2.3). As an example we take results for Au for 
which in specially pre-treated electron-irradiated samples A//J^ = 0.71 eV was found in 
stage III (Sonnenberg and Dedek [1982]). The treatment consisted of high-dose 
a-particle irradiations which resulted in a very high density of dislocation loops. This 
high loop density acting as a high fixed sink density for the migrating stage III defect 
suppresses all other possible reactions of this defect. The model fitted to equilibrium, 
quenching and self-diffusion data by Sahu etal. [1978] yielded A//™=0.83eV. 
According to Seeger and Frank [1983] the differing migration enthalpies are the 
evidence for the migration of an interstitial in stage III with A//” =0.71 eV. To the 
present author this conclusion, although suggestive, might be premature. The quenching 
data yield A//jJ values between 0.55 eV and 0.63 eV, depending on the initial vacancy 
concentration. The result A =0.83 eV is derived by fitting a model which assumes a 
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substantial di-vacancy contribution. Di-vacancies are made responsible (i) for the 
curvature of the c v °(l/7) curve which determines A // 2 * and (ii) for the experimental 
A H{ = 0.62 eV. This model is certainly not unambiguous with regard to its basic 
assumptions. The question whether the above curvature could be due to temperature 
dependence of A h{ n (Varotsos and Alexopoulos [1982]), for example, must carefully 
be reinvestigated as well as the atomic processes during annealing of quenched resistivity 
samples. The electron-microscopic work on quenched gold clearly demonstrates the 
strong tendency for vacancy agglomeration. At the same time, there are reliable 
indications that the electrical resistivity does not change during the formation of small 
clusters containing not more than about five vacancies. Hence, resistivity recovery 
experiments can show only a later state of a complex vacancy-annihilation regime with 
certainly time-dependent sink concentrations. 

3.3.2.6. Dynamic properties. The dynamic behavior of the self-interstitial is perhaps 
the property most sensitive to the configuration. As has been shown in §3.3. 1.2, the 
decrease of the elastic modulus with increasing interstitial concentration played a key role 
in the understanding of the nature of interstitial structure. After a number of modulus 
measurements in polycrystals which demonstrated the negative sign and the large 
absolute value of the modulus change (KOnig etal. [1964], Wenzl etal, [1971]), the 
first measurements in single crystals containing randomly distributed single Frenkel 
defects have been reported by Holder etal. [1974] on thermal-neutron-irradiated Cu and 
by Rehn and Robrock [1977] in electron-irradiated Cu. The data obtained by the former 
authors are shown in fig. 29, indicating the largest effect, for c M . An interstitial 
concentration of 300 at-ppm softens the crystal by A c^/c^ — 1%. If the concentration of 
single interstitials were to be enhanced up to 3 % (limiting factor: spontaneous 
recombination, see §3.1.2) and the softening proceeded linearly, the crystal would be 
unstable against a <100> shear stress ( 0 ^ = 0). Quenched-in vacancies were found to 
produce much smaller changes of the elastic constants (Folweiler and Brotzen 



Fig. 29. Elastic moduli of Cu during electron irradiation at 4 K (after Holder et al. [1974].) 
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[1959]). The sequence of decreasing softening for C and K can be understood in terms 
of the dumbbell interstitial. In the case of <110> shear stress which measures 
C' = (c 11 -c 12 )/2, the bending spring of the dumbbell coupling is not activated, as can be 
seen in fig. 24, but the bending springs of the interaction with the nearest neighbors of 
the dumbbell are operating. As the equilibrium distance is larger than that between the 
dumbbell atoms the force-constants are smaller and hence the softening is less pro- 
nounced. This effect is even smaller for hydrostatic compression (bulk modulus k ). Data 
for A1 are given by Robrock and Schilling [1976]. In the bcc Mo the largest softening 
has been observed for C (Okuda [1975]), which shows the <110>-symmetry of the split 
interstitial in this metal. 

The large change of the elastic constants should significantly influence the phonon 
dispersion curves (Wood and Mostoller [1975], Schober etal. [1975]). Such an effect 
has indeed been observed by Nicklow etal. [1979]. Schober [1989] has pointed out 
that irradiated crystals with their self-interstitials show many features which are known 
as low temperature glass anomalies (low-frequency vibrations, relaxation processes and 
general two-level-system behavior). Granato [1993] proposes the self-interstitial to be 
the crucial defect which determines all thermodynamic properties of each of the 
crystalline, liquid or amorphous states, as well as their changes upon melting. 

3.3.2.7. Interstitial agglomeration. The formation of small multiple interstitials, even 
di- or tri-interstitials, has been concluded from measurements of the diffuse X-ray 
scattering, Huang scattering (§3. 3.2.2) and mechanical relaxation measurements 
(§ 3. 3.2.6). In fig. 30 diffuse X-ray scattering measurements on electron-irradiated A1 
subsequently annealed to 40 K are compared with calculations for two di-interstitial 
models. Obviously the parallel dumbbell di-interstitial is formed in A1 at the end of 



Fig. 30. Comparison of the diffuse X-ray-scattering intensity measured in A1 electron-irradiated at 4 K and 
subsequently heated to 40 K, with calculated curves for two different di-interstitial configurations. Numbers 1 
to 4 as in fig. 25. (after Ehrhart et at. [1974].) 
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recovery stage I. This configuration was predicted by computer simulation for a Cu 
potential (§3.3. 1.4). 

For the Huang scattering intensity the formation of clusters consisting of n interstitials 
leads to (c < /ri)(NAV i re> ) 2 = nc i (NAV i rel ) 2 instead of c d (JVAV ( " rf ) 2 for single interstitials of 
concentration c d . This enhancement is predicted for a linear superposition of the atomic 
displacements in the cluster. The effect of the actual relaxation upon clustering seems, 
however, to remain smaller than the gain by the factor n (Ehrhart and Schilling 
[1974]). The cluster formation can easily be monitored, as the wavevector dependence of 
the intensity according to q 2 is substituted by a q~* dependence in the upper q range. 
From the boundary of the q 2 validity the cluster radius can be estimated. For Al, with 
the same experimental conditions as in fig. 26, n = 2 was found in agreement with the 
above result. Between 40 K and 45 K, n was found to increase from 2 to 4 (see fig. 31). 
For Cu it has been found that already at the end of recovery stage I, small loop-shaped 
clusters with n = 5-10 exist. The lack of clusters smaller than with n = 5 leads to the 
conclusion that in Cu, di- and tri-interstitials are mobile in the same temperature region 
as the mono-interstitial is. This behavior was also found for di-interstitials by the 
computer simulations (§3.3. 1.4). More information on small interstitial clusters in Au, 
Cd, Co, Nb, Mg, Zn and Zr from Huang scattering is given by Ehrhart and SchOn- 
feld [1982]. The strong interstitial clustering found for Au already at 10 K (fig. 31) is 
in accordance with the interstitial mobility already found at 0.3 K (Birtcher et al. 
[1975]. In fig. 32 we see the strong reduction of the interstitial clustering in dilute alloys 
when compared with the pure solvant. 

In pure metals, the interstitial clustering produces the characteristic feature of the 
stage II recovery as seen in fig. 33. Similar behavior is found for material cold-worked 
at low temperatures (fig. 34). This continuous resistivity decrease with approximately 
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Fig. 31. Average interstitial cluster sizes as function of the annealing temperature for Au, Cu and Ni and initial 
defect concentration of about 300 ppm (after Ehrhart et al. [1986]). 
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Fig. 32. Average interstitial Cluster sizes for dilute Ni alloys as function of the annealing temperature after 
3 MeV electron irradiation to 300 ppm defect concentration (after Ehrhart [1991]). 


constant rates through relatively large temperature ranges (nearly two orders of magni- 
tude for Au, Dworschak etal. [1981]) must be due to interstitial rearrangements rather 
than due to recombination. Constant recovery rates mean “white” spectra of activation 
enthalpies for the rate controlling processes, in the case of Au ranging through nearly 
two orders of magnitude. Although this observation involves substantial information 



Fig. 33. Isochronal recovery (10 min annealing period) of the electrical resistivity of electron (1-3) and neutron 
(4) irradiated Cu. Radiation-induced resistivity increase: 1-0.157 nflcm, 2-3.03 nflcm, 3-5.9 nftcm, 4-0.0186 
nficm (after Ehrhart [1991)). 
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Fig. 34. Isochronal recovery of the electrical resistivity for Cu plastically deformed at -195°C to different 
strains e. The given data are normalized by p at 200°C (after Van Den Beukel [1970]). 


about cluster mobility and reactions (rearrangements), this urgently needed information 
(see §4.2.2) could not be utilized yet because of the unknown resistivity change per 
interstitial in the various cluster arrangements. 

3.3.2.8. Interstitial-solute interaction. Comparison of the stage I recovery of pure 
metals with that of impure ones showed that solutes trap migrating self-interstitials within 
certain temperature ranges (Schilling etal. [1970]). If this trapping was due to a long- 
range interaction between solutes and interstitials this interaction should be of elastic 
nature, because of the effective screening of charges by the conduction electrons. The 
interaction potential would then be described to a first approximation by eq. (9). The 
overall effect of solutes on migrating interstitials is an attractive one, despite the 
existence of repulsive directions (Profant and Wollenberger [1976]). The drift- 
diffusion of interstitials in the interaction potential has been treated in detail by 
SCHROEDER [1980]. 

The long-range interaction causing drift diffusion cannot reliably be applied at 
distances between solute and interstitial of three atomic distances or less. Answers to 
questions such as the configuration of an interstitial placed close to a solute or the 
binding energy of this pair must be derived from microscopic calculations. A first 
attempt was made by Dederichs etal. [1978], who simulated a solute in Cu in computer 
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experiments by simply shifting the Cu-Cu interaction potential by a certain distance R 0 . 
Enlarging the equilibrium distance of the neighbor simulates oversized solutes, diminish- 
ing it simulates undersized solutes. The model predicts a number of interesting results. 
For undersized solutes it shows the formation of a mixed dumbbell. Solute and one 
solvent atom share one lattice position in the <100> split configuration. By oversized 
solutes the self-interstitial dumbbell is trapped at different neighboring lattice positions 
of the solute atom, with different binding energies, of the order of magnitude of 0.1 eV. 

The interaction energies in saddle-point configuration of the self-interstitials have not 
been reported. The saddle-point energies for different types of jumps of the mixed 
dumbbell atoms have been investigated with the result shown in fig. 35. The three 
different kinds of jumps are illustrated in fig. 36. As A Hj" =0.1 eV, activation enthalpies 
for cage motion are very small according to fig. 35. For volume size misfits larger than 
about 10%, the mixed dumbbell should be able to rotate before dissociation occurs. Since 
subsequent rotational and caging jumps allow long-range migration of the mixed 
dumbbell (see fig. 36) activation enthalpies of migration smaller than that of the vacancy 
are to be expected. Such conditions are indeed found experimentally with considerable 
practical implications for solute transport (§ 4.7). 

Similar calculations have been performed for different solutes in A1 by using inter- 
ionic potentials derived from first principles (Lam et at. [1980, 1981]). The pair 
potentials for the Al-Al and Al-solute interaction extended up to the nineth neighbor. For 
the solute Zn, mixed dumbbell formation was found with a binding energy of 0.38 eV. 



/ R„ / * 


Fig. 35. Normalized enthalpies of mixed dumbbell dissociation, rotation and cage motion according to computer 
simulation of Cu (Dederichs etal. [1978] versus size-misfit. The ratio ft/ R a , of the potential shift r D ) divided 
by the equilibrium atom distance R a of Cu must be multiplied by about six in order to obtain the volume size- 
misfit comparable with experiments. 
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Fig. 36. Elementary jumps of a mixed dumbbell: (a) Jump of the solute; (b) by cage motion of the solute 
around the octahedral site the six mixed dumbbells shown are formed consecutively; (c) jump sequence of 
solvent atoms and solute which in case of looping leads to migration steps of the solute; (d) rotation (from 
Dederichs etal. [1978]). 


For Be, Ca, K and Li, maximum binding energies between 0.62 eV (K) and 1 .75 eV (Li) 
were found for different configurations but in no case a. split configuration with solvent 
atom and solute sharing a lattice site was found. For Mg no binding was found. The 
saddle-point energy for mixed-dumbbell migration was found to be much larger than this 
dissociation energy. 

The experimental situation of the solute-interstitial interaction has been reviewed by 
Wollenberger [1978]. Since then, a number of observations have been reported which 
indicate the existence of quite a number of different solute-interstitial complex configur- 
ations with different physical properties in one and the same irradiated dilute alloy. These 
results are reviewed by Robrock [1983]. 

Trapping of migrating interstitials by solutes prevents the interstitials from 
recombining with vacancies. This can clearly be seen in the isochronal resistivity 
recovery curve, as examplified in fig. 37. The presence of 112 at-ppm Au in Cu 
suppressed recovery in stage I E . The recovery is retarded and occurs stepwise around 
60 K and between 90 K and 120 K. Beyond this the resistivities of dilute alloy and pure 
solvent are equal, indicating that the Au atoms no longer trap interstitials. By fitting 
diffusion models to the recovery curves in the stage I D -I E range the reaction, capture or 
trapping radius of the solute for migrating interstitials is obtained. Appropriate evaluation 
of the final recovery stage yields the activation enthalpy for dissociation of the complex. 
Activation enthalpies obtained this way for different solutes in Cu are shown in table 7. 

The final recovery stage could also be caused by the solute-interstitial pairs becoming 
mobile as a whole and migrating until the interstitial recombines with a vacancy. Then 
the activation enthalpy derived from this recovery stage would characterize the saddle 
point for the complex migration. The resistivity recovery experiments do not discriminate 
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Fig. 37. Isochronal resistivity recovery of Cu and dilute Cu-Au electron-irradiated at 4 K (after Cannon and 
Sosin [1975]). 

between these two types of reactions. This question is particularly interesting for the 
solutes Co, Fe, Ge, Ni and Zn in Cu for which only an upper limit for the binding 
energy can be deduced. In these cases trapping was not observed at all. This might be 
due to migration of the complex already at the lowest temperature investigated (55 K). 

Solutes in A1 seem to cause even stronger binding since the recovery curves of dilute 
alloys do not coincide with that of the pure A1 before stage III recovery, i.e., before 
vacancies start moving (Wollenberger [1975], Dworschak et al. [1976]). The 
quantitative evaluation of resistivity measurements with respect to reaction radii is 
somewhat handicapped by the fact that the resistivity does not give any hint on defect 
reactions occurring if they are not accompanied by resistivity change. But whenever the 
resistivity decreases, we simply interpret the change as a result of recombination. 
Reactions which must Carefully be considered in this respect are formation of mobile or 
immobile di-interstitials which seem not to be accompanied by a measurable resistivity 
change but do change the defect reaction scheme significantly. Configurational changes 
of interstitial-solute pairs, on the other hand, could be accompanied by resistivity 
changes. Careful choice of the experimental conditions can to some extent help to 
exclude undesired reactions. The significance of di-interstitial formation in this respect 
has recently been discussed in some detail by Maury etal. [1980]. 

The simplest reaction scheme in view of the solute-interstitial trapping is obtained by 
electron irradiation at temperatures corresponding to the recovery stages I E and II. A 
large number of capture radii have been determined by resistivity-change rate measure- 

Table 7 


Activation enthalpies of self-interstitial solute pair dissociation \ 


Metal 

Ag 

Au 

Be 

Co 

Cr 

Fe 

Ge 

In 

A7f d *“ (eV) 

0.40 

0.33 

>1 

<0.15 

0.36 

<0.20 

<0.20 

0.44 

Metal 

Mg 

Mn 

Ni 

Pd 

Sb 

Si 

Ti 

Zn 

A/f dis ’ (eV) 

0.43 

0.32 

<0.15 

0.30 

0.32 

0.44 

0.40 

<0.20 


a Denoted by values according to Wollenberger [1978], 
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merits in the stage II recovery range. For Cu and A1 as solvents, the so determined 
capture radii of nearly all solutes are found to depend on temperature as T 2 . If the 
capture radii were determined by drift diffusion of the interstitial in the long-range 
interaction field, the potential energy should depend on distance R as R~ U2 in order to 
explain the T 1 dependence (Schroeder [1980]). No physical reason can be given for 
such a long-range potential. As for the recombination radius, a 7~ l/3 dependence was 
found (Lennartz et al. [1977]), as expected for the elastic dipole interaction. The 
curious T 2 dependence might arise from the fact that the trapped interstitials are 
detrapped from varying well depths during the period of measurement. By assuming the 
appropriate number of trapping sites and interstitial transition rates between them, the T 2 
dependence was quantitatively explained for Au in Cu as an example (Abromeit and 
Wollenberger [1983]). 

The absolute size of the effective capture radii is derived in the order of a few atomic 
distances for Cu, and for Al as solvent about twice as large. The sizes found in Cu 
disagree with the elastic interaction theory by a factor of about two. For Al, however, 
which is nearly isotropic, there is a disagreement of approximately one order of 
magnitude. The disagreement in the case of Al indicates that this type of theoretical 
description is not correct. It should be noted that diffusion jumps of the dumbbell 
interstitial in the vicinity of a solute have not been investigated by computer experiments 
until now. 

While resistivity measurements have yielded substantial information on the trapping 
behavior of solutes in terms of capture rate constants, methods such as ion-channeling 
measurements, Mossbauer effect analysis, perturbed y-y angular correlation and 
mechanical relaxation measurements have extended our knowledge of configurations and 
motion characteristics of solute-interstitial pairs. 

The ion-channeling method has been reviewed by Swanson etal. [1978] and Howe 
and Swanson [1982], It is the most direct method for localizing impurity atoms in a 
host lattice. Whenever solutes are displaced from regular lattice sites by trapped 
interstitials, ion channeling, in principle, can detect them and measure the displacement 
distance. Most of the solute-interstitial pair investigations have been performed by means 
of Rutherford backscattering of channeled ions. This method has the limitation that it has 
sufficient sensitivity only for solute masses large compared to the solvent mass. 
Consequently this method has been applied mainly to dilute Al alloys. The existence of 
<100> mixed dumbbells has been concluded for the investigated alloys with undersized 
solutes and no solute displacement was found for oversized ones. By using proton- 
induced X-ray emission as an indicator for the presence of impurities in interstitial 
positions (Ecker [1982]) the mass difference problem is avoided. 

It should be kept in mind that channeling measurements can unambiguously identify 
the interstitial position of a solute only when the solute does not occupy more than one 
type of position. Data evaluation in case of simultaneous presence of more than one 
position becomes ambiguous and in practice very uncertain for even one of the assumed 
positions. Besides the different configurations possible for the solute-one-interstitial 
complex, the trapping of more than one interstitial per solute could lead to different 
interstitial positions of the solute. The latter problem can be solved, in principle, by 
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careful investigations of the defect-concentration dependence of the solute position 
(Muller [1986]). 

The mechanical relaxation methods measure the damping of applied elastic vibrations 
caused by the stress-induced reorientation of solute-interstitial complexes (see reviews 
by KronmOller [1970], Nowick and Berry [1972]). By investigating single crystals 
the symmetry of the complex can be derived. Complexes of different configurations are 
expected to show different activation enthalpies for reorientation and migration. For In 
in Cu, for example, not less than six different damping maxima are obtained (Rollers 
etal. [1981]), pointing to six different complex configurations. Four of them annihilate 
close to the final resistivity recovery stage of this alloy (Dworschak etal. [1978]) with 
small differences in annihilation temperature among each other. The remaining two 
annihilate at considerably lower temperatures. A similar multiplicity of reorientation- 
damping peaks and annihilation temperatures has been found in dilute A1 alloys 
(Robrock [1982], Hultman etal. [1981]). It has also been shown that the activation 
enthalpy for re-orientation significantly depends on the solute species (Granato et al. 
[1982]). 

For Fe in Al a re-orientation process occurs at 8 K with an activation enthalpy of 
about 13 MeV (Rehn etal. [1978]). It has been identified to arise from the cage motion 
detected earlier by Mossbauer studies (see below). Some disagreement regarding details 
of the cage geometry as concluded from internal friction (Robrock and Schober 
[1981]) and Mossbauer spectroscopy (Petry et al. [1982]) shows progress and limits 
reached with the application of these methods. 

By Mossbauer effect studies, the cage motion of a solute which has trapped a self- 
interstitial has been detected. In a dilute alloy of 57 Co in Al or Ag irradiated at low 
temperatures and subsequently annealed through recovery stage I or irradiated at 
temperatures within stage II, a second Mossbauer line of the 57 Co ( 57 Fe) transition was 
observed besides the well-known one arising from the solute dissolved substitutionally 
(Vogl and Mansel [1975]). From the isomer shift of the new line it must be concluded 
that the Mossbauer isotope is placed closer to the next solvent atom than a nearest- 
neighbor distance. Hence, it must be situated in the close vicinity of an interstitial atom. 
The Debye-Waller factor / of this new line shows an unusual temperature dependence, 
as shown in fig. 38. It decreases between 10 K and 20 K (for Al) by about 80%. This 
decrease can be measured reversibly so long as the sample is not heated into recovery 
stage HI (Al) or to 100 K (Ag), i.e., until the trapped interstitials are annihilated. It has 
been quantitatively explained by a cage motion of the Mossbauer isotope in the cage 
shown in fig. 36b (Vogl et al. [1976]). The essential origin of the reversible Debye- 
Waller-factor drop is a locally restricted motion of the Mossbauer isotope within the 
lifetime of the Mossbauer quanta (10' 7 s). The low-temperature range of its occurrence 
fits well to the theoretical result in fig. 35. 

Mossbauer studies have also demonstrated the existence of more than one solute- 
interstitial-complex configuration. In irradiated Mo- 57 Co five different new Mossbauer 
lines were resolved (Mansel et al. [1982]). 

The PAC technique introduced in §2.2.3.2 has also been applied to questions of 
interstitial-solute interaction (§2.2.3.2). Again, a variety of different configurations of 
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Fig. 38. Debye-Waller factor as derived from Mdssbauer effect measurements in 57 Co in electron-irradiated A1 
and Ag. Irradiated and unirradiated states are indicated with “+ interstitial” and “substit.”, respectively, (after 
VoGL etal. [1976].) 


interstitial probe atom pairs must be concluded from the results for certain dilute alloys. 

The application of probe atom methods still seems to possess considerable potential. 
A systematic application to single crystal samples could give more information on the 
symmetry of the complexes and systematic studies of the kinetics could give valuable 
information on the interstitial population of the complexes. 

4. Miscellaneous radiation effects 

4.1. Introductory remarks 

The information collected in § 3 results from research which was to substantial degree 
carried out in order to solve problems of nuclear technology. Therefore, it seems to be 
reasonable to extend the considerations in § 3 beyond the limits of “simple” conditions 
for the defect production and the properties of single defects or small clusters towards 
more complex conditions common for technical processes. In § 4.2 the defect production 
by neutron and heavy ion irradiation is treated with special emphasis to the cascade 
phenomenon and its consequences for the formation of defect clusters and single defects. 
The role of the electronic interaction of energetic ions with metallic materials is 
described in §4.3 by looking at the influence of incident swift heavy ions on the 
stability of Frenkel defects. In §§4.4 and 4.5 the dimensional instability of materials 
under high fluence irradiations at elevated temperatures is treated. This phenomenon is 
one of the most serious problems in the technology of fast fission and a prospective 
fusion reactor. The multiple reaction regime built-up by the defect production, agglomer- 
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ation, trapping, and annihilation allows for self-organization of vacancy clusters. The 
boundary conditions for this reaction mode are treated in § 4.6. The coupling of defect 
fluxes with atom fluxes leads to substantial locally varying compositional changes and 
eventually to phase transformations. This topic is treated in § 4.7. 

4.2. Frenkel defect production by neutrons and ions 

4.2.1. Atomic displacement cross-section and cascade formation 

Electron irradiation is the basic method of Frenkel defect production when properties 
of single Frenkel defects are to be studied. Nuclear reactor technology introduced 
neutron-irradiated materials and raised the question about the underlying damage 
structure. Neutrons are “heavy” projectiles as compared with electrons. The energy 
transfer to target atoms for head-on collisions is given by the classical formula 
T m!a - AMmE/ (M+mf, where E is the projectile energy, and m and M are the masses of 
projectile and target atom, respectively. Reviews on radiation-solid interaction are by 
Lehmann [1977], Leibfried [1965] and Seitz and Koehler [1956], For Cu we have 
T m ~ E / 16 and the average recoil energy <T> = 5x 10 4 eV for 1.6 MeV fission neutrons. 
This energy is three orders of magnitude larger than the displacement threshold energy 
(§3.1.2), which leads to damage features completely different from those of electron 
irradiation. Before looking more closely at details of this type of damage, we realize that 
bombarding a sample with ions of the sample material (called self-ion irradiation) having 
5x 10 4 eV incident energy must have a similar effect as the fission neutron irradiation. 
An incident ion behaves like a lattice atom recoiling from a fission neutron collision. The 
only difference being: the lattice atom starts from a lattice site while the incident ion 
moves interstitially. This difference is of importance only when the primary collision is 
to be considered. For the secondary and all subsequent collisions the nature of the 
projectile is irrelevant. 

Self-ion irradiation is highly welcome as a form of simulation for neutron irradiation 
because any measurement on ion irradiated samples is much easier than on often highly 
radioactive neutron irradiated ones (Ishino etal. [1977]; Gabriel etal. [1976]; further 
reading: Stiegler [1976], eds. W. V. Greens al. [1990], Singh et al. [1990]). More- 
over, ion irradiations allow one to achieve given fluences (damage levels) in much 
shorter times than neutron irradiations do. This acceleration of the defect production 
shifts (among others) the dominance of the defect annihilation at sinks towards that by 
recombination and may thus impair correct simulation of more complex processes like 
swelling (§4.4) and phase transformations (§4.7). In practice, a self-ion energy of 
5 X 10 4 eV would not be appropriate for damage production in the bulk of samples as the 
penetration depth of such ions measures only a few atomic distances. With self-ions of 
0.3-1 MeV energy, samples for electron microscopy (100-200 nm thickness) or electrical 
resistivity measurements can be homogeneously damaged. If even thicker samples are 
required, as for mechanical property measurements for example, neutron irradiation is 
simulated by high-energy light-ion irradiation. Light ions are protons, deuterons and a- 
particles and applied energies are 20 MeV, for example. For such projectiles the 
penetration depth is of the order of 10-100 /um . The induced recoil energy spectra of the 
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target atoms are then different from those caused by reactor neutrons. Simulation is no 
longer straightforward. 

Lattice atoms which obtained recoil energies as large as 100 keV act themselves as 
projectiles in the lattice and release their primary recoil energy by collisions with other 
lattice atoms. In this way a primary knock-on-atom initiates a collision cascade which, 
according to the transferred energies, leads to a displacement cascade which, in turn, 
ends up in a defect cascade. The ranges of Cu + ions with energies 5 x 10 4 eV, 2x 10 4 eV, 
and 5x 10 3 eV in Cu are 65 nnd, 31 nnd, 13 nnd, respectively. Hence, the density of 
collisions and defects strongly increases towards the end of the cascade. 

As long as the collisions can be treated as binary collisions the process can analyti- 
cally be described by a linear Boltzmann-type transport equation, as was originally done 
by Kinchin and Pease [1955]. These authors obtained the ratio n - T/2T d , where T is the 
energy of the primary knock-on atom displacing n atoms in a cascade with recoil 
energies larger than the displacement threshold T d . This treatment can only be a first- 
order approximation as the binary collision model breaks down as soon as the recoil 
energies become of the order of 10 4 eV. The multiple-collision events in this energy 
region form so-called displacement spikes, illustrated in figs. 39a, b. The corresponding 
result for n as obtained from computer simulation reads n = 0.8 e (T)/2T ieff for 
T> 2.57’ deff , n= 1 for T dtft < T < 2.5T ieff and n = 0 for T< T ieff , where e(T) is the damage 
energy, which is the total energy T diminished by the losses due to electron excitations 
(Norgett etal. [1974], §4.3). These excitations result from the Coulomb interaction of 
the moving ion with the electrons of the solid. By electron-phonon interaction the energy 
is lost as heat. In Ni, for example, the excitation losses become appreciable around 
10 4 eV and measure about 60% of the total energy at 10 6 eV. By taking into account the 
number of Frenkel defects produced per primary recoil atom, we obtain for the total 
displacement cross-section: 

cr d {E) = \l^^^n{T)dT, (10) 

where E is the energy of the incident particles (neutrons or ions) and da/dT is the 
differential cross-section of a target atom to obtain energies T by collisions with the 
incident particles, and T ijeSS an effective threshold energy suitably chosen to replace the 
anisotropic threshold-energy surface by a spherical surface (§3.1.2). For neutrons and 
ions the differential cross-sections are entirely different functions of T. 

The validity of eq. (10) for ion irradiation on Cu has extensively been studied by 
means of electrical resistivity measurements for a wide range of recoil energies by 
Averback etal. [1978]. The important finding is that for 4.2 K irradiation temperature 
the quantity <r dmeas /cr dcalc =^, called the efficiency, falls from unity for recoil energies 
around 10 2 eV to 0.5 around 10 3 eV and to 0.35 above 10 3 eV. As the effective 
threshold r dieff for Cu amounts to about 50 eV (Jung [1991]) n values around 1 and 10 
correspond to the above quoted recoil energies of 10 2 and 10 3 eV. The decrease of £ 
below unity already from «=1 on indicates a substantial defect loss in reality. The 
sequence of images in fig. 39 shows that also in the computer model the number of 
Frenkel defects is strongly reduced by recombination as the collision cascade decays. 
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Fig. 39. Computer simulation of a displacement cascade with 7=25 eV in Cu. Shown is the ( 100) plane. The 
primary recoil impact direction was <1 10> given to the atom at that now vacant site in the cascade area which 
lies closest to the left lower comer of fig. 39a at t = 0. Atoms imaged by bright dots have more than 0.5 eV. 
In (b): t = 6x KT 14 s (real time); in (c): r = 3.5x 10“ 13 s; in (d), when t= 1CT 12 s a stable configuration is reached, 
consisting of five Frenkel defects, (from Lehmann etal. [1981].) 


This process has been studied in some detail by molecular dynamics calculations (MDC), 
(reviews: English etal. [1992] and Diaz De La Rubia and Guinan [1992]). Since the 
first application of MDC to point-defect production by Gibson et al. [1960] and 
Lehmann etal. [1981] (fig. 39), the method has considerably been improved, (i) by 
implementing more realistic isotropic manybody potentials as calculated by the embedded 
atom method (Foiles et al. [1986]) and (ii) by the use of modem supercomputers with 
vector architecture and (iii) by including the phonon-electron coupling (Caro and 
Victoria [1989], Finnis et al. [1991]). As a result, calculations became possible up to 
primary recoil energies of 25 keV applied to one of the atoms of a crystal block 
containing up to 5 x 10 5 atoms. These calculations generally confirm what is seen in fig. 
39 but uncovered a large amount of details. In regard to our question about efficiency £, 
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it is of fundamental importance that the cascade core assumes liquid structure as shown 
in fig. 40 for an intermediate time period. The atom density in fig. 40a was derived from 
a snap-shot of the cascade core at the given time. The density distribution in fig. 40b 
represents that of liquid Cu. The atom density in the core reaches only 80% of that of 
liquid Cu at 2200 K for which the radial distribution function matches the one observed 
in the cascade core. The lacking atoms are distributed around the liquid drop volume. 
The studies have further shown how vacancies and interstitials recombine mutually and 
agglomerate within each of both populations. As a result, the production efficiency £ was 
obtained in good agreement with the measurements. The strong decrease between 10 2 and 
10 3 eV coincides with the increasing importance of the thermal spike within the cascade. 

The MDC studies further show that enhancing the temperature of the cascade 
surrounding matrix enlarges the cascade cooling period with significant effects on the 
defect agglomeration (§ 4.2.2), i.e, on the escape from recombination. For Cu by counting 
the vacancies agglomerated in loops an increase of £ by a factor of two was observed when 
going from 4 K irradiation temperature to room temperature (Jenkins etal. [1993]). 

Cascades for recoil energies in the order of magnitude 10 5 eV are computer-simulated 
by means of binary collision codes. Figure 41 gives a three-dimensional impression of 
a computer simulation result. Two features are typical and important: (i) The vacancies 
are frequently clustered and are surrounded by single interstitials. The picture was 
postulated earlier by Seeger [1958]. (ii) The defect density strongly fluctuates and the 
structure could be described as a complex of subcascades. This feature is typical for 
high-energy cascades. 

The defect cascades can be imaged by electron microscopy (reviews: Jager [1981], 
Kiritani [1982] and Shimomura et al. [1982]) and field ion microscopy (reviews: 
Wagner [1982] and Seidman [1978]). A particular enlightening example of electron 
microscopy results is shown in fig. 42. By producing collision cascades in long-range 
ordered Cu 3 Au, disordered volume regions are created which are the images of the 
collision cascades. By using the superlattice reflections and dark-field imaging, the 
disordered zones appear as dark contrast within the brightly imaged ordered matrix. 




Fig. 40. Radial atomic distribution functions for Cu (a) the core of a 2 keV cascade after 0.6 psec, (b) liquid 
at 2200 K after English etal. [1992]. 
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Fig. 41. Computer simulation of a displacement cascade with 7=2 x 10 5 eV in a Cu model. The primary knock- 
on occurred in the lower left. The squares are vacancies and the (smaller) dots are interstitials. Cube edge 
measures 125 nearest-neighbor distances, (from Heinisch [1981].) 


Subcascade formation can clearly be seen in fig. 42b. The average cascade diameter was 
found to lie between 3 nm at 7= 10 keV and 12 nm at 7’=100keV, in satisfactory 
agreement with calculated values (Sigmund [1972]) for T< 50 keV. This example shows 



20nm 

i 1 


Fig. 42. Disordered zones in long-range ordered Cu 3 Au irradiated with Cu* ions of different energy: (a) 
T= 5x 10* eV; (b) T= 10 5 eV; (c) separated subcascade at T= 10 5 eV (from Jenkins and Wilkens [1976]). 
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the direct phase transformation by cascade-type damage. Another important aspect of the 
resulting defect cluster structure is their nature as point-defect sink. Both aspects are 
treated in more detail in §§4.2.2 and 4.6. 

4.2.2. Intra-cascade defect reactions 

Since about 1990, a predominant fraction of the fundamental radiation damage studies 
refer to intra-cascade defect reactions. This interest arises from a considerable amount of 
data, which could no longer be explained satisfactory by the chemical rate equation 
models with the simplifying assumptions regarding homogeneous single defect production 
and reactions. Challenging problems were the damage correlation for ion and neutron 
irradiation (see further reading: Singh etal. [1990]), extreme high swelling rates (Woo 
etal. [1993]), fluctuating cluster populations in time (Ishino [1993]), and the evolution 
of loop and void concentrations with increasing fluence (Kiritani [1993], Singh and 
Zinkle [1993], Abromeit [1994]). 

The detailed MDC studies have first led to quantitative understanding of the atomic 
mixing by the thermal spike process and the molten zone (Diaz De La Rubia and 
Guinan [1992]). Secondly, they revealed valuable information on the cascade collapse. 
The solidification process of the molten zone determines the configuration of the vacancy 
agglomerate collected in the central part of the cascade. The number of vacancies 
collected depends sensitively on the occurrence of collision chains, which transport 
interstitials out of the molten zone. The replacement chains, however, were found to be 
comparatively short. They are more important at lower temperatures and for lower 
cascade energies. For higher cascade energies, ballistic vacancy-interstitial separation 
was found to be more important (Jenkins etal. [1993]). Frequently, the ballistic ejection 
of interstitials from the cascade core was followed by interstitial clustering at the cascade 
periphery. These small interstitial clusters in general had Burgers vectors tangential to the 
cascade periphery allowing them to escape from the cascade region by thermally-assisted 
glide. This mechanism might be critical in preventing recombination and thus enhancing 
vacancy survival. It initiated the formation of new swelling models (§ 4.4). 

The fraction of single vacancies and interstitials, which may escape the intra-cascade 
reactions cannot yet be derived from MDC simulations with satisfactory accuracy 
because of poor statistical relevance. The binary collision codes with slightly better 
statistical relevance yield substantially more free interstitials (14% of total defect 
production) than free vacancies (3.5%) (Heinisch [1990]). 

It should be noted that the MDC simulations give very little evidence for vacancy 
loop production during the cascade event. However, loop formation was observed by 
MDC when areas of high vacancy concentration are subjected to intense thermal motion 
(Kapinos etal. [1990]). 

There is still a substantial gap between the few cascades studied by MDC and the wealth 
of experimental data on cascade collaps for ion and neutron irradiation. The quantities 
frequently evaluated are the defect yield Y and cascade efficiency e. The defect yield relates 
the number of visible vacancy loops per unit area to the ion flux and the efficiency relates the 
number of vacancies ascribed to the observable loops to the number of vacancies calculated 
per cascade according to eq. (10). The number of vacancies per loop is difficult to determine 



Ch. 18, §4 


Point defects 


1689 


because of problems with the image conditions for the loops. In addition, the sample 
surface as defect sink strongly influences the yield. Figure 43 shows the defect cluster 
distribution in a sample with variable thickness. The lack of vacancy clusters in the foil 
center is certainly due to interstitials annihilating clusters, while the interstitials are lost 
to the surface from the adjacent layer. For ion irradiated samples, which are generally 
studied in the thin geometry, the defect yields amount to a few percent for Cu and to 
around 10% for Ni when the incident ion energies range from 10 to 50 keV. 

For any kind of damage model the loop production' rate is one of the key quantities 
as it is essentially a production rate of sinks and of dislocations which may contribute to 
the microstructure controlling the mechanical properties of the material. At temperatures 
beyond 0.5 T m the loops act as vacancy sources and thus affect the effective defect 
production. 

The problems of reactor technology are relevant for fluences in the dpa range from 
1 to 100. Therefore, the question of the interaction of cascades with remnants of formerly 
produced cascades becomes very important. In fig. 44 we have the fluence dependence 
of the cluster density for neutron irradiated Cu at different irradiation temperatures. The 
upper solid curve reflects the behavior of fission, spallation, and fusion neutrons, i.e., for 
rather different neutron spectra. The cluster production rate vanishes at fluence levels 
between 10 2 and 10“‘ dpa. Obviously, cascades overlapping the existing clusters induce their 
annihilation. This effect was indeed observed by MDC simulations (Jenkins etal. [1993]). 
Deviations from the linear dependence of the cluster density on the fluence are explained by 
cluster annihilation due to freely migrating interstitials. The temperature dependence of 
the cluster density shows Arrhenius behavior caused by the vacancy evaporation. 

Experimental information on the fraction of single vacancies and interstitials which 
escaped reactions with their parent defect cascade comes from measurements of damage 
rates (Theis and Wollenberger [1980]), mass transport (Rehn and Birtcher [1993]) 
and swelling rates. Evaluation of the mass transport data yield fractions of less than 5% 
for Ni, for example, when equal fractions for vacancies and interstitials are assumed. For 
the explanation of the observed maximum swelling rates fractions beyond 7% are needed 
when the conventional single defect dislocation bias is presumed. This discrepancy lead 
to the detailed consideration of direct glissile interstitial loop production as part of the 
cascade event (Trinkaus etal. [1993]). If this type of loops were indeed produced to a 
substantial degree the mass transport data would have to be re-interpreted, as discussed 
by Trinkaus etal. [1994]. 



Fig. 43. Distribution of defect clusters in Ni irradiated as thin foil at 573 K with 14 MeV neutrons. The 
measured positions are projected to the paper plane (after Kirjtani [1993]). 
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Fig. 44. Fluence dependence of defect cluster accumulation in neutron-irradiated copper at different irradiation 
temperatures. The upper solid line represents data from fission, fusion, spallation neutrons, and 800 MeV 
protons (Singh and Zinkle [1993]). 


4.3. Fast heavy-ion irradiation 

The electronic energy loss of charged projectiles in the metallic crystal lattice has 
been treated in §4.2.1 as a quantity which must be reduced from the total recoil energy 
in order to obtain the available damage energy. The electronic energy loss was taken as 
a heat source and other effects have been neglected. This approximation holds up to a 
certain threshold (dE/cLt) e of the electronic stopping power. In fig. 45 the nuclear and 
electronic stopping powers are compared with each other as function of the incident 
energy of a Xe ion penetrating Cu. One realizes that the electronic stopping power 
predominates above a certain threshold. This threshold amounts to about 40 keV/nm for 
metals. By using oxygen to uranium ions with energies of 10 MeV/nucleon one covers 
the (dEXix)e range between about 1 and 10 keV/nm. Heavy ions with energies above 
1 MeV/nucleon are available from a number of accelerators, which have formerly been 
built for the purpose of nuclear physics research. The penetration depth of such ions in 
matter is of the order of 10^* m, which leads to a sample thickness well suited for 
microscopic and macroscopic investigations. 

The differential cross-section of a target electron to receive the recoil energy T varies 
as 1/7 2 with, for example, 7^=10 keV and <T>= 100 eV. These recoil energies are 
very small as compared to the projectile energy because of the unfavorable mass ratio of 
the colliding particles. The excited electrons are not able to initiate atomic displacements 
by collision with target atoms, again because of the unfavorable mass ratio. The energy 
released to the target electrons is shared between the kinetic energy of them and the 
potential energy which is essentially the potential energy of the ionized atoms. It amounts 
to about 10% of the total deposited energy for the ions located in the ion wake. The 
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Fig. 45. Electronic and nuclear stopping power for Xe in Cu. 


corresponding recoil energy per atom around the projectile trajectory amounts from 0. 1 
to a few eV. Quite different from the nuclear energy loss process, where the recoil 
energy transfer is a spatially and timely stochastic process the potential energy transfer 
to the trajectory surrounding atoms is a highly collective phenomenon in space and time. 
The response time of the free electron gas is characterized by the plasma frequency, 
which corresponds to a relaxation time of 10 -16 s. The whole process is termed a 
Coulomb explosion and leads to a shockwave emanating from the projectile trajectory. 

Coulomb explosion has been studied by means of MDC (Legrand etal. [1993]) with 
the goal of illuminating the result of such collective lattice deformations. The calculations 
have been performed for a Fe crystal cell containing 5,000 to 6,000 atoms with cell 
dimensions of about 6x6x2 nm 3 . The authors investigated the shockwave effect for 
projectile trajectories parallel to <100>, <110>, <11 1>, and <120>. The excitation 
energies per atom were chosen around 1 eV/atom and the radial impacts were given 
simultaneously to about 600 atoms forming the cylinder around the assumed projectile 
trajectory. The excitation energy was dissipated at the crystal boundaries by properly 
arranged stochastic walls. As a result, the formation of one to two Frenkel defects was 
observed per event. The results indicate that Frenkel defect production might be possible 
by electronic excitation via a Coulomb explosion. 

Experimental work proved a significant influence of the electronic excitation on the 
defect production above a certain threshold of (dE/dx) e . Figure 46 shows the thermal 
recovery of the radiation-induced resistivity increase after irradiation with the given ions 
and energies. Attention is focussed to the stage I annealing. While the 0.5 MeV proton 
irradiation produces the wellknown stage I annealing, the amount of recovery becomes 
smaller when the electronic stopping power of the projectiles is increased. The effect is 
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Fig. 46. Electrical resistivity recovery upon isochronal annealing and recovery rates for heavy-ion irradiated Cu 
and Ni (after Iwata and Iwase [1990]). 
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stronger for Ni than for Cu. The recovery reduction occurs at the low temperature side 
of stage I indicating that the fractional population of close pairs and correlatedly 
recombining freely migrating interstitials (§ 3.3.2.4) is reduced. By increasing mass and 
incident energy of the projectiles not only (dMLc) e is enhanced but also the recoil 
spectrum due to elastic nuclear collisions is changed. In order to separate potential effects 
arising from both of the mechanisms it is reasonable to plot the stage I recovery one time 
vs. the weighted mean elastic recoil energy T U2 from nuclear collisions (Averback el al. 
[1978] and the other time vs. (dE/cLc) D . Increasing T\/2 means essentially fractional 
decrease of single Frenkel defect production in favor of defect cluster production. Both 
are done in fig. 47 for Ni. The two branches (open and closed symbols) in fig. 47a tell 
us that equal values of T\/2 lead to completely different stage I populations, depending 
on the incident ion energy. Fig. 47b shows all datapoints describing just one curve. 
Obviously, the electronic stopping power is responsible for the stage I reduction. For Cu 
the datapoints describe just one curve already in the Tin -plot indicating negligible 
influence of (d£/dx) e on the stage I population. The metals Al and Pt were found to 
behave like Ni while Ag behaves like Cu. The two groups of metals deviate by their 
electron-phonon coupling (§4.2.1 and §4.2.2) with a strong coupling for Ni and Pt and 
a weak one for Cu and Ag. The authors explain these observations by Frenkel defect 
production via elastic nuclear collisions and the reduction of the recovery in stage I by 
subthreshold annihilation (§ 3.1.2) of formerly produced Frenkel defects as a result of the 
electronic excitation. The behavior of the defect production rate as function of the defect 
concentration confirms this interpretation. The rate is constant initially and begins to 
decrease with increasing defect concentration when the accumulated defect concentration 
allows for a measurable annihilation rate (Iwata and Iwase [1990]). 

For the metals Ti, Co, and Cr, Dunlop et al. [1993] observed positive electrical 
resistivity damage rates which increase linearly with (dE/dJc) e . The effect is largest for 
Ti. For Ti irradiated by GeV Pb ions electron microscopy resolved alignments of small 
defects along the incident ion beam direction. The defects appear by the same contrast 
as known for dislocation loops. The defect size appears to stay below 10 nm. By 
irradiating with 4 GeV U ions at 20 K the phase transformation from the hep equilibrium 
a-phase to the hexagonal w-phase is induced. Without irradiation, this phase transform- 
ation is initiated under hydrostatic pressure of the order of 10 kbar. For Fe the authors 
find the Ni type behavior for irradiation with ions up to (dE/ck) e “ 40 keV/nm. Beyond 
this value Fe behaves like Ti, Co, and Cr. Dunlop et al. [1993] point out that each of 
the metals Ti, Fe, Co, and Cr does form various allotropic phases and exhibits soft 
modes in the phonon spectra, which are associated with the displacive phase transform- 
ation. This correlation might suggest that the damage detected by means of the electrical 
resistivity increase consists of second phase precipitates also for Fe, Co, and Cr. 

Up to now there is no experimental evidence for direct Frenkel defect production by 
electronic excitation in pure metal crystals. But, the deposited energy is obviously 
sufficient to induce close Frenkel pair recombination (subthreshold collisions) with strong 
electron-phonon coupling. The specific collective and radial impact generation emanating 
from the swift ion path may well trigger displacive phase transformations. This has been 
demonstrated, in particular, for the crystal-amorphous transition in intermetallic com- 
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Fig. 47. Fraction of stage I recovery for heavy-ion-irradiated Ni (a) vs. weighted average recoil energy of the 
irradiating ions indicated, (b) vs. electronic stopping power of the irradiating ions indicated (after Iwata and 
Iwase [1990]). 

pounds. In compounds like NiZr 2 and NiTi which can easily be amorphized by other 
kinds of irradiation (elastic collisions) or plastic deformation, ion tracks of amorphous 
material of about 8 nm diameter are produced for (dE/dc) e >50 keV/nm (800 MeV Pb 
or U ions). For Cu 3 Au no phase transformation by electronic excitation has been 
observed. 

In amorphous materials, fast-ion irradiation causes strong plastic flow perpendicular 
to the ion beam direction (KlaumOnzer [1992]). This phenomenon is again caused by 
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the collective cylindrical impact generation which, in amorphous materials, can develop 
significant internal stress levels as the atoms must not necessarily return to their initial 
positions before the Coulomb explosion. In crystals they do return because of the 
topological stability. The amorphous material responds to the internal stress by intense 
plastic flow. This flow requires the presence of shear sites of sufficiently large concentra- 
tions. These sites are likely to be identical with the free volume (ch. 7, §9.1.2). 

4.4. Swelling 

One of the serious radiation damage problems in nuclear engineering is the dimen- 
sional instability of metallic structural materials. It shows up as swelling, i.e., decrease 
of the materials density with increasing fluence, and as volume-conserving creep, which 
is observed at significantly lower temperatures than is the thermally activated creep 
(§ 4.5). Swelling and the formation of voids in the bulk of the material was detected in 
structural core parts of the Dounreay Fast Reactor (Cawthorne and Fulton [1967]). 
Electron micrographs of voids in Ni-ion-irradiated Ni and neutron irradiated type 316 
stainless steel are shown in fig. 48. The observed macroscopic volume increase AV/V, 
i.e., the swelling equals the total void volume. The morphology of voids as three- 
dimensional vacancy arrangements is multiform (further reading: Corbett and Ianiello 
[1972], Bleiberg and Bennett [1977] and Carpenter etal. [1980], Gavillet et al. [1993]). 
Voids have also been studied by small-angle X-ray scattering, which can fruitfully be 
combined with other techniques (ch. 12). The phenomenon is widespread in view of the 
materials as well as the irradiating particle species. Microstructure and chemistry 



Fig. 48. Voids in (a) Ni irradiated by 200 keV Ni' ions at 500°C (after Adda [1972]) and (b) type 316 
stainless steel irradiated by 7.1 x 10 22 neutrons/cm 2 at 525°C (after Appleby et al. [1972]). 
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(composition) of the materials and irradiation temperature are of considerable influence 
on the swelling rate. The general feature of the swelling behavior of relevant structural 
materials is sketched in fig. 49. We note a number of important characteristics: (i) 
significant swelling starts after accumulation of an incubation fluence, (ii) saturation of 
swelling appears not to exist within tolerable swelling levels (except a few void lattice 
forming materials), (iii) ferritic steels show considerably less swelling than austenitic 
ones, and, (iv) cold-working enhances the incubation fluence. For comparison, the 
behavior of Mo is shown: this develops a void lattice (§ 4.6). 

In the present section we focus our attention on the basic processes controlling void 
nucleation and growth. This view will enable us to understand the above characteristics 
in terms of general defect reaction schemes. Although voids have been known for a long 
time from the Kirkendall effect (§ 2.2.1) (see also ch. 7, 5.3), they were not expected in 
irradiated materials as all earlier investigations showed only planar aggregates of 
vacancies (dislocation loops, stacking fault tetrahedra). Moreover, such aggregates were 
found to dissolve at temperatures at which voids are still being formed. Research of the 
nucleation problem was always impeded by the fact that impurities, in particular, gaseous 
ones will certainly stabilize three-dimensional vacancy agglomerates. In addition, Fast 
Reactor irradiations not only produce Frenkel defects but also relatively large amounts 
of He resulting from the large cross-sections for ( n,a ) transmutations of frequently 
present metal isotopes. On the other hand, it proved to be difficult to prepare sufficient 
gas-free or generally impurity-free samples to study homogeneous void nucleation under 
charged particle irradiation. MDC studies for Cu and H as impurity (Shimomura et al. 
[1993]) revealed that the tri-vacancy relaxation towards the stacking fault tetrahedric 
structure may be suppressed by involving H atoms. However, the trapping energy for H 
is small for vacancy agglomerates up to 5 vacancies. From 6 vacancies onwards, trapping 



Fluence / dpa 


Fig. 49. Swelling vs. fluence for different materials and conditions (schematic) after Eyre and Matthews 
[1993], 
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of H stabilizes the void structure. An H atom which is trapped in the hexa-vacancy 
pushes out four atoms which are positioned in the tetradedral 10-vacancy of 6-vacancy 
stacking fault tetrahedra. This stabilization is equivalent to the suppression of the 
relaxation of a 3-vacancy to a stacking fault tetrahedra. The void structure becomes more 
and more stable between 10 and 20 vacancies, but needs an increasing number of H 
atoms for the stabilization. The study also shows that vacancy agglomerates with less 
than 15 vacancies become thermally unstable at the peak swelling temperature (see 
further below). 

Gaseous impurities are now generally accepted as necessary prerequisites for the 
stabilization of the void structure. The fission product He has received most attention 
with respect to its role in void nucleation (Glasgow etal. [1981], Stoller and Odette 
[1985], Parker and Russell [1982]). Void formation by electron irradiation demon- 
strates the nucleation for pure single vacancy production. Here, void densities are often 
enhanced at and near grain boundaries and dislocations (Horsewell and Singh [1988]). 
This inhomogeneity corresponds to the well-known impurity segregation. 

For cascade-producing irradiation, intra-cascade clustering and the considerable 
fluctuation of the defect concentration of freely migrating defects (which escape from 
cascades) on void nucleation, has been studied in some detail (Kitajima etal. [1985], 
Trinkaus [1990]). It turns out that these effects should have only minor influence on the 
effective void nucleation rates, mainly because of the small probability for the direct 
agglomeration of large enough numbers of vacancies. 

Gas-containing three-dimensional vacancy agglomerates (cavities) are termed bubbles, 
the size of which is controlled by the gas pressure, i.e., the ratio of the numbers of gas 
atoms to vacancies and the metal surface energy (Trinkaus [1983], Mansur [1993]). 
The transition to apparently unlimited void growth as seen in fig. 49 is correlated with 
the critical bubble radius, which is given by the deviation of the effective gas conditions 
from those of the ideal gas, the surface energy, and the excess vacancy flux to the cavity. 
Hence, extension of the incubation fluence requires materials design such that the gas 
and/or vacancy fluxes towards bubble embryos are reduced. 

The excess vacancy flux feeding the void growth was generally assumed to arise 
from biased vacancy and interstitial annihilation at sinks (reviews: Brailsford and 
Bullough [1978], Bullough and Wood [1980], Mansur [1993] and further reading: 
Wiffen etal. [1979], Nygren etal. [1981]). The common explanation of such biased 
vacancy and interstitial annihilation at sinks considers the elastic interaction between the 
migrating point defects and dislocations as causing a drift diffusion towards the disloca- 
tions. The interaction energy for a dilatation centre A with the volume change AV a "' in 
the strain field of an edge dislocation is proportional to AV™/r, where r is the distance 
of A from the dislocation core. The proportionality to AV^ makes the interaction 
strength quite different for vacancy and interstitial (Cu, for example: AV” =-0.2, 
AV" = 1.6). In a given material, the topologies and concentrations of saturable sinks 
(e.g., sessile dislocations), unsaturable sinks and saturable traps composes a complex 
scheme of spatially varying defect reactions. 

The swelling theories were based upon the chemical rate equation approach which 
assumes spatially homogeneous defect reactions. Spatially inhomogeneous reactions like 
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point-defect annihilation at an extended localized inexhaustable sink (e.g., sessile edge 
dislocation) were incorporated into the rate equations by appropriate divergence terms for 
the defect concentrations. For a long time, freely migrating single interstitials and 
vacancies were assumed to be the only reactants for traps and sinks and in addition the 
displacement rates according to eq. (10) were taken as production rates for these freely 
migrating defects. Intra-cascade recombination and clustering (§4.2.2) were widely 
neglected. The wealth of experimental data on the microstructural evolution under all 
kinds of irradiation (further reading: Klueh etal. [1992], Ishino etal. [1991]) as well as 
the progress in modelling suggest that satisfying agreement between the early theories 
and experiments whenever achieved may be attributed to the large number of less well- 
known parameters which were involved in the theories and adjustable within relatively 
wide ranges. However, taking the various defect reactions into account allowed a 
satisfying qualitative explanation of the swelling response to the various materials 
variables and irradiation conditions. For a brief illustration, we look at the influence of 
cold-working and alloy composition on the irradiation temperature dependence of the 
swelling (fig. 50). The cold work controls the vacancy supersaturation via dislocation 
sink strength and bias. The temperature controls the defect mobility and, by this, the 
stationary defect concentration which in turn determines the relative influence of the 
recombination. At low temperatures the recombination rate is large because of the slowly 
migrating vacancies. The biased sink annihilation is suppressed. At higher temperatures 
the latter is favored at the expense of recombination. At the same time, vacancy loops 
start decaying which enhances the vacancy supersaturation. At even higher temperatures, 
the stationary vacancy concentration approaches thermal equilibrium values and, hence, 
does not provide the supersaturation required for void formation. Slowing down of one 
of the migrating defect species can be achieved by adding solutes which temporarily trap 
interstitials (§3.3.2.8). In this way swelling is reduced (see, for example. Potter et al. 
[1977]) as shown in fig. 50b. The peak swelling temperature depends on the displace- 
ment rate because of the increasing dominance of recombination with increasing 
displacement rate (steady-state defect concentration). 

Quantitative interpretation of the vacancy supersaturation by the annihilation bias 
alone became more and more difficult when new experimental work revealed swelling 
rates up to 5%/dpa in neutron-irradiated Ni (Kiritani etal. [1990]) and in Cu contain- 
ing 0 (Yamakawa etal. [1992]). Moreover, detailed investigation of local variations of 
the swelling rates showed maxima up to 7%/dpa for Al in the neighborhood of grain 
boundaries and in subgrains surrounded by dislocation walls (Horsewell and Singh 
[1988]). The width of these maximum swelling regions is of the order of 1CT 6 m. A 
swelling rate of 5%/dpa requires an excess vacancy flux to the voids by the same 
fraction of the total flux of the freely migrating single defects. Hence, for the less 
probable case that all interstitials annihilate at sinks different from the voids, the 
fractional freely migrating defect escaping from the cascades must be 5% or significantly 
larger for the probable case of partial interstitial annihilation at voids. Such values do not 
fit theoretical and experimental findings (§4.2.2). In addition, the width of KT 6 m is 
considerably larger than the average diffusion distance of single vacancies and inter- 
stitials for the sink strength observed for respective irradiation conditions. These 
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Fig. 50. Influence of pretrealment and solute additions on void-swelling: (a) 316 type stainless steel after 
14.1 x 10 22 neutrons/cm 2 , o - annealed, a - 10% cold worked (cw), □ - 20% cw, v - 30% cw (from Flinn and 
Kenfield [1976]); (b) Fe-15 at% Cr-20 at% Ni with quoted Si additions after a fluence corresponding to 140 
displacements per atom (from Johnston etal. [1976]). 
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difficulties can be overcome by taking into account the production of glissile interstitial 
loops during the cascade event (Trinkaus etal. [1992]). Such loops are known to move 
by climb and glide. Small interstitial loops are believed to be faulted. However, MDC 
studies for Cu have shown that small interstitial loops do transform spontaneously from 
the faulted to the highly glissile unfaulted configuration (Foreman etal. [1991]). Such 
loops would perform a one-dimensional glide with an estimated activation enthalpy of 
about 0.1 eV. For small loops of four interstitials, the direction of the Burgers vector 
may change to equivalent lattice orientations with an activation enthalpy of about 0.5 eV. 
Such loops would migrate by describing a zig-zag path until they are trapped or 
annihilated by proper components of the microstructure. The significant result of the one- 
dimensional glide motion is the much larger free travel distance than the diffusion 
distance of the three-dimensionally migrating defect for the same microstructure. Under 
these circumstances, spontaneously produced small interstitial loops may be trapped or 
annihilated having a very small chance to get involved in recombination or to encounter 
voids. Numerical calculations showed that the free travel distance may indeed be of the 
order of KT 6 m and thus explain the required vacancy supersaturation in the above 
mentioned width of maximum swelling regions near grain boundaries. By this explana- 
tion, the annihilation bias is supplemented by a production bias in the sense that the 
numbers of freely migrating vacancies and interstitials escaping the cascade are biased 
by the gliding interstitial loops. More specific experimental work is necessary to confirm 
this defect reaction path. 

The remarkable swelling resistance of the ferritic steels (fig. 49), and of other bcc 
metals and alloys, was earlier ascribed to the small elastic interstitial dislocation 
interaction due to the small relaxation volume of the interstitial in these structures (table 
4). The small elastic interaction would result in a small drift diffusion effect towards the 
dislocations and, hence, a small annihilation bias (Sniegowski and Wolfer [1984]). 
MDC studies of the interstitial dislocation interaction, however, showed that interstitials 
are likely to be trapped near dislocations (Kuramoto [1992]) and thus kept available for 
recombination. More detailed experimental information leads to the conclusion that 
impurity trapping is the leading effect among a number of other defect reactions which 
are of greater influence on the vacancy supersaturation than corresponding reactions are 
in fee metals and alloys (Little [1993]). The interstitials C and N and the substitutional 
Si are particularly seen as being effective. The C-vacancy binding enthalpy seems to be 
as high as 0.85 eV and would therefore act as an effective recombination catalyzer even 
at high temperatures. Also, Cottrell atmospheres around dislocations may screen the sinks 
and reduce climbing rates. 

4.5. Radiation-induced creep 

The thermally activated creep of materials is treated in chs. 20-25. An important feature 
of this phenomenon is the exponential temperature dependence of the creep rate, e, for 
a fixed stress, a. If the creep experiment is done under irradiation, e becomes nearly 
independent of temperature below about T M / 2. This behavior originates from the high 
radiation-induced vacancy- and interstitial-concentration which allows sufficient disloca- 
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tion climb and deformation, despite the small diffusivities of these defects (reviews: 
Lucas [1993], Mansur [1992] and further reading: Carpenter et al. [1980], Gittus 
[1978], Woo and McElroy [1988]). For not too large applied stresses <x, the creep rate e 
is found to be proportional to cr and to the defect production rate. One of the successful 
models uses the stress-induced preferential absorption (SIPA) of defects at dislocations 
capable of climbing (Heald and Speight [1974]), as illustrated in fig. 51. If again the 
elastic-energy-controlled drift diffusion is assumed to determine the defect current 
towards dislocation sinks, the orientation of the dislocations with respect to the external 
stress will control the rate constants for defect annihilation. According to the model 
calculation, dislocations of type I (fig. 51) absorb more interstitials than vacancies, and 
vice versa for type II dislocations. The resulting climb along the directions indicated 
cause the volume-conserving elongation of the sample parallel to a-. Within the void- 
swelling regime, the excess interstitial flux to the dislocations determines the creep rate, 
which then obtains a contribution proportional to the swelling rate. 

The SEPA mechanism is certainly of fundamental importance for the radiation- 
induced creep but the really observed behavior is more complex due to the complex 
microstructure of the structural materials (dislocation interactions, grain-boundary 
diffusion, chemical heterogeneities). At high stress levels the creep rate becomes 
proportional to o 2 which has been explained by a climb-enabled dislocation glide 
mechanism (Mansur [1979]). 

4.6. Self-organization of point defect agglomerates 

Many metals and alloys develop three-dimensionally ordered arrangements of voids 
(void lattices) when irradiated with heavy ions or neutrons at temperatures around 0.45 
T m . Figure 52 shows the void lattice in Nb, for example. The phenomenon is observed 
for fee, bcc and hep structures. In the cubic metals, the void lattices are isomorphous to 
the atomic lattice. For the hep metals, only planar ordering parallel to the basal planes is 
observed. The void radii amount to about 5 nm while the void lattice parameters are 
about ten times larger. This ratio varies between 5 and 15 for the various metals with the 
exception of Al, for which it amounts to about 40. The ratio is nearly independent of the 
irradiation temperature. The minimum fluence required for the lattice formation ranges 



Fig. 51. Dislocation climb directions by preferential vacancy and interstitial absorption under applied stress <i. 
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(b) 

Fig. 52. Void lattice in Nb irradiated at 1073 K with 7.5 MeV Ta ions to fluences of (a) 140 dpa and (b) 300 dpa. 



from 5 dpa to 400 dpa, depending on the metal species. This fluence also depends on the 
irradiation temperature and on the gas and impurity level of the material. In pure metals 
void lattices are not observed for electron irradiation. Electron irradiation of disordered 
void arrangements as produced by ion irradiation did not induce the formation of the 
lattice. As the total swelling caused by the void lattice generally amounts to a few 
percent, the total number of displacements is by two to four orders of magnitude larger 
than that of the vacancies accumulated in the void lattice. Correspondingly, the energy 
temporarily stored in the total number of defects produced (given by the total fluence 
level) is large compared to the energy stored in the void lattice. The ordered structure is 
the result of the highly energy dissipative process. 
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Sufficiently long irradiation with ions or neutrons at temperatures around 0.3 T m leads 
to periodic arrangements of dislocation walls (see fig. 53, for example). The existence 
ranges for disordered and ordered vacancy agglomerates are well distinguished in the 
defect production rate vs. temperature diagram as shown in fig. 54. At still lower 
temperatures, simple cubic lattices of stacking fault tetrahedra are observed in electron 
irradiated fee metals and alloys (Zaiser etal. [1992]). 

Earlier theoretical treatments of void lattice formation were based upon equilibrium 
thermodynamics (minimization of the total energy) and the elastic interaction of the 
microstructural components (Willis [1975]). These attempts failed as they can neither 
explain the observed dependence on temperature and irradiating particle species nor the 
lattice formation in isotropic metals like W. The now generally accepted explanation 
considers the void lattice as the result of a self-organizing open system far from 
thermodynamic equilibrium (Martin [1975], Krishan [1982]). The various theoretical 
approaches reported since follow this concept but deviate by the various defect reactions 
taken into account and considered to be dominant (Abromeit and Trinkaus [1992], 
Hahner and Frank [1992], review: Abromeit [1989]). The simplest defect reaction 
system describing a heavy ion or neutron irradiated crystal already fulfils the conditions 
for self-organization (Haken [1977]). The irradiation produces vacancy loops and freely 
migrating vacancies and interstitials by constant rates. The loops grow by vacancy 



Fig. 53. Periodic arrays of planar (001 } walls of dislocations and stacking fault tetrahedra in copper irradiated 
with 3.4 MeV protons around 100° C to 2 dpa fluence. Imaged in <100> projection (after Jager etal. [1990]). 
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annihilation and shrink by interstitial annihilation. The annihilation rates are biased as 
described in § 4.4. For simplicity we neglect recombination, which is allowed for small 
defect production rates and/or high temperatures. The important characteristics for the 
self-organization are (i) the diffusion length of the reacting vacancy and interstitial 
commensurates with the characteristic length of the evolving ordered structure, (ii) the 
reaction rates controlling the loop arrangement are non-linear as they are given by the 
concentrations of the loops times the concentrations of the migrating defects. Math- 
ematically) the system is described by a proper chemical rate equation system which is 
analysed by means of the linear stability analysis with respect to the stability of spatially 
homogeneous defect distributions. Our diffusion-controlled reaction system does not 
contain any information about lattice orientation. Hence, it will not give predictions on 
the relationship between the ordered structure and the lattice orientation. Nevertheless, we 
stay with this model in order to understand the physical origin of the self-organization. 
The stability analysis considers the behavior of concentration fluctuations superposed to 
the spatial averages. The fluctuations are described by a Fourier series. One component 
is shown in fig. 55. The index s stands for sink and c s is proportional to the vacancy loop 
concentration. Because of the annihilation bias the fluctuation amplitudes of the 
interstitial concentration c ■, and of the vacancy concentration c v are different in the way 
as shown in the figure. This difference causes amplification of the fluctuation of the loop 
concentration Sc/<c s > as the vacancy concentration is larger than the interstitial 
concentration at the maximum of the loop concentration and higher vacancy arrival rates 
than interstitial arrival rates induce loop growth. The growing fluctuations induce 
vacancy and interstitial fluxes from the regions of positive fluctuation amplitudes to those 
of negative ones (fig. 55). These fluxes are initially larger for the interstitials than for the 
vacancies due to the difference in the fluctuation amplitude. However, they reduce this 
difference until total equilibration. The linear stability analysis shows that this amplifica- 
tion of 8c/<c s > vanishes when the fluctuation wavelength A goes to zero and to infinity. 
The fluctuation of the loop concentration must therefore have a maximum at a certain 



Fig. 54. Map of periodic cluster arrays for Cu irradiated with neutrons, heavy ions, and protons, v - random, 
□ - periodic cluster array, o - periodic voids and dislocations (after JaGER and Trinkaus [1993]). 
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Fig. 55. Sketch of the spatial fluctuation of the concentrations of sinks (s), vacancies (v) and interstitials (i). 


finite value of A . 

It becomes obvious that the driving force for the self-organization is the annihilation 
bias for the two diffusing reactants and the fact that one reactant increases the sink 
strength of the loops (by growing) and the other (interstitial) decreases it. When the bias 
had the opposite sign the system would not be self-organizing. Adding recombination or 
secondary sinks (grain boundaries) or traps to the reaction system would reduce the 
driving force for self-organization to the measure at which the loops contribute to the 
total sink and trapping strength. Introducing a production bias (§ 4.4) shifts the annihila- 
tion bias threshold accordingly: the difference of the sink strength for interstitials and 
vacancies must be larger than the respective difference of the production rates. At low 
temperatures where the recombination dominates the sink annihilation, loop ordering will 
be impossible in accordance with fig. 54. At high temperatures the vacancy loops decay 
but void formation becomes possible if nucleation is favored and self-organization may 
be possible when they dominate as sinks and the respective annihilation rates are biased. 

The linear stability analysis yields the initial conditions for which homogeneous 
cluster distributions may become unstable. It does not give information on the shape of 
the evolving structure. Such information is obtained by analysis of the stationary 
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solutions in the instability regime. As a result one obtains the concentration fluctuations 
for vacancy loops as shown in fig. 56. The regions of high-loop concentrations are much 
smaller than those of low concentrations and with proceeding fluence the separation 
between the empty and the highly populated regions becomes even sharper and the 
wavelength slightly increases. 

The stability analysis for defect reaction systems controlled by isotropic diffusion 
cannot give information about the symmetry relationship between the ordered defect 
arrangement and the crystal structure. Anisotropic diffusion controlled by the crystal 
lattice symmetry must influence the process of pattern formation. Two different attempts 
have been made to introduce anisotropic diffusion: (i) the diffusion of the vacancies and 
interstitials is assumed to be asymmetric in the sense that the vacancy migrate three- 
dimensionally while the interstitials migrate one-dimensionally (dynamic crowdion, 
gliding interstitial loops) or two-dimensionally (Evans [1987], Hahner and Frank 
[1992], Dubinko [1991], Jager and Trinkaus [1993]), (ii) the elastic interaction 
between (predominantly) the interstitials and the ordering vacancy cluster introduces the 
crystal lattice symmetry, leads to respective anisotropic drift diffusion and thus controls 
the shrinkage of the clusters (Borodin et al. [1993]). 

The effect of one-dimensionally migrating interstitials can easily be seen in fig. 57 
which shows how a mispositioned void is shifted. Arriving interstitials whose path is not 
blocked by neighboring voids induce shrinkage at the one side and three-dimensionally 
migrating excess vacancies arriving within the screened regions induce growth on the 
other side. In a three-dimensional crystal lattice the evolution of the void lattice would 
therefore start with linear ordering along the migration paths of the interstitials or with 
planar ordering within one of the planes containing the maximum number of the 
migration lines. The fundamental question with this explanation of the pattern formation 
concerns the structure of the one-dimensionally migrating interstitial. Stability of the 
dynamic crowdion has not been confirmed by the MDC studies for fee and bcc metals 



Fig. 56. Spatial variation of the loop concentration for two different maximum minimum ratios (after Abromeit 
and Trinkaus [1992]). 



Ch. 18, 


4 


Point defects 


1707 



Fig. 57. Rearrangement of voids by one-dimensionally migrating interstitials (after Foreman [1972]). 


up to now (see § 3.3). In addition, the difference of the formation enthalpies for the 
dumbbell interstitial and the crowdion were found to only measure a few tens of an eV. 
Such values are at variance to the values required for the explanation of void lattices 
observed at the upper temperature limit and the observed maximum void lattice parame- 
ters. The mean free path of the crowdion between its creation and conversion to a stable 
dumbbell interstitial must be at least of the order of the void lattice constant. The 
opposite tendency of the mean free path to decrease and the void lattice distance to 
increase with increasing temperature defines an upper temperature limit for the void 
lattice formation. The observed maximum void lattice distances and the upper tempera- 
ture limit requires a difference between the enthalpies for interstitial conversion and 
migration of the crowdion of values between 0.5 and 1 eV in contrast to the findings 
from MDC studies. Of special interest in this regard is the behavior of Au for which the 
absence of a pronounced annealing stage I has been interpreted by the conversion of the 
crowdion to the stable dumbbell configuration at very low temperatures (Frank and 
Seeger [1983]). Therefore, the observed bubble lattice (Johnson etal. [1983] cannot be 
explained by a one-dimensionally migrating interstitial. Similar arguments hold for Nb. 
The gliding interstitial loops discussed in § 4.4 might explain the symmetry relationship 
in a consistent manner. For hep metals two-dimensionally migrating interstitials are more 
likely to be stable and would explain the observed void lattice geometries. 

The various attempts to explain the symmetry relationship by anisotropic drift 
diffusion due to the elastic interaction have been successful in rationalizing the symmetry 
relationship (Martin [1977]) but quantitative explanation seems to be successful for 
sinks with significant elastic asymmetry (as dislocation loops) only. 
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4.7. Radiation-induced solute segregation 

Irradiation of a crystal causes the supersaturation of vacancies and interstitials in the 
lattice. At irradiation temperatures such that both defect species are mobile the defects 
will annihilate by recombination and annihilation at sinks. For very low sink densities the 
annihilation rate will be controlled by the recombination reaction. As the defects migrate 
randomly recombinations will occur homogeneously in space. For descriptions in terms 
of the rate equation approach the defect annihilation is considered by the “lossy 
medium”. This treatment neglects an important sink effect. Unsaturable sinks like 
surfaces or non-sessile dislocations cause persistent defect fluxes under stationary 
circumstances. The situation in front of such a sink is sketched in fig. 58 for a solid 
solution consisting of A and B atoms. If the partial diffusion coefficients D A V and Dg of 
the constituents A and B of a binary alloy due to vacancy migration are different, the 
vacancy flux J y to the sink will cause different atom fluxes J K and J° with the result of 






Fig. 58. Inverse Kirkendall effect produced by vacancy (a, b) and interstitial (c, d) fluxes to sinks. 
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a depletion of the faster transported constituent near the sink as shown in fig. 58b. The 
process is the inverse Kirkendall effect (Smigelskas and Kirkendall [1947]). If the 
interstitial flux J { provides atom fluxes as shown in fig. 58c constituent A will be 
enriched as shown in fig. 58d. For steady state conditions and r vs = r is the concentra- 
tion gradient Vc A was derived by Wiedersich et al. [1979] to be proportional to 
[(D;/D;)-(D;/£»i)]Vc v . 

Atomic redistribution of this kind also occurs if one of the defect species forms a 
mobile tightly bound complex with one of the constituent atoms and annihilates at the 
sink. In this case, the complex forming constituent will be enriched at the sink. One 
realizes that such atomic redistributions may lead to the formation of new phases. The 
most simple case is the irradiation of an undersaturated solid solution of B in A. When 
the redistribution amplitude is large enough, the solubility limit will be exceeded near or 
at the sink and an intermediate phase will be formed whenever nucleation conditions 
allow it. At sufficiently high irradiation temperatures the process may be reversed by the 
action of thermal vacancies after stopping the irradiation. Theoretical description of the 
radiation-induced segregation has been reviewed by Wiedersich and Lam [1983]. 

For studies of precipitation near internal sinks high voltage electron microscopy has 
proved to be a very valuable tool. Electron energies from about 0.5 MeV upwards are 
sufficient to produce Frenkel defects during the imaging. The beam intensity is extremely 
high such that the number of atomic displacements produced by irradiation in nuclear 
reactors can be exceeded by orders of magnitude within minutes. Radiation-induced 
formation of ordered phases like Ni 3 Si can thus be imaged easily by means of the dark 
field technique. For Cu-Be the formation of radiation-induced Guinier-Preston zones 
(Wollenberger and Kell [1989]) has been studied by means of field ion microscopy 
and that of the long range ordered y-phase CuBe by transmission electron microscopy 
(Koch etal. [1981]). The experimental material on radiation-induced segregation was 
reviewed by Rehn and Okamoto [1983]. 

Radiation-driven atomic redistribution is not necessarily bound to sink annihilation of 
the defects. If the rate of Frenkel pair recombination depends on the alloy composition, 
which might be the case for a strong influence of the chemical potential, the 
recombination rate would become inhomogeneous in space. This again would create 
persistent defect fluxes which could amplify existing concentration fluctuations (Martin 
[1980]). The field of phase transformations in irradiated materials has been reviewed by 
Russell [1993], 
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1. Introduction 

1.1. General features 

Much of the development of practical metallic materials has involved dealing with 
metastable states. Just what is meant by “metastable” we will consider more carefully in 
§ 1.3; for the moment we use it to describe any state with a Gibbs free energy higher 
than the lowest value corresponding to stable equilibrium. Conventionally the access to 
metastable states is achieved by solid-state quenching. Examples are the quenching of 
steels from the austenite phase field to form martensite (ch. 17, § 3.2) and the quenching 
of solution-treated Al-Cu alloys to obtain a supersaturated solid solution suitable for age- 
hardening (§5.1, below). These examples illustrate the point of obtaining metastable 
states: the relaxation of the constraint of being in equilibrium gives access to a wider 
range of structures, some of which can be expected to show superior properties. They 
also illustrate that the metastable state may be of interest as the final product or as an 
intermediate on the way to the final product. At least since the 1920s there has been 
interest in developing new materials by increasing the departure from equilibrium, mainly 
by rapid quenching of the liquid. This grew into a large research effort following the 
work of Duwez and colleagues in 1959-60 (reviewed by Duwez (1967]), in which gun 
splat-quenching was used to demonstrate complete, metastable, solid solubility in Ag-Cu 
and metallic glass formation in Au-Si and Au-Ge. Rapid liquid quenching has since 
been used in a very wide range of fundamental studies as well as being adopted as a 
significant commercial production process, particularly for magnetic materials. In the 
early 1980s interest turned also to solid-state processing as a way to obtain alloys in 
states far from equilibrium. This built on a great deal of earlier work, including the 
development in the 1960s of mechanical alloying to produce high-temperature alloys 
(Benjamin [1970]). 

The microstructural manifestations of the departures from equilibrium achieved by 
novel processing routes can be classified under five headings: 

(i) Increased defect concentrations. These include increased concentrations of 
vacancies (and to a lesser extent interstitials), dislocations, stacking faults, twin bound- 
aries and grain boundaries. Another characteristic to be put in this category is an 
increased level of chemical disorder (that is, reduced order parameter) in ordered solid 
solutions and compounds. 

(ii) Microstructural refinement. This involves finer scale distributions of different 
phases and of solute. Examples include finer dendrite arm spacings, eutectic spacings and 
precipitate diameters. 

(iii) Extended solid solubility. A stable crystalline phase may be found with solute 
levels beyond the solubility limit at the ambient temperature, or indeed beyond the 
equilibrium limit at any temperature (see account of solute trapping in ch. 8, §§5.2, 8.1.7). 

(iv) Metastable phases. A metastable phase may form during processing. It may be one 
that is not found in equilibrium under any conditions in the system of interest, or it may be 
so found but appear metastably under different conditions of composition, temperature or 
pressure. This heading includes crystalline and quasicrystalline intermetallic compounds. 
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(v) Metallic glasses. These are a special case of a metastable phase. When the 
processing is by cooling a liquid they form not by a phase transformation, but by a 
continuous congealing of the liquid. 

Some of the kinetic aspects of rapid solidification have been dealt with in ch. 8. The 
present chapter is concerned more generally with the wide variety of methods for 
producing alloys in states which are far from equilibrium, that is, with novel methods 
which lead to larger departures from equilibrium than can be achieved by conventional 
methods. It is also concerned with the metastable products themselves, and in particular 
with the metallic glassy state, which is the most distinctive product of the novel methods. 
Few surveys have sought to cover metastable states from such a broad perspective; 
another recent example is a book chapter (Martin, Cantor and Doherty [1996]). 

1 2. Methods for achieving metastability 

Here, an outline is given of some of the very diverse methods which have been 
applied, and links between them are discussed. Details of the methods are presented in 
§ 3. Most of the effort on developing new production methods has focused on cooling 
rate. This depends on the dimensions of the sample, the heat conduction in the material, 
and the rate of heat transfer to the surrounding quenching medium. If the sample to be 
quenched is initially solid, its dimensions are fixed and often not ideally suited to rapid 
heat extraction. In addition, efficient heat transfer may be difficult to achieve. On the 
other hand if the sample is initially liquid, it can be flattened into a thin sheet in intimate 
contact with a good solid heat sink. This is the basis of the Duwez method, subsequently 
developed into diverse continuous production techniques described in §3.1. In addition 
to solid heat sinks, the heat from the liquid alloy can be extracted into another liquid or 
a gas. The most common rapid liquid quenching methods give products in droplet, wire, 
ribbon or sheet form, with at least one dimension as small as 10 to 100 jam. The cooling 
rate is commonly 10 5 - 10 6 K/s. Rapid liquid quenching is also readily achieved when the 
surface of a bulk sample is melted; the bulk itself then acts as the heat sink (§ 3.5). The 
melting can be by a scanned continuous heating beam or by a stationary pulsed beam. 
For laser beams, pulse durations can be ultra short — in the nanosecond or even 
picosecond range — giving very thin melted layers and quenching rates as high as 10 14 
K / s; this is the highest liquid quenching rate which can be achieved (Lin and Spaepen 
[1984]). 

The objective of rapid heat extraction is to obtain rapid movement of the solid-liquid 
interface. The latent heat of freezing released at the interface is extracted through the 
growing solid into the quenching medium. However, the latent heat can alternatively be 
extracted into the liquid ahead of the interface if that liquid is sufficiently undercooled. 
The methods for obtaining highly undercooled liquids are described in § 3.2; they have 
attracted much interest because of their suitability for fundamental studies of the 
mechanisms of metastable phase formation. 

A number of other techniques are associated with the concept of rapid quenching, 
though less obviously than with the liquid case. When a solid is grown from atoms in the 
vapor phase (generated by evaporation or sputtering), the effective quench rate in the 
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process of condensation is as high as 10 12 K/s (§3.3). Other methods in which the 
mobility of the atoms joining the solid is rapidly reduced are those based on chemical or 
electrochemical precipitation or deposition from solution (§ 3.6). Irradiation of a solid 
leads to locally disturbed configurations which regain thermal equilibrium very quickly; 
again the effective local quench rate is of the order of 10 12 K/s. Amorphization by 
irradiation is considered in § 3.4. 

Chemical reactions can proceed towards equilibrium in stages, and the intermediate 
stages can yield a metastable phase. Just this is found in the solid-state amorphization 
reaction (§ 3.7) in which an amorphous alloy can be produced by the reaction of two 
solid metallic elements. 

Severe mechanical deformation can lead to metastable states (§ 3.8). The deformation 
forces the production of disturbed configurations, and in this way can be considered akin 
to techniques such as irradiation. On the other hand it can be used to bring different 
phases into intimate contact, promoting solid-state reactions which can yield metastable 
phases. Finally, high pressure can induce metastability. 

1.3. The nature of metastability 

A classification of thermodynamic states is illustrated in fig. 1 . In stable equilibrium 
the free energy (the Gibbs free energy for a system under constant pressure) is at the 
lowest possible minimum. The minimum shown schematically in the figure may 
represent the stable state of a single phase or of a phase mixture. A metastable state is 
one in internal equilibrium, that is, within the range of configurations to which there is 
access by continuous change, the system has the lowest possible free energy. However, 
if there were a large fluctuation — the nucleation of a more stable phase — transform- 
ation to the new phase would occur. A good example of a thermodynamically metastable 
state is an undercooled liquid; this has well defined equilibrium properties at each 



Fig. 1. A schematic illustration of possible thermodynamic states of alloys: (a) unstable equilibrium, (b) 
metastable equilibrium, (c) stable equilibrium, and (d) unstable non-equilibrium state. The vertical axis shows 
Gibbs free energy, the horizontal, a parameter representing changes in structure. 
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temperature, but contact with a nucleant will result in rapid transformation to the stable 
crystalline phase. A state of unstable equilibrium is destroyed by any fluctuation, 
however small; an example would be a precipitate dispersion (subject to Ostwald 
ripening) in which all the precipitates have exactly the same size. And there are unstable 
states, not in equilibrium at all. 

The term “metastable” was invented by the German physical chemist, Wilhelm 
Ostwald, in 1893. In his textbook (Ostwald [1893]) he defined the newly named 
concept as follows: “Of a physical system, persisting (in its existing state) when 
undisturbed or subject to disturbances smaller than some small or infinitesimal amount, 
but passing to a more stable state when subject to greater disturbances.” 

As elucidated by Turnbull [1981], all solids of practical interest are 
configurationally frozen, that is, under the conditions of observation, the atomic 
configuration does not undergo significant changes. In this way microstructures which are 
thermodynamically unstable are readily found, and are indeed much more common than 
microstructures in true equilibrium, whether stable, metastable or unstable. Examples are 
a supersaturated solid solution in the spinodal range, a dispersion of precipitates of varied 
size, and a glass; all of these can relax toward lower free energy states continuously, 
without any nucleation barrier. That these states persist at all is of course because of 
kinetic barriers to atomic motion. As suggested by Turnbull, the best practice may be to 
describe a microstructure in terms of its thermodynamic state before configurational 
freezing set in. In this way, a glass would be considered metastable and a supersaturated 
solid solution in the spinodal range would be considered unstable. A common practice, 
however, is to term all such configurationally frozen states “metastable”. Although such 
a usage is approximate at best, we shall adopt it in this chapter. By considering the local 
potential wells between which atoms must make diffusional jumps, even states which are 
thermodynamically unstable may be thought of as kinetically metastable. Such kinetic 
metastability can exist only where thermal activation of atomic jumps is required. 
Transformations which do not require thermal activation cannot be suppressed; marten- 
sitic transformations come in this category, and Turnbull suggests that crystallization of 
amorphous pure metals does also. 

Taking the definition of a metastable state to be any state which has a free energy in 
excess of that of stable equilibrium, it is then natural to quantify the degree of meta- 
stability by the value of the excess. The novel production techniques and microstructures 
associated with large excess free energies are the focus of this chapter. By novel methods 
such as those outlined in § 1.2, excess free energies as high as RT m (where T m represents 
the average melting point of the elements in the system) can be retained in con- 
figurationally frozen states. Such values are up to twice the highest achieved by 
conventional processing. In the various microstructural effects of novel processing 
methods (§ 1.1), the excess free energy can be seen to arise from three basic kinds of 
metastability (Turnbull [1981]): morphological, compositional and structural. In 
morphological metastability, the excess free energy comes from defects of all kinds 
including interphase boundaries; it is associated with (i) increased defect concentrations 
and (ii) microstructural refinement. Compositional metastability is found in (iii) extended 
solid solubility. Structural metastability is associated with (iv) metastable crystalline 
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phases and (v) metallic glasses. Typical excess free energies associated with these types 
of metastability are: 

morphological < 0. 1 RT m 
structural < 0.5 RT m 
compositional < 1 .0 RT m 

As described by Turnbull, such excess values can be considered to be attained in 
general by a process of energizing the system followed by quenching to effect 
configurational freezing. Considering the processing routes outlined in § 1.2, examples of 
energization can be seen to include melting, generation of gaseous species by evaporation 
or sputtering, dissolution, irradiation or mechanical deformation. In the solid-state 
amorphization reaction, the energization is the creation (by deposition or mechanical 
working) of interphase interfaces. The quenching, typically characterized by a cooling 
rate, is needed to reach a configurationally frozen state while retaining a large excess 
energy. It is useful to consider what constitutes configurational freezing. This does not 
occur at a single well defined temperature, but reflects the range of types of atomic 
motion. Roughly, however, it can be considered that diffusive motion of atoms within 
bulk phases (that is, lattice diffusion in a crystal, or diffusion in a glass) is frozen out 
below a reduced temperature (77 TJ of =0.4. For grain-boundary and free-surface 
diffusion, the corresponding reduced temperatures are = 0. 1 and = 0.2. Though a highly 
metastable material may have a lower temperature for effective configurational freezing 
because of for example a larger grain-boundary area, the fundamental temperatures for 
freezing the various atomic processes stay the same as in materials near equilibrium. 
Thus the retention of large excess free energies in practical materials is not necessarily 
thwarted under normal service conditions by any increased susceptibility to structural 
change. 

Configurationally frozen normal materials, even when processed by novel methods, 
could be in thermodynamically stable states, and such states by definition have the 
largest driving forces for their formation. It is reasonable to ask, therefore, why 
metastable materials are so commonly obtained. They must be favored kinetically. The 
nucleation of one phase in another is easier with lower interfacial energy between the 
phases, and that energy tends to scale with the entropy difference between the phases. 
Growth of one phase in another is favored if substantial solute partitioning is not 
required. Thus a transformation is in general more likely if the new phase is more similar 
to the old, either in entropy or composition. This has been described as a “step rule” 
(Turnbull [1981]), and it will often lead to metastable products. 

2. Formation of metallic glasses ( amorphous alloys) 

2.1. Formation and thermodynamics 

The metastable alloy structure most associated with novel processing methods and 
with large departures from equilibrium is the metallic glass. Strictly, a glass is formed 
when a liquid which is cooled fails to crystallize and instead continuously and uniformly 
congeals into a solid. The glass has essentially the structure of the liquid, combined with 
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the resistance to shear deformation of a solid. Non-crystalline, amorphous , alloys can be 
obtained by any of the methods outlined in § 1 .2, and not just those involving quenching 
of a liquid. In this section we show that amorphous alloys made by liquid quenching can 
be considered as true glasses and that those prepared by other methods are not signifi- 
cantly different. The thermodynamics of the glassy/ amorphous state are considered in 
relation to the formation of amorphous alloys. 

We start by pointing out that amorphous and glassy states are not constrained to be 
the same, and illustrate this with the case of silicon. Liquid silicon has a high density and 
is metallic. Crystalline silicon has tetrahedral covalent bonding. Amorphous silicon, 
which can be made by vapor deposition as well as by solidification from the liquid, also 
has local tetrahedral coordination with covalent bonding (Poate et al. [1987]). In its 
bonding type, solid amorphous silicon is therefore quite distinct from the liquid and 
cannot be regarded as a glass, which if formed would be dense and metallic. For 
example, this has the result that crystallization of amorphous silicon is a quite different 
transformation from crystallization of liquid silicon, as illustrated by the different 
temperature dependences of the two processes (Stiffler, Evans and Greer [1992]). 

Figure 2 shows schematically the changes in properties which occur when a liquid is 
cooled into the crystalline or glassy states. At the higher temperatures, range A, the melt 
is in equilibrium. It is extremely difficult (though possible under certain circumstances, 
Perepezko and Gleiter [1983]) to superheat a crystal above the thermodynamic melting 
point, so that metastable states are very rare in range A. In defining the boundary 
between ranges A and B, we take 7] (the equilibrium freezing temperature); this is not 
necessarily the same as T m , the melting temperature, for alloys with a solidus-liquidus 
gap, but we ignore the complexities arising from an interposed solid + liquid range. In 
effect, the diagram is for a system in which all the transformations are without solute 
partitioning. 

In range B, the melt is thermodynamically undercooled and will crystallize rapidly if 
a critical nucleus is provided. At least in the higher temperatures in this range the liquid 
has a low viscosity, and crystallization can be avoided only because fast cooling gives no 
time for nucleus formation. At the lowest temperatures in this range the liquid viscosity 
increases rapidly with falling temperature, and a glass can still be formed even in the 
presence of nuclei because the low atomic mobility stifles crystal growth. Devitrification 
studies (§ 4.3) provide evidence for copious nucleation occurring in the later stages of a 
quench without significant overall transformation of the liquid. In range B, the viscosity 
varies over some fifteen orders of magnitude, and though most of the range is not 
accessible to direct measurement (because crystallization would intervene during the 
measurement), the curve can be estimated by interpolation between experimentally 
accessible points in the neighborhood of P and Q. At all temperatures in range B the 
atomic mobility is adequate for the liquid to remain in configurational equilibrium (that 
is, in an internal equilibrium in which it has the lowest free energy of any state which it 
can reach by continuous change). 

Crystallization, if it occurs, involves a discontinuous change in properties (as shown, 
for example, by volume in fig. 2). Crystallization can occur only in range B, and if it is 
avoided on cooling, range C is reached. Range C is defined by the glass transition 
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Hg. 2. The glass transition: variation of several properties with temperature. See text. 

temperature T g , below which the liquid configuration is congealed in a pattern which 
corresponds to equilibrium at 7’ g . Experimentally, T g is close to the temperature at which 
the viscosity reaches 10 12 Pa.s. The nature of the transition at T g can be seen by 
considering the temperature dependence of liquid/ glass properties, for example volume 
and viscosity as shown in fig. 2. Above T g the properties show a strong temperature 
dependence, reflecting the configurational changes in the liquid. Below T g , however, 
atomic motions in the melt are so slow that during a quench (as presupposed in drawing 
fig. 2) there is no longer time for configurational changes; the mobile melt has become 
a rigid glass. Corresponding to the lack of configurational change in the glass, its 
properties have a rather weak temperature dependence, roughly matching the behavior of 
the crystal. 

It is important to recognize that T g is not a thermodynamically defined temperature; 
its location is determined wholly by kinetic considerations. If the quenching rate is 
reduced (but not so much as to permit crystallization in range B), then the liquid 
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configurations can stay in equilibrium to a lower temperature, that is (T g ), is lowered to, 
say (T g ) 2 . This slower quench entails a smaller volume (higher density) and higher 
viscosity for the glass. Thus different glassy states can exist; §4.2 deals with the 
structural relaxation by which glassy states can change. On simple annealing glasses 
densify and evolve towards an ideal glassy state represented by the extrapolation of the 
equilibrium properties, shown on fig. 2 for the viscosity. How dense could an ideal glass 
be? This intriguing question was first addressed by Kauzmann [1948]. He pointed out 
that extrapolation of liquid properties to lower temperatures would soon (that is, at 
temperatures not far below experimental T g s) lead to what he termed an apparent 
paradox in which the characteristic difference between liquid and crystal properties 
would be inverted. For example, the liquid would become denser than the crystal and it 
would have lower entropy. While not thermodynamically forbidden, structurally such an 
inversion seems very unlikely. Kauzmann proposed that some transformation, either an 
ideal (non-kinetic) glass transition, or some type of spontaneous crystallization, must 
intervene to prevent the paradox being reached. He suggested that the isentropic point (at 
which the extrapolated liquid entropy matches that of the crystal) could be taken to 
represent the ideal glass transition temperature and that the observed glass transition 
(found at finite rather than infinitely slow cooling rates) occurs somewhat before the 
ideal transition. There has been a steady flow of papers attempting to make sense of 
Kauzmann’s entropy catastrophes (as they are termed nowadays), not only in Kauz- 
mann’s original version at low temperatures, but also other catastrophes in the neighbor- 
hood of the liquid/ vapor transition, which the most recent study (Kishore and Shobha 
[1994]) has sought to relate to van der Waals’s well-known critical temperature. The 
Kauzmann paradox remains very much on the research agenda. 

In connection with fig. 2 we have described the classical features of glass formation, 
well known for oxides and polymers. Metallic systems are not so well known as glass- 
formers, yet experiments have consistently shown that they fit into the same picture. 
Around T g , metallic glass-forming systems show all the characteristics (volumetric, 
rheological, thermal) of the conventional glass transition illustrated in the figure. On 
annealing, the properties of metallic glasses change in a way characteristic of the 
structural relaxation of conventional glasses. Most importantly, some easy glass-forming 
compositions are sufficiently stable that measurements are possible in range B without 
crystallization. In this way the link between the glass and liquid is directly established 
(Kui and Turnbull [1985]). Measurements of properties and structures (and, as we will 
see later in this section, thermodynamic measurements) suggest that amorphous metallic 
phases produced by other methods are (within the range of structure expected) identical 
to glasses produced by liquid quenching. 

Of central importance in considering glass formation and the properties of glassy or 
amorphous phases is the atomic mobility. Low mobilities permit the glass to be formed 
and a glassy or amorphous phase to persist. Much of the discussion of atomic mobility 
in dense non-crystalline systems has centred on the concept of free volume, which we 
now examine. As seen in fig. 2, a faster quenched glass (1) has a lower density than a 
more slowly quenched one (2) or, in other terms, “1” has a larger free volume than “2”. 
Free volume was originally introduced and analysed by polymer physicists (Fox and 
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Flory [1951]); Ramachandrarao etal. [1977a, b] give a full discussion of the concept. 
In the simplest terms, free volume is the unfilled space between the atoms or molecules 
(which at once poses the problem of defining the contours of these entities); the central 
concern is with the space in excess of that which would exist even in ideal close- 
packing. The free volume can be defined in terms of thermal expansion starting from an 
ideal glassy state, or in terms of the hole theory of liquids in which free volume is 
postulated to consist of identifiable holes with a rather narrow size distribution. 

The crucial point is that, the smaller the free volume, the more sluggish is any 
structural rearrangement. The viscosity, 77 , varies with free volume approximately 
according to the Doolittle equation, originally derived for polymers: 77 = A exp(B// r ), 
where /j. is the relative free volume defined by (V T - V 0 )/V 0 , where V T , V 0 are the volumes 
of material at temperatures T and 0 K respectively. The equation is valid throughout 
range B in fig. 2, but cannot be applied in the glassy state itself. The contentious 
question whether the free volume is more or less uniformly distributed, or else concen- 
trated in holes which are probably at least the size of the smallest atoms in the glass, is 
still open, as reviewed by Cahn [1979]. Glass stability against annealing-induced 
crystallization (which will be considered further in § 4.3) is directly related to atomic 
mobility. Figure 3 shows that the crystallization temperature has a good correlation with 
hole formation enthalpy; a stabler glass has a higher formation energy for holes, 
therefore fewer holes form, the total free volume is smaller and thus crystallization more 
sluggish. In range B (fig. 2) the relative free volume remains in thermal equilibrium. On 
cooling it decreases particularly rapidly as T g is approached, leading to a very sharp rise 
in viscosity. Below T g free volume is frozen-in. In the glassy state, free volume is a very 
useful parameter for describing and analysing the property changes which occur On 
annealing; this is exemplified in the work of Spaepen and Taub [1983] on viscosity and 
of Chason and Mizoguchi [1987] on atomic diffusion (see §4.2). 



Fig. 3. Dependence of the measured crystallization temperature, T x , of various binary metallic glasses on the 
formation enthalpy of a hole the size of the smaller of the constituent atoms: the hole formation enthalpy is 
calculated from thermochemical data. (After Buschow [1982]). 
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Different types of glass-forming system, network formers like Si0 2 , ionic liquids and 
those showing van der Waals bonding, do show different types of behavior, reflecting the 
different sensitivities of the liquid structure to temperature changes. The so-called strong 
liquids like Si0 2 have a near-Arrhenius dependence of atomic mobility (characterized by 
the viscosity). On the other hand an organic liquid like toluene is a fragile liquid with a 
strongly non-Arrhenius temperature dependence following the Doolittle equation. The 
distinction is fully discussed by Angell [1995]. Metallic liquids fall between these 
extremes, nearer to the fragile limit. It is characteristic of such systems that there is a 
large step in specific heat at T g . As shown in fig. 2c, the specific heat of a glass-forming 
liquid alloy is significantly greater (perhaps as much as two times greater just above T g ) 
than that of the corresponding crystalline solid. Integration of liquid and solid specific 
heats over the temperature interval between T { and T g shows that the heat of crystalliza- 
tion of the glass at T g should be significantly less than the latent heat of melting at 7); 
this is confirmed by direct measurement and provides a way of estimating the liquid 
specific heat in the undercooled region when values are not otherwise available 
(Garrone and Battezzati [1985]). The large specific heats of glass-forming liquids 
relate directly to the loss of entropy on cooling. Glass-forming alloys typically show a 
strongly negative heat of mixing of their components. This heat is greater at lower 
temperatures, again reflecting the increased order in lower-temperature liquids. Figure 4 
shows the entropy difference between liquid and crystalline states in an easy glass-forming 
system for which reliable specific heat data are available. The ideal glass transition tempera- 
ture (zero entropy difference) is just a little below the measured glass transition (determined 
as in fig. 35, below). The proximity means that thermodynamic parameters can be used to 
estimate glass transition temperatures, as shown by Battezzati and Greer [1987]. 



Temperature (K) 


Fig. 4. Excess entropy of liquid Pd^Ni^a, as a function of temperature, calculated from specific heat data of 
Evans etal. [1987]. The excess entropy has the value of the entropy of fusion, A at the eutectic temperature, 
T c , and extrapolates to zero at the ideal glass transition temperature. This is seen to be very close to both the 
observed kinetic glass transition, T g , and the ideal transition temperature, T a , calculated from the hole theory of 
Dubby and Ramachandrarao [1984]. 
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The thermodynamics of the glassy state itself can be estimated using specific heat 
measurements as above. In addition, electrochemical (E.M.F.) measurements on the glass 
can be used to determine its thermodynamic parameters directly; the procedures have 
been discussed by Bormann and ZOltzer [1992], CALPHAD (CALculation of PHAse 
Diagrams) methods involve the determination of a consistent set of thermodynamic 
parameters for an alloy system by fitting the phase diagram. When this takes account of 
specific heat and other data (such as E.M.F. measurements) reliable estimates are 
possible, even in the undercooled liquid/glass regime. Saunders and Miodownik 
[1986] first successfully applied this approach to interpret amorphous phase formation by 
solid-state reactions, and it has since been developed further (Bormann etal. [1988]). 

A striking example of the link between the liquid and an amorphous alloy formed in 
the solid state, and of the use of thermodynamics in interpreting experimental results, is 
provided by inverse melting. This phenomenon occurs in the metastable body-centred 
cubic solid solution in the Cr-Ti system. When a solid solution of Cr 55 Ti 45 is annealed 
below ~ 750°C it transforms polymorphically to an amorphous phase of the same 
composition. Yet the bee phase itself forms from the liquid. Thus if the amorphous phase 
is thermodynamically continuous with the liquid, its Gibbs free energy must equal that 
of the bee at two temperatures (fig. 5). The upper temperature is the normal melting/ 
freezing point. At the lower temperature there is an equilibrium between the amorphous 
and crystalline phases, but the amorphous phase is the more stable phase below the 
temperature; this is an inverse melting point, because in equilibrium at this point a crystal 
would melt on cooling. The Cr-Ti system has been subjected to a CALPHAD analysis 
and this predicts behavior of the form shown in fig. 5. The corresponding metastable 
phase diagram is shown in fig. 6; in this diagram only three phases are included — the 



Fig. 5. Free energy curve of the liquid/amorphous phase with respect to the bcc solid solution for Ti-55 at% 
Cr. Hep titanium and bcc chromium are taken as reference states at each temperature. T m and T im denote the 
melting and the inverse melting temperatures, respectively (After Bormann [1994]). 
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Fig. 6. Metastable phase diagram of the H-Cr system as calculated by the CALPHAD method, considering 
only equilibria between the hep and bcc solid solutions and the amorphous or the liquid phase (After Bormann 
[1994]). 


hep and bcc solid solutions and the liquid. Intermetallic compounds are excluded. The 
metastable diagram is relevant to experiments on solid-state amorphization because at the 
temperatures used the compounds have difficulty in nucleating. As shown in fig. 6, the 
amorphous phase appears at low temperatures. The thermodynamic analysis shows that 
this is really the same phase as the liquid. Consideration of the temperature dependence 
of the free energies in fig. 5 shows that the liquid or amorphous phase at the inverse 
melting point must have a lower entropy than the bcc solid solution. This remarkable 
result is possible because the amorphous phase is strongly chemically ordered, whereas 
the solid solution is disordered. The greater chemical order in the amorphous phase 
outweighs its greater positional entropy. Given the chemical order in the amorphous 
phase, it is possible to regard it as a type of intermetallic compound, as its phase field in 
fig. 6 would imply. 

The analysis of inverse melting was concerned with a transformation occurring near 
thermodynamic equilibrium (albeit a constrained equilibrium in which intermetallic com- 
pounds do not appear). Other types of solid-state amorphization have prompted consider- 
ation of what the ultimate limit to crystalline stability might be — if a liquid or 
amorphous phase did not nucleate, how far could the crystal be superheated? Fecht and 
Johnson [1988] have considered this, and show that the proliferation of vacancies in the 
crystal would lead to its entropy becoming greater than that of the liquid at a finite 
temperature (roughly twice the thermodynamic melting temperature T m ). In a further 
application of Kauzmann’s concept of a structural paradox, they suggest that this point 
would represent the absolute stability limit of the crystal. More generally, how far could 
the crystal exist beyond its equilibrium phase field? Its thermodynamic stability could be 
reduced not only by heating, but also for example by changing its composition or by 
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injecting defects (by irradiation or deformation). These points have been considered by 
Fecht, Desre and Johnson [1989], who suggest that melting or amorphization could 
become a continuous second-order transition at high enough solute contents. It is as yet 
unclear whether any observed amorphization fits this pattern, or occurs closer to 
thermodynamic equilibrium. 

2.2. Compositions of amorphous alloys 

Amorphous alloys are now known for many compositions. Within a given alloy 
system, the range of compositions which can be made amorphous depends on the 
production method and conditions. For example, with rapid liquid quenching, the range 
of glass formation is wider for higher cooling rates. However, even at extreme cooling 
rates (for example in pulsed laser quenching, §3.5), the glass-forming composition 
ranges remain limited. Table 1 gives some examples of compositions which can be made 
amorphous by ‘standard’ techniques such as melt-spinning (§ 3.1). More complete listings 
may be found in Takayama [1976], Suryanarayana [1980] and Johnson [1986]. 

The main categories are: 

(i) Late transition metal -metalloid. This is probably the most studied category of 
amorphous alloys, as it includes the Fe-, Co- and Ni-based alloys which are commercial- 
ly exploited because of their soft magnetic properties (Smith [1993]). The alloys consist 
of one, or a combination of, the late transition metals and one, or a combination, of the 
metalloids. The late transition metals are here considered to be in groups VIII and IB. 
The metalloids are in groups IIIB, IVB or VB, including A1 and Sb. Amorphous alloys 
are mostly formed by rapid liquid quenching, and the glass-forming compositions are 
centered around deep eutectics, typically in the range 15 to 25 at.% metalloid. Binary 
amorphous alloys in this category are common, but the glass-forming ability is generally 
found to be higher when there are more components, probably because the eutectics are 
deeper. 

(ii) Early transition metal -metalloid. Early transition metals from groups IVA, VA 
and VIA also form amorphous alloys in combination with metalloids, with compositions 
again centered on deep eutectics in the range around 20 at% metalloid. Partial substitu- 
tion of an early transition metal for the late transition metal in category (i) can improve 
the glass-forming ability. When the alloys contain a refractory metal, the crystallization 
temperatures can be particularly high, and this has led to interest in early transition 
metal-based glasses as diffusion barriers (Dorner et al. [1991]). 

(iii) Early transition metal - late transition metal. These alloys of group VIII or IB 
elements with group IVA, VA or VIA elements form the second most-studied category. 
When they are made by rapid liquid quenching, the glass-forming composition ranges are 
typically much wider than in the metalloid-containing alloys (categories i and ii), 
covering several eutectics and intermetallic compounds. Although the glass-forming 
ability is often greatest around the centre of the composition range, Fe, Ni and Co alloys 
with as little as 9 at% Zr have been made amorphous. These alloys have attracted much 
attention because they are the main category which can be made amorphous by solid- 
state reaction (§ 3.7). 
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Table 1 

Examples of glass-forming alloys", classified into the categories discussed in the text. 


Type 

Alloy 

Glass-forming range References 

(i) Late transition 

F6]oo- x B x 

x=12-25 

Hasegawa and Ray [1978] 

metal-metalloid 

Pdio^Si, 

x= 14-22 

Lewis and Davies [1976] 


Niioo-xB* 

x=17-18.5, 31—41 

Donald and Davies [1980] 



x=34-36.5 

Srivastava et al. [1972] 


^80^13^7 


Duwez [1967] 


FejoNi^B^ 


Luborsky [1977] 


Fe 40 Ni 40 P 14 B 6 


Polk and Chen [1974] 


^'°70.sF e 4jSi 1 jB , 0 


Makino etal. [1990] 


Pd 4€ Ni 40 P 20 


Maitrepierre [1970] 

(ii) Early transition 

Ti[oo-xSi x 

- x= 15-20 

Inoue and Masumoto [1980] 

metal-metalloid 

^ e 6sS*35 


Fischer et al. [1978] 


w 8 „N M 


DOrner etal. [1991] 


^60^ r 2(>B20 


Fischer etal. [1978] 

(iii) Early transition 

Nbioo-.Ni, 

x=40-70 

Ruhl et al. [1967] 

metal-late transition metal 

CUiOO-xZTx 

x=25-60 

Ray etal. [1968] 



x=10-12, 33-80 

Clemens et al. [1984], 




Gregan etal. [1981] 


Fe 10 &- x Zr x 

x=9, 72, 76 

Horvath etal. [1988] 



x=40-70 

Giessen etal. [1976] 


Ta 55 Rh 45 


Fischer etal. [1978] 


^ r 6oNi25Alj 5 


Inoue et al. [1990] 

(iv) Aluminum-based 

Alioo-xLa* 

x=10, 50-80 

Inoue etal. [1988b] 


Al7oNi 15 Si 15 


Dubois etal. [1992] 


A^CujjVjo 


HolZer and Kelton [1991] 


Al 8 5Ni 10 Zr 3 


Dubois et al. [1992] 


AlgoNitoYjo 


Inoue etal. [1988a] 


Al S 5Fe l0 La 5 


Inoue etal. [1988a] 


Alg 5 Ni 5 Y 8 Co2 


Inoue etal. [1990] 

(v) Lanthanide-based 

^ 3 100-x^^x 

x= 18-34 

Agyeman et al. [1979] 


La 10 o-x^ e x 

x=17-22 

Agyeman et al. [1979] 


Laioo-,Au, 

x= 18-26 

Johnson et al. [1975] 


Gd 100 _ x Fe x 

x=32-50 

Buschow [1979] 


La 55 Al 25 Ni 20 


Inoue et al. [1989] 

(vi) Alkaline-earth 

Mg 100 -,Zn s 

x=25-32 

Calka etal. [1977] 


^ a i00-x^^x 

x=12.5— 47.5 

Sommer et al. [1978] 


Y 10 


Kim etal. [1990] 


Be 4 oZr 10 Ti 50 


Hasegawa and Tanner [1977] 

(vii) Actinide-based 

^100-x^°x 

x=24-40 

Ray and Musso [1976] 


" All compositions are quoted in at.%. The amorphous alloys have been formed by a variety of techniques, 
mostly rapid liquid quenching. The compositions given are those of representative alloys and do not imply 
that amorphous phase formation at other compositions in the same system would not be possible. For binary 
alloys, however, some indication is given of glass-forming ranges. 
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(iv) Aluminum-based alloys. Aluminum-based amorphous alloys have attracted 
attention only quite recently, when ductile compositions with >75 at.% A1 were dis- 
covered; these are of interest because of their high specific strength (§4.4.1). The alloys 
most studied have 5 to 15 at% of a lanthanide and 5 to 15 at% of a late transition metal. 
Amorphous phase formation is also found in the binary aluminum-lanthanide systems, 
at approximately 10 and 50-80 at% lanthanide. Other amorphous aluminum alloys are 
based on additions of an early transition metal and a late transition metal, or of a late 
transition metal and metalloid. 

(v) Lanthanide-based. Alloys of a lanthanide with 10 to 35 at.% metalloid can form 
amorphous alloys. Also, binary alloys of the lanthanides, or Y, with late transition metals 
(groups VIU and IB) have deep eutectics at lanthanide-rich compositions, and some 
glass-forming ranges have been found around these. An alloy of this type (La-Au) was 
the first to demonstrate solid-state amorphization by interfacial reaction (Schwarz and 
Johnson [1983]). Of particular interest are alloys from the same Al-lanthanide-late- 
transition metal systems as yield ductile Al-rich amorphous alloys. When lanthanide- 
based, e.g., La 55 Al 25 Ni 20 (Inoue etal. [1989], these alloys exhibit unusually high glass- 
forming abilities. The critical cooling rate for glass formation is exceptionally low < 10 
K/ s) and the amorphous phases are remarkably resistant to crystallization, crystallizing 
only at temperatures ~ 100 K above T g . (Most amorphous alloys crystallize rapidly at or 
slightly above T g .) The very good glass-forming ability has the consequence that bulk 
amorphous samples (for example, rods of diameter up to ~ 1 cm) can be cast (Inoue et 
al. [1993]), in marked contrast with the thin sections associated with the rapid quenching 
required for more normal glass-forming alloys. 

(vi) Alkaline-earth based. The group IIA metals can have quite wide composition 
ranges for amorphous alloy formation when alloyed with a metal from groups IB, IIB, 
IIIB, IVA, VnA or VIII. A ternary addition of a lanthanide can greatly improve the 
glass-forming ability (Kim etal. [1990]). For example, compositions such as Mg M Cu 25 Y )0 
show glass-forming ability similar to that of the lanthanide-aluminum-transition metal 
alloys discussed in (v), and can similarly be cast in bulk (Li et al. [1992], Inoue et al. 
[1993a]). 

(vii) Actinide-based. A number of amorphous binary alloys based on actinides are 
known, which do not fit into a clear pattern. 

The examples in table 1 illustrate the main types of amorphous alloys, but there are 
many others. The table emphasizes binary compositions, but many of the compositions 
which form amorphous phases most readily have three or more components, and some 
of these may be difficult to categorize. As amorphous alloys are optimized for various 
applications, multicomponent compositions are developed. An example is the composition 
Fe 73 5 Cu 1 Nb 3 Si 1 3 5 B 9 , designed to devitrify to a fine-grained structure with good soft 
magnetic properties (§ 4.3). Like most complex compositions, however, this is based on 
a simple amorphous alloy type, in this case type (i). Other compositions, such as 
Zr 412 Ti 138 Cu 125 Ni 10() Be 22 5 , have been developed to obtain exceptionally good glass- 
forming ability (Peker and Johnson [1993]. These recent findings have been put in their 
context in a reportage by Greer [1993]. Subsequently, Inoue [1995] has found a whole 
series of easy-forming glasses, quaternary and quinary compositions of the type 
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Zr 65 Al 75 (Co,Ni,Cu) 27 5 , e.g., Zr 65 Al 75 Ni 10 Cu 175 which, like Johnson’s alloy, can be 
prepared as an amorphous cylinder as much as 16 mm in diameter. In an alloy with 
many components crystallization may be strongly inhibited; this is sometimes referred to 
as the confusion principle. The confusion is increased if the atoms are of differing sizes. 
In the first example given, zirconium and titanium atoms are large, copper and nickel of 
intermediate size, and beryllium has the smallest atoms of any metallic element. Alloys 
of this type also have an exceptionally wide temperature gap between the glass transition 
and the temperature at which crystallization begins. Other than these broad generaliz- 
ations, other criteria for selecting compositions to form amorphous phases readily are 
considered in the next section. 

We show here (fig. 7) seven phase diagrams, one from each of the categories in table 
1, each with the glass-forming ranges (for cooling rates such as are achieved in melt- 
spinning). It is very clear that glasses form preferentially near deep eutectics, where 
viscosity is high before the thermodynamic freezing temperature is reached on cooling, 
although the glass-forming ranges are usually not symmetrical about the eutectic composition. 
These phase diagrams link with several of the criteria discussed in the next section. 

2.3. Criteria for formation of amorphous phases 

In this section, we concentrate initially on glasses made by rapid solidification; those 
made by irradiation, mechanical milling or interdiffusion will be discussed afterwards. 
The generic term most commonly used to represent what we are trying to interpret here 
is glass-forming ability, or GFA. This is usually measured by the critical cooling rate, R c , 
which is necessary to turn a melt into a glass, avoiding crystallization: when this is in the 
experimentally accessible range for mainline rapid solidification processes such as melt- 
spinning, approximately up to 10 7 K/s, then the alloy can be vitrified by such a process. 
If theory tells us that R c is higher than this, then at best tiny amounts of glass can be 
made by some much faster quenching process such as in-situ quenching with nanosecond 
or picosecond laser pulses (§ 3.5). 

Motorin [1983] calculated from first principles the expected homogeneous 
nucleation rates of crystals in supercooled pure metal melts, using known physical 
parameters as input. For silver, copper nickel and lead, minimum cooling rates of 
10 I2 -10 13 K/ s were derived, which demonstrates clearly enough why no pure metals have 
been amorphized by melt-quenching. Not even picosecond laser-quenching has been 
successful in this respect. Thus practically interesting GFAs refer only to alloys — as 
mentioned in § 1.3, Turnbull [1981] did propose that crystallization of pure amorphous 
metals would come into the category of transformations which do not require thermal 
activation and thus cannot be suppressed. 

The range of ideas and approaches which have been proposed to attain an understand- 
ing of why some alloy systems can be vitrified and others not, and of what determines 
the composition ranges over which metallic glasses can be made in practice, is 
disconcertingly broad. There are almost as many criteria for glass formation as there are 
for the good life, and almost as little agreement among the protagonists as to which is 
preferable. 
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Fig. 7. Equilibrium phase diagrams for examples of each of the categories (i) to (vii) listed in table 1. Glass- 
forming ranges are indicated. 


An early attempt to come to grips with this difficult issue was by Polk and Giessen 
[1978], The best comprehensive overview of the development of these criteria is by 
Ramachandrarao [1984]. Cahn [1986a], Hafner [1986] and Massalski [1986] have 
published more specialised treatments. Here, only an abbreviated review is feasible. 

In spite of a tendency by some theorists to regard this claim as culpably naive, there 
can be no doubt that atomic size is the single factor that plays the major role in 
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determining GFA. The recognition that the constituent atoms in a melt must have 
sufficiently different Goldschmidt radii to permit glass formation goes back to very early 
experiments on vapor-quenching, in a classic study by Mader el al. (1967). A critical 
radius mismatch of ~ 15% fitted the results, and this value has been accepted ever since. 
This criterion might be termed an anti-Hume Rothery criterion, since Hume-Rothery’s 
celebrated law laid it down that a radius mismatch not exceeding 15% was necessary for 
extensive solid solution formation in terminal alloy phases. — Mader himself had 
simulated glass formation by jiggling populations of wooden balls of different sizes on 
a tray, and found that the radii had to differ sufficiently for “crystallization” to be 
prevented; subsequently, Simpson and Hodkinson (1972) performed the first of several 
simulations of this kind with rafts of soap bubbles of - 1 mm diameter which better 
simulate real interatomic forces, and again found a critical radius mismatch of ~ 15%. 

The next stage was a series of attempts to create GFA maps, in which radius 
mismatch was plotted along one axis of a graph, and some measure of bond strength 
such as the negative heat of mixing or heat of evaporation along the other — the idea 
being that strong interatomic bonds render crystallization more difficult. One of several 
such attempts in this period was by Giessen (1981). Such graphs suggest that the bond 
strength does indeed play a part, but it is a “weak” variable compared to the size 
mismatch. It should be noted, however, that strong bonding can itself modify an effective 
atomic size: this was demonstrated by Bechet and Regazzoni [1991] who showed that 
the effective size of late-transition-metal atoms in amorphous aluminum-rich alloys is 
significantly reduced by the strong bonding between the elements. 

A very important development of the radius mismatch approach is due to Egami and 
Waseda [1984], followed by a further development of the same ideas by Egami and 
Aur [1987]. These authors were interested in calculating the microscopic stress levels at 
the scale of individual atom groups in a glass or a crystalline solid solution. Briefly, they 
concluded that in a glass, neither the local stress fluctuations nor the total strain energy 
vary much with solute concentration when these quantities are normalized with respect 
to the elastic moduli, whereas in a crystalline phase the strain energy rises steadily with 
solute content. Thus, beyond a critical solute content, glass becomes favored over crystal 
formation, and in this way, not only GFA but also the glass-forming composition range 
can be calculated. 

The conclusion is that c B mm (v B -v A )= 0.1, where c B mm is the minimum solute concen- 
tration of B in A required for a glass to form and the v’s are atomic volumes. The 
agreement with observation is good for many systems, though melt-quenching by normal 
melt-spinning is not necessarily fast enough to give agreement with Egami and Waseda’s 
criterion. Figure 8, for the Ni-Nb system, shows. at the bottom the results of applying 
Egami and Waseda’s criterion, and also three different experimentally determined glass- 
forming ranges for different quenching methods. The “faster” methods agree very well 
with the theoretical criterion. 

Several other theories, of a thermodynamic nature, based on atomic volumes have 
appeared. One version is due to Ramachandrarao [1980]: he estimated the departure 
from ideality of melts, in terms of a kind of deviation from a liquid Vegard’s Law, and 
concluded, both theoretically and by comparison with experiment, that melts with a large 
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Fig. 8. The Ni-Nb phase diagram, with experimentally determined glass-forming ranges for three methods of 
progressively diminishing effective quenching speed, together with the range predicted by Egami and Waseda’s 
theory. 


deviation, in the sense of having an anomalously small atomic volume, were most likely 
to form glasses. This finding was interpreted in terms of the enhanced viscosity 
associated with a small atomic volume (which implies little free volume available to 
support atomic motion in the melt). — Yavari et al. [1983] independently developed a 
closely related idea, suggesting that a zero (or negative) change of specific volume on 
melting of a crystalline species favors glass formation on subsequent rapid solidification. 
In effect, their idea is that if a crystal is denser than the melt from which it grows, then 
in growing it rejects free volume into the melt and thereby reduces its viscosity; thus 
crystal growth becomes self-catalytic. Contrariwise, a less dense crystal raises viscosity 
and favors glass formation. 

Theoretical physicists have developed a number of models to rationalize GFA. The 
best known is that due to Nagel and Tauc [1977], They proposed that a glass is most 
likely to form if its electronic energy lies in a local metastable minimum with respect to 
composition change. They showed that if the structure factor corresponding to the first, 
strong peak of the diffuse X-ray scattering curve, K p , satisfies the relationship K p - 2 k F , 
where k F , is the wave vector at the Fermi energy, then the electronic energy does indeed 
occupy a local minimum. A number of familiar glasses, in the Au-Si, Au-Ge and Co-P 
systems for example, accurately obey this criterion, but there are also exceptions, e.g., 
some obscure glasses in the Cs-O and Rb-O systems. This kind of approach, using 
modern approaches such as pseudopotential theory, has been taken much further in recent 
years, but a discussion would take us too far here. For further details, the reader is 
referred to reviews by Hafner [1981, 1986]. However, as we shall see below, the 
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ability, demonstrated by Hafner, to make theoretical estimates of the glass transition 
temperatures of unknown glasses offers a novel way to estimate GFA. 

Theories based more explicitly on the need to prevent nucleation of a crystalline 
phase take two main forms. First, there are models based on an explicit calculation of 
homogeneous nucleation rates (this begs the question whether heterogeneous nucleation 
plays a role). The standard approach here is due to Davies (1976). He adapts a theory of 
isothermal crystallization kinetics due to Uhlmann and calculates the cooling rate 
necessary to bypass the “nose” of the time-temperature-transformation plot thus calcu- 
lated. (The nose is the minimum time required to initiate homogeneous nucleation, at 
some temperature well below the thermodynamic freezing temperature). The principal 
difficulty here is to estimate the viscosity of a supercooled melt, but there are a number 
of empirical relationships which allow a rough estimate to be made. (There is now 
available a critical survey of known viscosities of molten metals and alloys, by 
Battezzati and Greer [1989].) Figure 9 shows some calculated critical quenching rates 
obtained in this way, and fig. 10 shows how the calculated value of f? c varies with 
composition across a phase diagram. It will be seen from this second figure that, as a 
number of people have pointed out and as follows from Davies’ theory, the ratio T^T ( 
(where T g is the glass transition temperature, T s is the thermodynamic freezing tempera- 
ture) is a crucial figure of merit in determining GFA, for purely kinetic reasons associ- 
ated with the need to avoid crystallization. The lower this ratio, the more viscous the 
melt becomes before it is ever undercooled and the more difficult crystallization 
becomes, thus enhancing GFA and reducing R e . 

A somewhat related, more thermodynamically biased approach derives from a paper 
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Fig. 9. Calculated critical quenching rates, R c , for glass formation, and corresponding estimated limiting ribbon 
thickness for wholly glassy ribbons. Key to alloys: (1) (2) Au 73 Ge I4 Si 5 ; (3) Fe 83 B 17 ; (4) Fe 415 Ni 413 B 17 ; 

(5) Co 75 Si 15 B,„; (6) Fe^Si^B,,; (7) Pe^C,; (8) Pd 82 Si, 8 ; (9) Ni 63 Nb 37 ; (10) Pd 77 . s Cu 6 Si lss ; (11) Pd^Ni^Pd*, 
(After Davies [1978]). (An updated version of this plot has just appeared, Davies [1995].) 
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Fig. 10. Equilibrium freezing temperature, T ,, glass transition temperature, T s , figure of merit, T/T, and 
calculated critical quenching rate, R c , for a range of Pd-Si solid solutions. 


by J. W. Cahn (1980) on the thermodynamics of metastable equilibria. From this, the 
role of the relative values of T g and T 0 becomes clear; here, T 0 is the temperature at 
which liquid and solid of the same composition have the same free energy. A glass forms 
readily if T g > T 0 \ what this means is that the melt rigidifies before reaching the tempera- 
ture at which freezing without composition change (and thus without long-range 
diffusion), also known as solute-trapping, becomes possible. The implications of this are 
clearly set out in a review by Massalski (1986) and applied to a detailed analysis of 
GFA across the Cu-Ti system in a paper published about the same time (Massalski and 
Woychik [1985]). The problem is that, before a glass has actually been made, its T g is 
not known. Here, Hafner’s [1983, 1986] ability to calculate glass transition temperatures 
comes into its own. In fig. 11, for the Ca-Mg system, the calculated T a values are shown, 
and also two estimates for T s , an upper limit based on the “entropy-crisis” or Kauzmann 
model, and a lower limit based on a free-volume model. (For a fuller explanation, see 
Hafner (1983).) Theoretical composition ranges for glass formation are shown for two 
cooling rates; here the criterion T g >T 0 is the central consideration; agreement with 
experiment is quite good. Another detailed thermodynamic analysis of metallic glass 
formation near eutectic troughs was published by Highmore and Greer [1989]. 

Another set of theories is based on the postulate that metallic melts are not homo- 
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Fig. 11. Calculated phase diagram, T a values and upper and lower limits (7^,7^) for 7 g for the Ca-Mg system, 
with theoretical and experimental glass-forming ranges. (After Hafner [1983, 1986]). 


geneous in composition but contain compositional clusters. Such clusters are held to aid 
crystal nucleation, and their absence or weak development to aid glass formation. 
Contrariwise, short-range order (the converse of clustering), which is believed (though 
there is a singular absence of diffraction evidence on this point) to increase with falling 
temperature of a glass, just as it does in a crystalline solid solution, should enhance GFA. 
(See, for instance, Wagner’s [1986] survey of SRO in metallic glasses; at least, it has 
been established for Cu-Ti that a glass has higher SRO than the melt from which it is 
quenched (Sakata etal. [1981]).) Indeed, there is some evidence that some Cu-Ti alloys 
show clustering in the melt whereas the glass has SRO. This field of research, which has 
a large literature, has been somewhat beset by controversy: for further details, the reader 
is referred to Ramachandrarao [1980, 1984], 

Finally, we return to a less sophisticated “figure-of-merit” approach which has been 
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quite successful in rationalizing GFA. Donald and Davies [1978] long ago recognized 
the awkwardness of theories which related GFA to quantities (such as T/T f ) which 
involve T g , when normally this is unknown until a glass has been made and examined. 
They proposed that a good test of GFA is the extent to which the equilibrium freezing 
temperature of an alloy melt is depressed below the ideal value, which they calculated 
simply by linear interpolation between the freezing points of the constituent metals. This 
simple method was then developed by Whang [1983], who took into account the 
modification required to allow for possibly extensive solid solution of one metal in the 
other, in the solid state. This is necessary because solid solubility reduces the slope of 
the T 0 versus composition curves (like those shown in fig. 1 1). Whang generated figure- 
of-merit maps in which one axis gave T lr , defined as ^R = AT/T l °, where AT is the 
difference between the ideal freezing temperature for an alloy (T L °), defined as above, 
while along the other axis is C et , a simple measure of the amount of the solubility of the 
minor constituent in the major at the eutectic temperature — a large value of T lr implies 
a severely depressed liquidus, while a small C e r implies a steeply sloping T 0 versus 
composition curve ... both factors favouring easy glass formation. In fact, the maps so 
generated show a clear boundary between glass-forming and non-glass-forming alloys — 
Dubey and Ramachandrarao [1990] have developed Whang’s model to show that 
most eutectic phase diagrams can be expected to show an asymmetry of GFA, in the 
sense that glass formation is easier just to one side of the eutectic composition than on 
the other; the melt-spinning range indicated in fig. 8 shows an example of this, and 
several of the diagrams in fig. 7 exemplify the same feature. 

Whang’s theory was then adapted by Tendler [1986] to show, for a series of Zr- 
based alloys, that alloys which according to Whang’s criterion should be good glass- 
formers are also those in which there is anomalously fast diffusion, in the special sense 
introduced by Warburton and Turnbull [1975] (see ch. 7, §4.3.2). Figure 12 shows 
one of Tendler’s figures for a series of Zr-M alloys. All the alloys showing fast diffusion 
(Zr with Cr, Mn, Fe, Co, Ni, Cu and Be) are also glass-formers. For fast diffusion, the 
solute atom must be much smaller than the solvent atom (for details see Tendler’s paper) 
and this clearly also favors GFA. In fact, some years ago Turnbull [1976] had 
predicted just such an association between GFA and fast diffusion. Care must of course 
be taken in setting up such diagrams as fig. 12 because, as shown by Bechet and 
Regazzoni, large radius ratio disparities (leading to strong bonding and to fast diffusion) 
can modify effective atomic sizes. Fast diffusion is also linked to glass-forming 
mechanisms through the fact that in the solid-state amorphization reaction (SSAR), one 
atomic species diffuses very much faster than the other. 

This by no means exhaustive overview of the models and theories that have been 
advanced to make sense of glass-forming systems and ranges might well seem discourag- 
ing, because at first sight they are mutually exclusive. In fact, hidden cross-connections 
undoubtedly exist: the linkage between Whang’s thermodynamic approach (related to 
terminal solid solubilities) and Tendler’s association between fast diffusion and GFA 
clearly comes from a correlation of both solid solubilities and fast diffusion with atomic 
size ratios. Perhaps in due course even the electronic criteria studied by Nagel and Tauc 
and by Hafner may prove to be linked with some of the other ideas, e.g., the free volume 
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Fig. 12. A number of Zr-M alloys plotted on a Whang graph, separating glass-formers (GFA) from non-glass- 
foimers. (After Tendler [1986]). 


approach due to Ramachandrarao and Yavari. Our own view is that simple geometry — 
atomic sizes — will prove to be the main criterion that in various subtle ways incorpor- 
ates the others. 

2.3.1. Criteria for amorphization by irradiation, and mechanically aided and 
induced amorphization 

Some alloys can be amorphized either by irradiation with electrons or heavier 
particles (see §3.4) or by heavy deformation, if need be followed by annealing (see 
§ 3.8). In § 3.4, below, a brief account is offered of an unresolved controversy regarding 
the relative roles of two amorphization criteria — the need to destroy the (chemical) 
long-range order in an irradiated compound to a sufficient degree versus the need to 
attain a critical concentration of point defects. The criteria for amorphizability in the case 
of mechanically aided or induced amorphization . (see for instance Schultz and 
Hellstern [1987]) are generally assumed to be similar to those applying to irradiation, 
but no critical comparison of the two families of techniques appears to have been made. 
There is an important new approach, however, due to Beke etal. [1991]. They estimated 
the additional elastic (mismatch) energy which is stored in an ordered solid solution 
when its long-range order is destroyed, whether mechanically or by irradiation; this 
depends, of course, inter alia on the atomic sizes. This energy is then compared with the 
ordering energy, and it was concluded that amorphization is possible if (a) the (virtual) 
critical disordering temperature is above the melting temperature, and (b) the ratio of the 


References: p. 1818. 




1748 


R. W. Cahn and A. L. Greer 


Ch. 19, §3 


elastic mismatch energy to the ordering energy exceeds a critical amount. A comprehen- 
sive test against experimental facts gave widespread though by no means universal 
agreement. 

3. Practical methods of creating metastable phases and microstructures 

In what follows, various techniques are outlined, both those involving solidification 
of a melt and those which operate entirely in the solid domain. In exemplifying what can 
be achieved, emphasis is placed upon amorphization, but most of the methods have of 
course also been used to make metastable crystalline phases. 

3.1. Rapid quenching from the melt 

All casting methods involve a chill zone close to the mould surface; the metal in the 
chill zone cools more rapidly than the interior of the casting, and its grain morphology 
and composition are accordingly different. Such a zone will normally cool at a rate of up 
to some hundreds of K/s. The modern techniques of Rapid Solidification Processing 
(RSP), with which this chapter is principally concerned, however, involve cooling rates 
in the range 10 4 — 10 7 K/s, and require an approach quite distinct from the traditional 
casting methods. 

Atomization of liquid metal to form small droplets — one of the methods widely used 
today — goes back to the last century, and as recounted by Jones [1981] and more 
recently by Cahn [1993] in historical surveys, several investigators in the period 
1925-1955 developed variants of chill-casting which gave estimated cooling rates up to 
10 s K/s, while several studied extension of solid solubilities resulting from such cooling 
rates. However, the remarkable present-day developments in RSP were initiated by 
Duwez in 1959-60. Duwez, in California, set out to establish whether a continuous 
metastable series of solid solutions could be created in the Cu-Ag system, to bring it into 
line with the Cu-Au and Au-Ag systems which both show continuous solid solubility, 
as required according to the familiar Hume-Rothery rules. For this purpose, he argued, 
the melt would have to be frozen fast enough to inhibit nucleation of two distinct fee 
phases. Having unsuccessfully tried a number of other approaches, he designed the 
Duwez gun, a device in which a gaseous shock wave atomizes a drop, ~ 10 mg, of molten 
alloy and projects the microdroplets into contact with a copper substrate, or chill block, 
to produce small foils, or splats. The technique was by some given the onomatopoeic 
nickname splat-quenching-, this term, disapproved of by Duwez, fell out of use together 
with the original gun, as improved methods were developed. 

In 1960, Duwez and his collaborators were successful in making a continuous 
metastable series of Cu-Ag alloys without any two-phase region, and in the same year 
also discovered the first metallic glasses made by rapid quenching, in the Au-Si and 
Au-Ge systems (Duwez etal. [I960]). The essential condition for rapid quenching from 
the melt, as realized in the Duwez gun, was that small particles of liquid metal should 
rapidly be flattened into a thin sheet in intimate contact with a good heat sink. More 
generally — and this was the real importance of the innovation wrought by Duwez — 
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only by starting from the molten metal was it possible to achieve really high rates of 
cooling in the solid. The importance of RSP arises both from the changes brought about 
in the freezing process and from the more rapid cooling of the solid so formed — but it 
should be noted that not all melt-quenching methods in fact create fast cooling in the 
solid stage. The inadequacy of melt-spinning in this respect emerges from some striking 
observations concerning age-hardening kinetics (see §5.1). 

The Duwez gun was used for a decade to investigate the crystallographic and 
calorimetric characteristics of numerous metastable splat-quenched crystalline phases and 
of a few glasses, notably Pd 80 Si 20 which was for a number of years a standard composi- 
tion for those interested in the properties of metallic glasses. Duwez [1967] has 
described these pioneering days, and also the background to his original experiments. 

The splat-quenching principle was developed to provide faster cooling rates still, in 
the form of the rotating-wing quencher, attempted by various early investigators and 
designed into an effective form by Sommer et al. [1987]: here a disc fitted with vertical 
wings rotates very rapidly about a vertical axis, in high vacuum, and a drop of melt is 
allowed to fall into the rotating wings. The instrument has recently been used to quench 
out disordered Ti 3 Al (Braun et al. [1994]), a good test of efficacy in fast cooling. 

The next stage came when the first methods of making continuous rapidly quenched 
ribbons were invented. All these depend on contact between a thin liquid ribbon and a 
moving chill block. The old method of single-roller melt-spinning was re-invented by 
Pond in 1958 and later published (Pond and Maddin [1969]); twin-roller melt-spinning 
is due to Chen and Miller [1970] and was improved by Murty and Adler [1982] but 
is not very efficient and is little used now; melt-extraction was developed by Maringer 
and Mobley [1974]. The fourth major technique, much used in basic research, is the 
drop-smasher, also known as piston-and-anvil quencher, in which an alloy drop is 
levitation-melted inside a conical induction coil, released and quenched between two 
moving copper surfaces to form a disc (Harbur et al. [1969], Beghi et al. [1969] and 
Cahn et al. [1976]). The principles of these four methods are indicated in fig. 13. Of 
these methods, melt-spinning and its variant planar flow-casting (fig. 14), which allows 
wide sheets to be made, approaching a meter in width, are in practice the most import- 
ant; the kinematic and heat-flow aspects of these processes are analysed by Shingu and 
Ishihara [1993]. Their analysis produces an analytic relationship between ribbon 
thickness and substrate velocity: ribbon thickness varies as the - 0.8 power of substrate 
velocity, ranging from 70 pm for 15 m/ s to 20 pm for 50 m/ s. (See also ch. 29, § 5.2). 

Plasma-spraying is a method which in a sense represents a return to Duwez’s 
microdroplet approach: an electric discharge between two electrodes in argon provides 
energy to melt alloy globules which are then projected by an argon jet on to a substrate 
to build up a deposit. The starting material is pre-alloyed powder, 50-100 mm in 
diameter. The hot gas jet needs to be blown aside to ensure rapid cooling of the 
impinging droplets. Gagn£ and Roy [1982] have succeeded in plasma-depositing layers 
of a Ni-Cr-Si-B glass on to a metallic substrate. A full account of the use of this 
technique was published by Safai and Herman [1981], and some of the metastable 
phases made in this way are discussed by Herman and Bhat [1980]. The method can 
also be used to solidify fine droplets in flight for subsequent compaction, though it has 
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(c) (d) 


Fig. 13. Principal methods of rapid quenching from the melt: (a) drop-smasher, using levitation-melting by 
induction; the pistons are pneumatically or electro-magnetically accelerated and come into action when the 
falling drop breaks a light beam; (b) melt-spinning; (c) pendant-drop melt-extraction (there is a distinct version 
with the wheel dipping into a melt surface); (d) twin-roller quenching device. 



Substrate motion 

Fig. 14. Planar flow-casting, schematic. 
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been little used in this mode. A simple flame-heated metal-spraying gun has also been 
used to deposit glassy layers of Fe 40 Ni 40 P 14 B 6 (Shingu [1979]). 

A number of different methods have been used for the production of RSP wires. One 
is free jet melt-spinning of a molten alloy, usually a steel, through a nozzle into a gaseous 
quenching medium (Mottern and Privott [1978]). The problem here is to avoid 
Rayleigh instabilities which tend to break up the free jet: this is best achieved by alloying 
the steel with aluminum, quenching into carbon monoxide and relying on the surface 
oxide layer formed on the freshly formed jet to resist its break-up. Other alloys can be 
correspondingly treated. An alternative approach (Ohnaka and Fukusako [1978], Inoue 
etal. [1982], Ohnaka [1988]) relies on a variant of melt-spinning: the alloy jet impinges 
on the inner surface of a rotating vessel containing water which is held in an annular 
shape by centrifugal force. The method is known as the in-rotating-water splat-processing 
(INROWASP) (fig. 15). This, like the free-jet method, produces wires 100-200 /rm in 
diameter. Rayleigh instabilities are avoided by carefully matching the speed of the liquid 
metal jet and the rotational speed of the water annulus. There is evidence from compara- 
tive relaxation studies (Inoue etal. [1983]) that a glass in the form of wire is solidified 
at about the same rate as the same glass in the form of a melt-spun ribbon several times 
thinner, but (because of continuing close thermal contact) cools faster after solidification 
(an important aspect of rapid solidification that is too often ignored). In contradiction to 
this, a very recent study of the microstructure of Ni 3 Al converted into wire by the in- 
rotating-water method (Chiba and Hanada [1996]) provides some (though not conclus- 
ive) microstructural. evidence that wires made in this way have solidified more slowly 
than melt-spun ribbons of equivalent thickness. — Another method of rapid solidification 
which has recently enjoyed a revival is the Taylor wire method : here a thick wire is 
encapsulated in a glass which is heated and drawn down, together with the molten wire 



Fig. 15. In-rotating-water splat-processing (Inrow aspi, schematic. 1, Rotating drum; 2, liquid coolant; 
3, nozzle; 4, melt jet; 5, induction coil. (After Ohnaka [1985)). 
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within; the change in cross-section combined with heat withdrawal by the encapsulating 
glass provides effective rapid cooling. Apart from the rapid-freezing aspect, this is a 
good method of making ultrafine wires (see review by Donald [1987]). 

Lebo and Grant [1974] published a key paper on production of RSP aluminum alloy 
powder by atomization and also of splats against a cold metal surface, followed by 
consolidation of the powder at a temperature and for a time designed to prevent 
deterioration of the desirable properties induced by RSP. Another key paper, describing 
the compaction and properties of splat-quenched A1 — Fe alloys, was by Thursfield and 
Stowell [1974]. All subsequent research and development of industrial uses of RSP as 
applied to crystalline alloys has been based on this approach. Atomization methods in use 
(reviewed by Suryanarayana [1991]) include ultrasonic atomization centrifugal 
atomization in helium and quenching of droplets between rollers to make flakes 
(Sankaran and Grant [1980]). 

Consolidation is either by warm extrusion or by hot isostatic pressing. Even cold 
explosive compaction can be used (Morris [1980]). An alternative approach which is 
increasingly finding favour is to pulverize a brittle crystalline melt-spun ribbon. A small- 
scale variant, capable of producing extremely fine, submicron powders, is electrostatic 
extraction of particles from a liquid-alloy tip drawn out into a point by a strong electric 
field (Perel etal. [1981]). This apparatus has been used for a detailed metallographic 
study of the microstructures of RSP aluminum alloy globules (Levi and Mehrabian 
[1982]; the globules were fine enough for TEM examination without further thinning). 

Yet another method which is used from time to time is spark-erosion, using a 
standard workshop machine for the purpose. Electrodes and a vibrating charge of blocks, 
all of the same alloy, are spark-eroded in a dielectric, which can be a hydrocarbon, water 
or liquid argon. Small drops of the alloy are heated to an extreme degree and turn into 
a plasma which is then quenched by the surrounding dielectric fluid. The technique is 
described by Berkowitz and Walter [1987], and Cahn etal. [1988] have exemplified 
the drawbacks arising from carburisation or oxidation of the alloy by the dielectric fluid; 
this is why liquid argon is to be preferred (Bucklow and Drain [1964]). 

All the foregoing methods have been systematically reviewed and compared in a 
major overview by Suryanarayana [1991]; he also includes recent, advanced 
atomization methods, some employing lasers, which are now used to make tonnage 
quantities of atomised alloy powders. The historical background to the early methods of 
rapid solidification is reviewed by Cahn [1993]. 

3.1.1. Cooling rates in rapid solidification processing 

The rate of cooling of the melt before, and of the solid after solidification can be 
accessed by direct measurement. Alternatively, the cooling rate at the freezing stage can 
only be deduced by measurement of a microstructural feature of the resulting crystalline 
alloy, most commonly the secondary dendrite arm spacing (ch. 8, §7.1) or, where 
apposite, eutectic lamellar spacing; but such a microstructural approach is not available 
when the product is a glass. 

Dendrite arm spacings are determined by both the temperature gradient in the 
solidifying melt, G L , and the velocity of the melt/ solid interface, V. The cooling rate, 
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dT/dt,-G L .V. The spacing of primary dendrite arms is not suitable for cooling rate 
measurement because its relationship with G L and with V follows distinct power laws, 
whereas for secondary dendrite arms, both variables are related to the arm spacing, /, by 
an inverse 1/3 power law. This is demonstrated in fig. 16, (Jones [1982]), based on a 
range of measurements for Al-Cu and Al-Si alloys. For the fastest cooling rates, a 
microthermocouple projecting into the melt in a piston-and-anvil quenching unit is used 
(Armstrong and Jones [1979]). The dendrite method is not quite so reliable for nickel- 
and iron-based alloys; the eutectic-lamella approach, used for melt-spun aluminum alloy 
ribbons and relying on extrapolation from slower, measured cooling rates, gave an 
estimate of 3 x10 s K/s in the neighborhood of the freezing temperature (Chatto- 
padhyay and Ramachandrarao [1980]). 

The limitations of such indirect methods are: (1) they are not applicable to glass 
formation; (2) they can give an estimate only of the effective cooling rate (related to the 
solidification time and specimen thickness) in the temperature range in which solidifica- 
tion takes place, and not of the subsequent cooling rate in the solid state; (3) each family 
of alloys requires separate calibration; (4) the dendrites may coarsen after solidification 
and so yield misleading results. For all these reasons, direct measurement is now 
regarded as preferable, difficult though it is. 

The early direct measurements of liquid-quenching rates which have been published 
have referred to the piston-and-anvil technique. Thus, Harbur etal. [1969] made use of 
the metals (suitably insulated) of two dissimilar pistons as the legs of an intrinsic 
thermocouple, whereas Duflos and Cantor [1982, 1987] used a minute intrinsic 
thermocouple, electrically shorted by the melt itself. Pyrometry of the droplet has also 
been employed (e.g., Kattamis et al. [1973]). Melt-spinning poses a more difficult 
measurement problem altogether: Warrington etal. [1982] used calibrated black-and- 



Fig. 16. Dendrite spacing, A, as a function of cooling rate, e, near the melting temperature, for Al-4 to 5 wt% 
Cu (open circles) and Al-7 to 11 wt% Si (solid circles). The plot refers to secondary dendrite arms. The line 
shown represents le l/3 = const. = 50 mm (K/s) 1 ' 3 . (After Jones [1982]). 
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white photography, while later techniques have focused on the use of calibrated 
photometry of color photographs of the surface being investigated (Gillen and Cantor 
[1985], Bewlay and Cantor [1986]) and the use of infrared imaging devices, which 
last have proved particularly useful for spray-deposition (Grant et al. [1989], Grant 
and Cantor [1990], Bewlay and Cantor [1990]). 

Figure 17 shows some results by Duflos and Cantor for a drop-smasher, while fig. 18 
shows photometric measurements relating to melt-spinning; the sharp decrease in cooling 
rate when the ribbon loses contact with the copper wheel should be noted. It is quite 
clear that the cooling rate varies substantially between the freezing point and ambient 
temperature. Thus at 1500°C, near the freezing-point of a steel, the cooling rate for 
piston-quenched steel or iron samples 25-50 /zm in half-thickness (the proper measure) 
is 10 6 -10 7 K/s; the higher values apply to the very rapid-acting piston-and-anvil 
apparatus designed by Cahn et al. [1976]. The cooling rate at 1000°C is = 10 times 
slower than at 1500°C, and at 500°C, 20-30 times slower again. It is evident that the 



Fig. 17. Data from many cooling curves obtained by embedded microthermocouple and oscilloscope traces 
during piston-and-anvil drop-smashing of iron, replotted as cooling rate, T , versus temperature, T. The voltages 
cited are electromagnetic accelerating voltages; higher voltages imply faster impact. The weights refer to 
specimen sizes; larger specimens, heated for longer periods, had greater superheat. (After Duflos and Cantor 
[1982]). 
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Time (milliseconds) 

Fig. 18. Temperature measurements during melt-spinning of Ni-5wt% Al, by color-calibrated photometry. 
(Based on Gillen and Cantor [1985]). 


custom of characterizing the cooling rate by a single figure is unacceptable, unless the 
temperature range to which such a figure applies is specified. Single-roller melt-spinning 
turns out to be slower than piston-and-anvil quenching (Hayzelden [1983]): thus, for a 
40 mm-thick steel ribbon, the measured cooling rate is 5x 10 5 K/s at 1500°C and only 
10 4 K/s at 1000°C (because the ribbon has by then left the wheel). It is generally 
accepted that dual-roller quenching is even slower than single-roller melt-spinning 
overall, in spite of the presence of double chill block, because the duration of contact 
between the ribbon and the chill block is so brief. However, the initial cooling rate 
appears to be very high (Davies et al. [1978]). In the case of rapid quenching by 
atomization, cooling rates have been estimated from dendrite spacings: thus, for a gas- 
atomized superalloy, cooling rates ranging from 10 3 -10 6 K/s have been estimated as 
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particle diameters range from 400 to 10 /xm (Duflos and Stohr [1982]). The preferred 
approach at present, however, is by means of heat-transfer calculations, as exemplified 
for conventional casting in ch. 8., § 2. The crucial parameter here is the Nusselt (or Biot) 
number, N, of both specimen and “chill block” (the latter, in the case of a droplet 
solidifying in flight, is gaseous). N-hd/k, where h is the heat-transfer coefficient, d, is 
the effective specimen or chill-block thickness and k is the thermal conductivity. As h 
increases, cooling changes from Newtonian, via intermediate to “ideal” cooling, with 
progressively larger temperature gradients in the specimen, h can be estimated from 
measured cooling rates in the case of piston-and-anvil quenching (Cantor [1982]), but 
for powder atomization it is necessary to compute h from first principles for specified 
boundary conditions ( h is very low for pure radiative transfer and several orders of 
magnitude larger for favorable convective cooling). The theory has become extremely 
complex: full treatments have been published by Levi and Mehrabian [1982], Mehra- 
bian [1982] and Clyne [1984]; reference should be made to these papers. 

The cooling rate varies of course as a function of droplet diameter and depth within 
the droplet, and is perturbed by the release of latent heat in a way which depends on the 
undercooling that preceded the start of freezing. A particularly important regime is with 
heat-transfer rapid enough to permit hypercooling, that is, suppression of nucleation till 
the sample has undercooled so far that the release of the entire latent heat is insufficient 
to reheat the sample to the equilibrium freezing point: freezing can then go to completion 
without any further heat extraction. The analysis by Clyne [1984] concludes that there 
are two distinct approaches to generating very rapid crystallization from the melt: one can 
either hypercool, via very small droplets, or else accept a lesser degree of undercooling and 
arrest the subsequent recalescence by means of highly efficient heat extraction. 

In the theory of atomization, it turns out to be more useful to compute solid/liquid 
interface velocities rather than cooling rates, since the former relate more closely to 
interface stability. Mehrabian cites values for 50 /*m aluminum droplets of 4 m/ s at the 
start of freezing, dropping to 0.04-0.13 m/s in the intermediate stages of freezing. 

3.2. Solidification of highly undercooled liquids 

When a liquid alloy is cooled, solidification normally starts at a small undercooling 
below the liquidus temperature, triggered by nucleation on heterogeneities. However, 
liquid alloys are typically rather resistant to homogeneous nucleation. The most thorough 
kinetic analysis has been by Turnbull [1952] of liquid mercury in which homogeneous 
nucleation appears to occur only at undercoolings as high as 35 to 40% of the absolute 
melting temperature. Thus there is a possibility of achieving large undercoolings if 
heterogeneous nucleation can be suppressed or at least inhibited. This is achieved by 
having a clean melt and ensuring that it does not make contact with a solid container; 
various methods are illustrated in fig. 19. 

In the emulsion method (exploited by Turnbull [1952] and more recently by 
Perepezko [1984]) the liquid alloy is dispersed in an inert carrier liquid. When the 
number of alloy droplets exceeds the number of potent heterogeneous nucleants originally 
in the alloy, some of the droplets must exhibit substantial undercooling. However, for the 
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Fig. 19. A schematic illustration of the most common methods for inhibiting heterogeneous crystal nucleation 
and thereby obtaining highly undercooled liquid alloys: (a) droplet emulsion, (b) substrate, (c) entrained droplet 
(mushy state quenching), (d) fluxing, and (e) drop-tube or levitation. (After Perepezko [1984]). 


emulsion to be stable the droplets must be coated and this coating may itself act as a 
nucleant. The coating problem is not present in the fluxing technique in which a single 
alloy drop is surrounded by viscous flux or slag, which, in addition to isolating the drop 
from contact with solid container, may have a cleaning action (Kui, Greer and Turn- 
bull [1984]). Truly containerless solidification can be achieved for droplets in free-fall 
in evacuated drop-tubes (Greer [1994]) and levitated, for example electromagnetically 
(Schleip et al. [1988]). A few examples of maximum undercoolings which have been 
achieved for various metals are shown in table 2. 

When a melt is highly undercooled, solidification when it does occur is rapid and 
shows all of the microstructural effects outlined in § 1.1. Crystal nucleation can even be 
avoided completely, yielding a glass (Perepezko and Smith [1981]). The techniques for 
obtaining undercooled melts are of interest both for production of metastable states and 
for fundamental studies. The emulsion and fluxing methods, for example, can readily be 
used to measure not only undercooling but’ also the specific heat of the undercooled 
liquid, an important thermodynamic parameter. Of particular interest is that electromag- 
netic levitation of a droplet can be used for quantitative studies of solidification velocities 
at selected undercoolings. Studies of undercooled alloy melts have been comprehensively 
reviewed by HerlaCH etal. [1993], Among the phenomena investigated are grain refinement 
(Greer [1991]) solute trapping and the nucleation and growth of metastable phases. 

These topics are discussed in some detail in several sections of ch. 8, § 4.2. 
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Table 2 

Some examples of maximum undercoolings achieved for pure metals, using various techniques. Much more 
extensive surveys will be found in Kelton [1991], Herlach etal. [1993] and Greer [1994], from which 

these data are taken. 


Metal 

Maximum 
undercooling 
Ar_ (K) 

Melting point 

r„(K) 

Normalized 

undercooling 

A r_/r m 

Experimental 

method 

Ni 

480 

1728 

0.28 

electromagnetic 

levitation 

Fe 

538 

1809 

0.30 

electromagnetic 

levitation 

Hg 

80 

234 

0.34 

emulsion 

Ga 

174 

303 

0.57 

emulsion 

Sn 

191 

505 

0.38 

emulsion 

Re 

975 

3453 

0.28 

drop- tube 

Hf 

530 

2500 

0.21 

drop-tube 


3.3. Deposition by evaporation or sputtering 

The rapid-quenching methods described above have been used to make not only 
crystalline alloys but also metallic glasses. Several glass-making methods which do not 
depend on melt-quenching will be outlined here. These methods may collectively be 
termed pseudo-RSP. 

One group of such methods depends on the deposition of vapor on cold substrates 
(often held at liquid-nitrogen or even liquid-helium temperature). These methods are in 
fact of older ancestry than true RSP methods and were used to make various amorphous 
and supersaturated crystalline materials in the 1930s (Kramer [1934, 1937]) and 1950s 
(Buckel and Hilsch [1956]), generally in the form of thin films. (For more details, see 
Cahn’s 1993 review). Sometimes thermal evaporation is used to produce the vapor 
(Mader [1965] and Sinha etal. [1976]), but cathodic sputtering is more usual; if a 
powerful high-rate sputtering apparatus is used, glass samples several /am thick can be 
made, though at very high cost (see review by Dahlgren [1978]). These techniques are 
now waning, in spite of the fact that more alloy compositions and wider composition 
ranges can be made glassy by vapor methods than by liquid-quenching (Turnbull 
[1981]). Perhaps a reduction of interest in these methods is a recognition that in 
metallurgy, as opposed to microcircuitry, thin films have only a modest role. 

3.4. Amorphization by irradiation 

A further approach is heavy irradiation (to a dose of at least one displacement per 
atom) of a crystalline alloy (generally an ordered intermetallic compound), using either 
neutrons or ions (Rechtin et al. [1978], Elliott and Koss [1981]). As judged by 
calorimetric tests, such “artificial” glasses are structurally very similar to the same alloys 
vitrified by RSP. A related “solid” approach to making a glass is by implantation into a 
pure metal of ions of a different element, most usually metalloid ions. Phosphorus has 
been most effectively used. The bombarding ion both destroys crystalline order and, by 
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occupying strategic locations in the glass structure, stabilizes it. Self-ion bombardment 
(e.g., Ni + into nickel) never generates a stable glassy structure in a metal (although 
apparently it can do so in silicon). The technique has been reviewed by Borders [1979], 
and there is later research by Rauschenbach and Hohmuth [1982], and by Russell 
[1985]. Very recently, a complete and up-to-date review by Howe [1995] has been 
published. Here there is space only for a thumbnail sketch: — A striking group of 
observations is on electron-induced amorphization. Mori and Fujita [1982] have 
established the temperature dependence of the critical dose for amorphizing NiTi with 2 
MeV electrons, while Mori et al. [1983] have observed the surprising phenomenon of 
initial amorphization preferentially at dislocations, in the same alloy. The theoretical 
background, including a discussion of critical irradiation doses, is presented by Carter 
and Grant [1982]. It is important to recognize that glassy layers made in this way are 
less than one micron thick. It is now well established that the intermetallic compound 
must be disordered by the irradiation below a critical level of long-range order; it then 
becomes unstable towards amorphization (Luzzi etal. [1984, 1985, 1986] ). Mori etal. 
[1984] nevertheless incline to an alternative (or additional?) criterion, requiring a large 
critical vacancy concentration. Cahn and Johnson [1986] have critically discussed the 
relative merits of these models. A more recent discussion of these rival models is by 
Motta and Lemaignan [1992]: they assemble evidence that a critical degree of 
destruction of long-range order is generally a necessary but not a sufficient criterion and 
that a critical concentration of one type of point defect (either vacancies or interstitials) 
is also necessary, and they answer earlier criticisms suggesting that the required point 
defect concentrations are not in practice achievable. 

Mendoza-Zelis et al. [1982] describe the amorphization of nickel films by phos- 
phorus ion implantation. Electrical resistivity indicates that the glass thus prepared is 
structurally identical to glasses of the same compositions made by evaporation or 
electrodeposition. (The last two approaches to glass-making, together with the related 
technique of electroless plating, with a partial but extensive list of compositions prepared 
in these ways, are outlined by Dietz [1977].) Ion implantation is also used to amorphize 
crystalline silicon; some of the consequential research is outlined in the next section. A 
technique closely related to ion implantation is ion-beam mixing, in which two discrete 
layers at a surface are mixed on an atomic scale by bombardment with “neutral” ions 
such as A + or Xe + . Thus, Au-Si glasses have been made by ion-mixing of a gold-coated 
silicon substrate; this same glassy alloy has been made by RSP of the melt, evaporation 
on to a cold substrate, laser-treatment and ion-mixing. Ion-beam mixing is reviewed by 
Follstaedt [1991]. 

The methods mentioned in the last two paragraphs effectively involve treatments at 
or close to metallic surfaces, which is the subject of the following section. 

3.5. Rapid solidification processing of surfaces 

A benefit of the various rapid solidification methods that operate at surfaces is that 
only a very thin layer of the solid needs to be melted and the underlying solid then can 
act as a built-in heat sink. The usual instrument employed for this purpose is a laser, 
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either continuous-acting (C0 2 type), used in scanning mode, or pulsed (ruby type). The 
most common way of rapid quenching at surfaces involves no local change of composi- 
tion, but it is possible to coat the surface with a thin layer of another substance and 
metastably alloy it into the melted surface; in this way, expensive substances need only 
be used in small quantities. 

RSP at surfaces has in practice been used for two very different purposes: (1) There 
is a large body of research (which can only be briefly exemplified here) on the laser- 
treatment of semiconductor surfaces, silicon in particular, and (2) laser treatment has 
been used on a large scale for the hardening of surfaces of steel and some other 
industrial alloys. 

Figure 20 shows the variables involved in the treatment of a surface by a continuous 
wave laser. Heat-flow calculations, for adequately large heated spots, can be simplified 
by neglecting sideways heat flux and treating the problem in terms of one-dimensional 
heat flow: calculations of this kind have been published by Mehrabian [1982]. He 
assumes that the heat which can be usefully absorbed is limited by the onset of vaporiz- 
ation; on this basis, increasing the laser power (which entails faster scanning over the surface) 
yields a shallower melt, faster refreezing and a steeper thermal gradient. Typical figures, for 
aluminum, are 650 /rm or 6.5 fun melt depth for 5 x 10 8 or 10 10 W/m 2 heat flux, respectively. 
The estimated solid-liquid interface velocities are 0.035 and 3.5 m/s, respectively. Kear et 
al. [1981] estimated that, again taking vaporization as a limiting factor, quench rates up to 10 6 
K/s can be achieved for a melted layer ~ 10 fim thick. More recently, Lin and Spaepen 
[1984] showed that a pulsed laser can provide cooling rates as high as 10 14 K/s. 


laser 

beam 



Fig. 20. Schematic illustration of laser beam-substrate geometry during rapid surface melting and refreezing by 
laser treatment. (After Mehrabian [1982]). 
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The use of Q-switched ruby lasers so designed as to give extremely short pulses in 
the nanosecond (or even picosecond) range has been highly fruitful in research on 
silicon. A good example of research on metastable structures in silicon is the body of 
work on the melting of amorphous silicon. A layer of amorphous silicon can be created 
at the surface of a silicon crystal slice by ion-implanting either silicon itself or another 
species such as indium. It was first suggested in 1978, independently by Spaepen and 
Turnbull and by Bagley and Chen, that such an a-silicon layer might undergo a first- 
order melting transition at a temperature several hundred degrees lower than the melting 
temperature of crystalline silicon (although a-silicon cannot have an exact thermodynami- 
cally defined melting temperature). The melting of a covalently bonded amorphous solid 
to give a metallically bonded liquid here is bound to be discontinuous. This was duly 
confirmed by laser-heating in situ of the a-silicon layer, with a = 3 ns pulse: what is done 
here is to make transient measurements of properties, in particular, electrical conductance 
and optical reflectance, which vary with the state of the silicon, with a time resolution of 
a few nanoseconds. Indeed, the investigators found that the molten layer can then 
explosively crystallize: the release of latent heat makes the process self-sustaining. Fig. 
21, from a review by Poate et al. [1987], shows transient conductance and reflectance 
measurements on such a specimen, and also indicates the evolution with time of the 
molten and subsequent crystallized layer at the expense of a-silicon. (Because of the 
large latent heat released, crystallization under these circumstances is “explosive”). The 
transient measurement approach can even be used to assess whether the crystallizing 
layer is nucleated at the surface or in the interior of the melt (Peercy et al. [1990]); the 
conclusion was that nucleation is internal. Figure 22 shows the time-dependent depth of 



Fig. 21. Transient conductance measurements of molten layer thickness and simultaneous surface reflectance 
for a 0.20 J/cm 2 pulse incident on a 320 nm thick a-Si film. The schematic diagrams at the right illustrate 
various stages for the (“explosive”) crystallization process. (After Poate et al. [1987]). 
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Fig. 22. Molten layer thickness as a function of elapsed time for irradiation of 100 KeV-implanted silicon. 
(After Peercy etal. [1990]). 


the molten layer deduced from reflectance measurements (because reflectance changes as 
melt thickness changes) for different laser intensities. It can be seen that the dimensions 
involved are very different from those calculated for typical experiments with metals, as 
indicated in the preceding paragraph. 

At the other extreme, laser treatment of steels, titanium and aluminum alloys has been 
widely practised, with and without powder injection into the laser-melted surface pool to 
change local composition. The field has recently been reviewed by Mordike [1986, 
1991] and by Schubert and Bergmann [1993], Thus, aluminum-silicon alloys can be 
effectively “modified” to refine the scale of the eutectic and enhance mechanical 
properties thereby, without the use of the usual modifying dopants. Similar effects can 
be achieved with cast iron. The nature of nucleation of crystalline phases in a surface 
melt-pool created by laser irradiation can be deduced by examination of the resultant 
microstructure, and the conclusion was drawn that nucleation at the free surface, at the 
melt/ substrate interface, and homogeneously within the melt, can all be generated by 
varying the conditions of irradiation. For further details, Mordike’s review should be 
consulted. 

A further approach is to make layered alloys and use laser pulses to convert these 
into a homogeneous metallic glass. Thus, Lin and Spaepen [1982] made an iron-boron 
glass with as little as 5 at% B (much less than is required when less extreme methods are 
used) by picosecond laser treatment of a finely layered Fe/Fe 3 B sandwich. 

3.6. Electrochemical, electroless and sonochemical deposition of amorphous phases 

It was discovered long ago by Brenner at the NBS that Ni-P alloys containing more 
than ~ 10 at% P can be made by electrodeposition, to give an ultrahard surface coating. 
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This industrially useful method has been steadily improved over the years (Brenner 
[1963]). The most recent development has been an examination of the influence of 
pulsed as opposed to steady current (Lashmore and Weinroth [1982]), and it appears 
that Ni-P glass deposits made under these two regimes are structurally distinct (Lash- 
more etal. [1982]). Little has been published about other compositions that can be made 
in amorphous form by electrodeposition, but there is evidence that some research has 
been done under conditions of industrial secrecy: apparently Co-W-B is another alloy 
family that can be made amorphous in this manner. Also, bright amorphous chromium 
(presumably stabilised by solutes) can be electrodeposited from chromic acid solution 
with additives (Hoshino etal. [1986]; Tsai and Wu [1990]). 

Electroless plating has already been mentioned in passing in § 3.4, in connection with 
Dietz’s 1977 review. Much research has been done on this technique, which is based on 
electrochemical principles though no external potential difference is applied. It has very 
largely been applied to the deposition of Ni-P coatings applied from solutions containing 
hypophosphite ions, which can be amorphous or nanocrystalline and are very hard. This 
extensive research, much of it concerned with the control of deposition rates, is reviewed 
by Roos etal. [1991], 

A recently discovered, distinctly exotic way of making an amorphous metal is via 
sonochemistry. This term relates to the technique of injecting energy into a chemically 
reacting system by means of ultrasound. The key investigators here have been the 
chemist K. Suslick and his group. Ultrasound is conveyed to a liquid mixture of 
chemicals in the simple apparatus shown in fig. 23. Suslick has explained how high 
temperatures and consequential ultrarapid cooling arise in such an apparatus (Suslick 
[1993], [1995]). 

Vapor-filled bubbles are formed in the liquid through resonant absorption of sound 
energy and oscillate in size at, typically, 20 kHz. When a bubble has grown so large that 
it is no longer resonantly coupled to the ultrasound field, it collapses at extreme speed 
(“cavitation”) under the influence of surface tension and in the process, the vapor is 
heated adiabatically to a high temperature; immediately after this stage, the surrounding 
liquid cools the vapor as fast as it was heated. The temperatures reached and the time- 
scale involved have been measured, among other techniques, by means of sono- 
luminescence, the emission of light flashes during bubble collapse, and temperatures near 
5000 K have been estimated (Flint and Suslick [1991]). (The very existence of 
sonoluminescence has been doubted until, recently, Barber and Putterman [1992] 
measured the time-dependent radii of sonoluminescent bubbles at the same time as the 
sonoluminescent output. Maddox [1993] has discussed the implications of these striking 
observations). The duration of the high temperature is estimated to be of the order of 100 
ns and the cooling rate thereafter is around 10 10 K/s, much faster than any of the other 
techniques discussed hitherto. “Nearly pure” (>96% Fe) amorphous iron powder, of 1-2 
nm particle size, has been made by the sonochemical decomposition of iron penta- 
carbonyl, a liquid (SuXlick etal. [1991]). The powder crystallizes at the surprisingly low 
temperature of = 308°C. This approach does not appeal' to have been applied yet to nickel 
carbonyl. 
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Fig. 23. A typical sonochemical apparatus with directly immersed ultrasonic horn. (After Suslick [1993]). 

3.7. Solid-state amorphization reactions (SSAR) 

Since the amorphous phase in alloys can be a well defined metastable state, it can be 
produced by reaction, enabled by thermal annealing. The initial “energized” state (see 
§ 1.3) is simply an aggregate of unreacted elements. The first metallurgical demonstration 
of a solid-state amorphization reaction was by Schwarz and Johnson [1983], who 
showed that thin polycrystalline gold and lanthanum films in a multilayered stack would 
react at 50-100°C to give an amorphous alloy. (This research followed a slightly earlier 
discovery of a related phenomenon: the compound Zr 3 Rh, in a metastable crystalline 
form, could be amorphized by reacting it with hydrogen to form a metastable amorphous 
hydride (Yeh et al. [1983]; this finding has been broadened more recently by Aoki etal. 
[1991], who discovered hydrogen-induced amorphization in numerous binary 
intermetallic compounds in which one constituent is a hydride former). 

A significant number of elemental pairs have since been found to undergo similar 
reactions similar to that between gold and lanthanum; these include Ni-Zr, Cu-Zr, Fe-Zr, 
Ni-Ti, Ni-Nb, Cu-Ti among others, as reviewed by Johnson [ 1 986] and also in the first 
conference devoted to this phenomenon (Schwarz and Johnson [1988]). The amorphi- 
zation reaction is associated with two characteristics of the elemental pairs, one thermo- 
dynamic and one kinetic: a strongly negative heat of mixing in the amorphous state, and 
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a marked asymmetry in diffusivity, that is, one element diffusing much faster than the 
other. Since diffusional asymmetry is most likely associated with a size difference, it can 
be seen that the characteristics needed for the amorphization reaction have much in 
common with those favoring glass formation generally (§2.3). The negative heat of 
mixing provides the driving force for the reaction, but it is associated with a tendency for 
the formation of highly cohesive intermetallic compounds. However, at the low annealing 
temperature of the reaction (~ 250°C, for example, in the case of Ni-Zr) these com- 
pounds have difficulty in nucleating, and then the metastable extrapolated liquidus curves 
shown in bold in fig. 24 become relevant. The amorphization reaction is then interpret- 
able as eutectic melting occurring below T % and therefore yielding an amorphous solid 
rather than a liquid. In the case of Ni-Zr, nickel is a fast interstitial diffuser in crystalline 
zirconium, whereas zirconium is a slow substitutional diffuser in crystalline nickel. This 
difference in behavior, in which the Ni can diffuse several orders of magnitude faster 
than the Zr at the same temperature (Greer etal. [1990]), applies also in the amorphous 
phase. Diffusional asymmetry of this kind appears to apply in the other systems showing 
the solid-state amorphization reaction, the high mobility of one species enabling the 
reaction to occur, the low mobility of the other species inhibiting nucleation of the 
equilibrium intermetallic compounds. 

Systematic experiments on Ni-Zr have established that the amorphous phase is 
nucleated at grain boundaries in the polycrystalline zirconium films (SchrOder et al. 
[1985] and Vredenburg etal. [1986] have confirmed this by a beautiful experiment in 
which they demonstrated the impossibility of launching an amorphization reaction in a 
bilayer consisting of polycrystalline nickel deposited on single-crystal zirconium. 

It is mostly found that the amorphization reaction proceeds on a planar front at a 
diffusion-controlled rate which diminishes as the amorphous layer thickens. Beyond a 
critical thickness of reacted layer, that is, below a critical front velocity, a crystalline 



Ni Zr 

Zr (al. %) 

Fig. 24. Ni-Zr equilibrium diagram, a system exhibiting solid-state amorphization. The bold lines show the 
terminal liquidus curves and their metastable extrapolations. (After Highmore and Greer [1989]). 
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compound appears. Its appearance has been quantitatively interpreted by Highmore 
[1990] in terms of transient nucleation theory. The intermetallic nuclei thus formed can 
grow at the expense of the previously formed amorphous layer (Meng et al. [1988]). 
Thus, if a fully amorphous product is to be obtained by the reaction, initial thicknesses 
of the elements in a multilayer stack must be small, typically below 100 nm. Multilayers 
with such thin layers can be made by thin-film deposition, but then the final product is 
likely restricted to a thin-film form, and this could be obtained more readily by initial co- 
deposition of the elements. SSAR in multilayers has been illuminatingly examined by 
Jankowski etal. [1995]. 

The more general issue of the relation between “nucleation of disorder” in SSAR, 
melting and the destruction of atomic long-range order in a solid solution has been 
discussed by Cahn and Johnson [1986] and brought up to date in a stimulating 
thermodynamic approach by Wolf etal. [1990], 

3.8. Amorphization by mechanical processing 

When the diffusion-induced amorphization reaction was first discovered, it was soon 
recognized that deposition by evaporation was not the only way of making the initial 
multilayers: it was also possible to roll or wire-draw composites of foils or wires of the 
constituent metals (to reduce the scale of the microstructure and to weld the constituents) 
and then to anneal the product. In fact, the production of ultrafine filamentary composites 
by swaging, wire-drawing or rolling of mixtures of two metals, notably for use as 
superconductors, was pioneered in the 1970s long before the SSAR had been discovered, 
and reviewed by Bevk [1983]. Soon after the discovery of SSAR, Atzmon etal. [1985] 
rolled stacks of alternating 20-200 mm foils of Ni-Zr, Ni-Er and Cu-Er to very heavy 
reductions and then annealed them: partial conversion to amorphous phase was observed. 
Schultz [1987] worked with similar Ni-Zr stacks: his amorphous phase had a composi- 
tion different from that of the average of the stack, implying that the zirconium had 
become exhausted before the nickel. It appears that Schultz may have been the first to 
observe this kind of mechanically aided amorphization (Schultz [1989]). 

Bordeaux and Yavari conducted an extensive programme of experiments on cold- 
rolled multilayer stacks of systems such as Ni-Zr, Ni-Nb, Cu-Zr, Cu-Ti and Fe-Ti, in 
the late 1980s (see review by Bordeaux and Yavari [1990a]). X-ray diffraction 
established that a measure of amorphization took place during the rolling process itself, 
but annealing was required to complete it. They were able to amorphize Al-Pt stacks 
completely during heavy rolling (Bordeaux and Yavari [1990b]); this kind of process 
is termed mechanically induced amorphization, to distinguish it from mechanically aided 
amorphization They made careful kinetic measurements of amorphization rates, using 
XRD and DSC, and were able to show (in spite of difficulties occasioned by the fact that 
the as-rolled layer thicknesses had a broad distribution) that amorphization was much 
faster than in an interdiffusion experiment at the same temperature, in the absence of 
cold work. The comparison was also complicated by the fact that deformation causes 
local heating at the foil interfaces, which may transiently approach +100 K as measured 
by thin-foil thermocouples (Martelli etal. [1990]). From their kinetic studies of the rate 
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of thickness increase of the amorphous layers, Bordeaux and Yavari deduced that the 
activation energies for nickel diffusion through the layers were much lower than those 
previously measured for undeformed multilayer stacks. This suggested that “holes” — 
localised low-density regions in a metallic glass generated by deformation — had 
accelerated diffusion (see ch. 7, § 9.1.2). It is possible that the extreme defect concentra- 
tions generated by very heavy deformation altered the free energy balance between the 
various phases sufficiently to accelerate the kinetics significantly. 

In parallel to these studies, ball mills were introduced as an alternative way of 
imparting heavy deformation and thus bringing about amorphization without the need for 
annealing. This was originated by Yermakov [1981] in Russia, and soon after taken up 
by Schwarz and Koch [1983] in America, who prepared Ni-Ti and Ni-Nb amorphous 
phases by ball-milling. In this process, grinding balls, usually of specially hardened steel, 
are combined with a small amount of powder, either mixed metallic elements or powder 
of a prealloyed intermetallic compound, in a container which can be rotated or vibrated 
or subjected to a combined form of motion. The end-results are found to be sensitive to 
the particular form of grinding motion employed. Both mixed elements and prealloyed 
powders often behave in the same way with respect to amorphization. Details of the 
extensive body of research on this process of mechanically induced amorphization which 
has been accumulated within a few years can be found in a book chapter by Koch 
[1991]. The accumulated evidence suggests that mechanically aided amorphization and 
mechanically induced amorphization are variants of a single process. 

The criteria which govern what systems can be amorphized in this way are discussed 
at the end of § 2.3. 

3.9. Pressure effects 

In considering phase transformations, we have mostly been concerned with changes 
in temperature. Yet pressure is an equally important thermodynamic variable. As we have 
seen, metastable phases or microstructures can be achieved as a result of rapid tempera- 
ture changes. Of particular interest, then, is that pressure changes (because they do not 
involve a conductive process) can, especially for samples of large cross-section, be much 
more rapid than temperature changes. There are several ways in which pressure and 
pressure changes can be used to attain large departures from thermodynamic equilibrium. 

When pressure is varied, new phases can appear and pressure is then directly of 
interest in providing access to a wider range of structures. Some of these structures can, 
in fact, be obtained without application of pressure. For example, a metastable phase of 
pure bismuth obtained at high undercooling in a droplet emulsion experiment is a high- 
pressure phase; thermodynamically it can form because the liquid is cooled below its 
(metastable) melting point, and it must be kinetically preferred to the stable phase 
(Perepezko and Anderson [1980]). In most cases, however, application of pressure is 
necessary to cause a transformation (usually in the solid state) to a high-pressure phase. 
The question is whether this phase will be retained when the pressure is again reduced 
to ambient. Temperature is commonly used to effect retention under ambient conditions. 
The general procedure is as follows: Pressure is applied, the temperature is raised to 
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permit the transformation to the high-pressure phase to take place, the temperature is 
reduced to lower the atomic mobility, and the pressure is then reduced with the atomic 
mobility sufficiently low to prevent decomposition of the high-pressure phase. In this 
way metastable high-pressure phases are obtained. The classic example of such a phase 
is diamond. Research in this area has concentrated on compounds, such as InSb, GeP, 
NbjGe, NbjTe. Before the advent of oxide-based high critical temperature 
superconductors, high-pressure phases such as Nb 3 Ge were of considerable interest for 
their superconducting properties. Jayaraman [1980] has reviewed metastable phase 
synthesis by this route. The metastable compounds produced can in some cases decay 
into amorphous solids (Battezzati [1990]). 

A special case of a transformation induced by pressure is melting. This of course 
occurs only in those systems which show a contraction in volume on melting. Of these, 
by far the most widely studied is ice. Ice held at 77 K and subjected to a pressure of 10 
kbar undergoes amorphization (Whalley [1988]). Thermodynamically this result is 
straightforward: the high pressure takes ice to its melting point despite the low tempera- 
ture. Although there is some superheating, the ice does melt, albeit to an amorphous 
phase rather than liquid water, because 77 K is below T g for water. A similar technique 
could find application in some semimetal systems showing similar volume changes on 
melting. 

Just the converse effect has been exploited in a wide range of alloys showing the 
more usual behavior of a volume expansion on melting. For these materials application 
of pressure increases the melting temperature. Thus rapid pressure application to the 
liquid alloy can induce rapid solidification. After the solidification is complete, the 
temperature is reduced to ambient, not necessarily rapidly. The crystalline products can 
show all of the features of materials rapidly solidified by temperature reduction, such as 
refined microstructure, extended solid solubility and metastable crystalline phases. An 
example of work in this area is that by Sekhar and Rajasekharan [1986], and the field 
has been comprehensively reviewed by Reddy and Sekhar [1989]. 

The rapid pressure application technique does not lend itself to forming amorphous 
phases. The application of pressure increases the thermodynamic driving force for 
crystallization of the liquid without as large a reduction in atomic mobility as would be 
brought about by the corresponding temperature change. Nevertheless, application of high 
pressure should decrease atomic mobility and this has been successfully exploited. For 
example, Cu-Sn alloys held under pressure can be heated by electric resistance, and then 
quenched by turning off the electrical current. When this is done, Cu-(12-17 at.% Sn) 
glassy phases can be made, which is not possible under ambient atmospheric pressure 
(Battezzati [1990]). Ponyatovsky and Barkalov [1991] have developed this process, 
which they call thermobaric quenching, and have made glasses in the Cd-Sb, Zn-Sb and 
Al-Ge systems. 

Apart from the pressure effects outlined above, there is a further series of recent 
observations referring to silicon. Silicon undergoes a transformation to a metastable 
amorphous structure when subjected to an indentation with a diamond pyramid (e.g., 
Pharr etal. [1992]), a phenomenon linked with the metal/non-metal transition treated 
by Mott many years ago. The amorphous phase remains metastable after the indenter is 
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removed. The complicated pedigree of this observation has been explained by Cahn 
[1992]. Distinct from this stress-induced crystal-to-amorphous transition which is 
observed at ambient temperature is another, discovered by Pirouz et al. [1990] who 
found that indentation of silicon in the temperature range 400-650°C leads to a 
metastable, martensitic transformation from diamond-cubic to a hexagonal crystal 
structure; germanium behaves similarly. 

4. Metallic glasses: structure and properties 

4.1. Structure 

In a single-phase solid alloy the distinctive feature of the glassy state is that it has, in 
essence, no microstructure. There is no crystal lattice in which to have defects such as 
grain boundaries or dislocations. But the glass does have a structure in the way its atoms 
are arranged, and this structure is important in determining its properties. Relaxation 
(§4.2) of the glassy structure on annealing can, for example, lead to increases in 
viscosity by a factor of up to 10 5 . Structure and properties can also show a natural or 
induced anisotropy. This section reviews the great deal of work which has been done to 
determine and describe the structures of metallic glasses. Although it is not possible to 
obtain full information as for a crystal structure, the main features of glass structure have 
now been established. 

The problem of determining the structure of a crystal consists merely of identifying 
the coordinates of all the atoms in the unit cell; though the task is very difficult for large 
and complex unit cells, yet in principle (if the “phase problem” can be solved) the 
structure can be determined precisely. Conversely, for a glass, the structure can described 
only on a statistical basis: there are no unit cells and the environments of different, 
chemically identical atoms will necessarily vary. Herein lies the difficulty of determining 
and describing the structure of any glass (or liquid), and the intellectual challenge no 
doubt accounts for the attention devoted to the problem. Almost all the analysis of glass 
structure has been based on pair distribution functions (PDFs), because they can be 
determined from scattering experiments. PDFs give a measure of the probability of 
finding an atom centre at a distance r from an average central atom; they are often given 
in a reduced form G(r) = 47rr[p(r)-p 0 ], where p(r) is the number of atoms per unit 
volume at a distance r and p 0 is the number of atoms per unit volume in the sample as 
a whole. The PDF gives statistical information about distances only, and this is insuffi- 
cient to specify the structure. The angular distribution of interatomic vectors can readily 
be extracted from structural models, but cannot be determined from conventional 
scattering experiments. 

The determination of glassy structures is based on the comparison of measured PDFs 
with those calculated from trial structural models. PDFs can be determined with good 
precision (especially with the advent of synchrotron-based high-intensity sources of X- 
rays or neutrons (Egami [1994])) and can discriminate between models. When an 
acceptable match is obtained, there is still the problem that the solution is not unique; 
this is a more severe problem for glasses than for the analogous determination of crystal 
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structures, because of the angular information missing from PDFs. The discussion which 
follows is based on PDFs determinable in diffraction experiments, but at the end of the 
section less conventional methods for structural analysis will be introduced; some of 
these methods are capable of yielding angular information missing from PDFs. 

The PDF of a glass is determined by Fourier inversion of scattering data, for example 
from normalized X-ray scattering curves of the kind shown in fig. 25. For an alloy glass 
one set of data is sufficient to determine a radial distribution function (RDF) in which 
the different types of atoms are not distinguished. Alternatively, a partial pair distribution 
function (PPDF) can be determined for each type of atom pair. For a binary alloy there 
are three types of pairs, and to determine the three PPDFs it is necessary to perform at 
least three scattering experiments in which the two types of atom have differing relative 
scattering powers. An example of the type of analysis which can be achieved is given in 
fig. 26 showing the Ni-Ni, B-B and Ni-B PPDFs in a Ni 81 B 19 glass. For a ternary alloy 
there would be six PPDFs, and an analysis of this, or greater, complexity has not yet 
been attempted. There are various ways of achieving the differing relative scattering 
powers for the determination of PPDFs. These are: 

(a) Different radiations: X-rays, neutrons and electrons are all possible, but the poor 
precision available with electrons is a problem. 

(b) Anomalous dispersion: For X-rays the effective atomic scattering factors can be 
altered by using a wavelength close to an absorption edge. 



Fig. 25. Scattered x-ray intensity versus scattering angle, 8, for amorphous Ni 81 B 19 . (After Cargill, private 
communication, 1979). 
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Fig. 26. Partial pair distribution functions for amorphous Ni 81 B 19 , as measured (full curves) by Lamparter et 
al. [1982] and as calculated (broken curves) by Dubois et al. [1985] from the ‘chemical twinning’ model. 

(c) Polarized neutrons: The scattering from ferromagnetic samples is different for 
different planes of polarization. 

(d) Isotopic substitution: Isotopic substitution leaves the structure of the glass 
unaffected, but may strongly alter neutron scattering lengths. The PPDFs in fig. 26 were 
determined in this way. Three Ni 81 B 19 samples were prepared, one with natural nickel, 
one with 62 Ni and one with a mixture of “Ni and 62 Ni to give an effective scattering 
length of zero. In this last case there was no scattering by nickel, and Fourier inversion 
of the neutron scattering data yielded directly the B-B PPDF. 
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(e) Isomorphous substitution: It may be possible to substitute one element for another 
of similar type so that the structure is likely the same. X-ray scattering can then be used 
with the different atomic scattering factor. 

The experimental and computational procedures for determination of RDFs and 
PPDFs have been described by Wagner [1983], Egami [1981] and Suzuki [1983]. 
Assessment of the accuracy of pair distribution functions (obtaining a figure of merit 
analogous to the R (Rietveld) values currently used to assess the quality of a crystal 
structure determination) has been considered by Livesey and Gaskell [1982]. 

The basis for structure determination is the comparison between measured and 
calculated PDFs. Yet even without such comparison the measured PDFs reveal basic 
features of the glassy structure. The first peak in the radial distribution function indicates 
the nearest neighbor separation and the area under the peak gives the average co- 
ordination number. The separation is as expected for a condensed phase, and the coordi- 
nation numbers are typically in the range 11.5 to 14.5, indicating that metallic glasses are 
densely packed, as would be indicated by an extrapolation of liquid properties. When 
PPDFs are examined it is found that the distribution of the different elements in a glass 
can be far from random. This was first established by Sadoc and Dixmier [1976] who 
found that in a Co 8] P 19 glass the relevant PPDF showed that P-P nearest neighbor pairs 
were completely avoided. This avoidance of metalloid neighbors seems to be a general 
feature of transition-metal-metalloid alloys (types (i) and (ii) in § 2.2) in their normal 
composition ranges. When the metalloid content is increased, as for example in Ni^Bjg 
studied by Cowlam etal. [1984], metalloid contact becomes inevitable, but is still much 
less prevalent than in a random alloy. Table 3 shows evidence for such chemical ordering 
in a variety of glasses. The derivation of a suitable order parameter has been considered 
by Spaepen and Cargill [1985]. 

For the metal-metalloid glasses showing strong chemical ordering, more complete 
data from measured PPDFs are given in table 4. The coordination numbers, of metal 
atoms around a central metalloid, 8 to 9, and around a central metal atom, 10 to 12, are 
both well defined. Together with the rather small spread in interatomic distances, a 
limited variability of local atomic environments is suggested. Further progress on 
elucidating glass structure requires comparison with structural models. 

The modelling must use as much incidental information as can be gleaned about the 
sizes, bonding and coordination of the constituent atoms: observations of density, atomic 


Table 3 

Observed and hypothetical coordination numbers in some metallic glasses. 


Alloy, A-B 

A neighbors of B 
(actual %) 

A neighbors of B if there 
were no ordering (%) 

^ e 80®20 

too 

80 

Ni 81 B 19 

too 

81 

^>64^36 

88.8 

64 

Cu 57 Zr 43 

56.2 

57 


70.1 

65 

Z r 5? Be 4 3 

70 

57 
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Table 4 


Atomic coordination in metal-metalloid metallic glasses. 


Alloy 

Atom pair 

r (nm) 

cr (nm) 

Z 

Co 8l P 19 

P-Co 

0.232 

0.0105 

8.9 + 0.6 


Co-Co 

0.254 

0.0155 

10.0 + 0.4 

20 

B-Fe 

0.214 

0.0096 

8.6 


Fe-Fe 

0.257 

0.0165 

12.4 

Ni 31 B 19 

B-Ni 

0.211 

0.0142 

8.9 


Ni-Ni 

0.252 

0.0147 

10.5 


P-Fe 

0.238 

0.018 

8.1 


Fe-Fe 

0.261 

— 

10.7 

Pd M Si 16 

Si-Pd 

0.240 

0.0106 

9.0 ± 0.9 


Pd-Pd 

0.276 

0.0154 

11.0 + 0.7 


The mean interatomic spacing, r, the standard deviation, cr, and the coordination number, from Gaskell [1985]. 


radii in crystalline forms, interaction potentials (Lennard-Jones, Morse, etc.), interbond 
angles where appropriate are all taken into account in constructing a model. The actual 
construction was initially done physically, with balls and springs, or just steel balls, but 
now is always done by computer simulation. Models for covalently bonded amorphous 
solids such as silicon involve distinct problems from metallic glass models (only the 
former incorporate directed atomic bonds) and we discuss only the latter here. There are 
many difficulties in comparing measured and computed RDFs. The experimental 
determinations are limited by the range of scattering vector lengths which is accessible, 
and the modelling is limited by the finite size of the model; the problems and best 
techniques are comprehensively discussed by Finney [1983], 

Probably the first structural model suggested for metallic glasses was that based on 
microcrystallites, i.e., the suggestion that the “glass” is simply a polycrystalline solid in 
which the grain size is very small. Early work by Cargill [1970] showed clearly that 
the microcrystallite model does not provide good quantitative fits to measured RDFs of 
glasses such as Ni 76 P 24 . Recently the interest in nanocrystalline materials has led to a re- 
examination of this question. There are examples of apparently glassy alloys for which 
the structure can be successfully modelled assuming nm-sized grains of an icosahedral 
phase. A key observation, however, is that such materials do not transform in the way 
expected for true glasses; on annealing they transform to a polycrystalline aggregate of 
larger grains by a process of continuous grain growth rather than by nucleation and 
growth (Chen and Spaepen [1991]). For true metallic glasses, which form the great 
majority of cases, structural modelling has mostly been based on either the dense random 
packing (DRP) model of Bernal [1964], or the local-coordination model of Gaskell 
[1979], 

Bernal’s empirical model originally referred to the DRP of hard spheres all of the 
same diameter and was based on physical measurements with steel balls kneaded in a 
rubber bag. The packing is statistically reproducible, and the structure can be considered 
to be made up of only five simple polyhedra (canonical holes), shown in fig. 27, the 
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Fig. 27. Polyhedra formed by dense random packing of hard spheres, according to Bernal. 


smaller holes (tetrahedra and octahedra) being in the great majority. The topology of 
DRP is related to that of the continuous random network first proposed for tetrahedrally 
bonded oxide glasses by Zachariasen [1932] (Connell [1975], Chaudhari et al. 
[1976]). The average coordination number in the DRP of hard spheres is well defined 
and is 12.3 (Scott and Mader [1964]). Models based on the DRP of hard spheres give 
fair agreement between computed and measured RDFs. Two deficiencies in the model 
can be identified: the spheres in the modelling are infinitely hard, and only one type of 
atom is included. With realistic interatomic potentials, sequential computer relaxation of 
soft spheres can be used to increase the packing density of the model in better agreement 
with measured densities. In this way the fit between computed and measured RDFs 
becomes rather good in some cases, notably for vapor-deposited nominally pure metals 
(von Heimendahl [1975]). The extension of this type of work to two-component glasses 
was undertaken by Boudreaux and Gregor [1977]. For glasses of the metal-metalloid 
type they constructed a DRP with a mixture of metal and metalloid atoms and imposed 
a rule that there should be no nearest-neighbor metalloid pairs. While a model of this 
type gives a much better fit to experimental RDFs, there are still significant problems, 
particularly in matching PPDFs. 

The modelling of metal-metalloid glasses is now based on the local coordination 
model of Gaskell [1979], The coordination of the metalloid typically by nine nearest- 
neighbor metal atoms in glasses of this type (table 3) is the same as in many crystalline 
metal-metalloid compounds. In these compounds the basic coordination polyhedron is a 
trigonal prism (giving six nearest neighbors, with three more slightly further away). The 
prisms are linked in one of two ways (fig. 28). Gaskell [1979] proposes that the glass 
structure is a non-periodic array of such prisms. Computer models assembled in this way 
and then relaxed under appropriate atomic potentials give rather good fits to measured 
distribution functions. The second nearest neighbor metal-metalloid distance in measured 
PPDFs can correspond distinctly to one or the other of the distances expected from the 
types of linkage in fig. 28. Thus there is evidence for medium-range order in the glasses 
in which the packing of the prismatic units resembles that in particular crystal structures. 
More evidence for this is provided by the structure in the B-B PPDF in fig. 26 out to 
quite large distances; this structure cannot be reproduced by a model based on randomly 
packed trigonal prisms. The prisms themselves appear to arise not from any directed bonding, 
but from a compromise between the tendency to the closest possible packing and the need for 
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Fig. 28. The packing of trigonal prisms as found in (a) Fe 3 C and (b) Fe 3 P. Each prism has a metalloid atom at 
its center. The long-dashed line indicates the second-nearest metalloid-metal distance in each case. (After 
Gaskell [1983]). 


space to insert metalloid atoms between the preponderant metal atoms (Gaskell [1985]). 

The local coordination model can be adapted for cases other than metal-metalloid 
glasses around the 20 at% metalloid composition. Mehra etal. [1983] have applied it to 
(Ru 84 Zr I6 ) 1 _ J[ B J[ glasses with 0.40 <x< 0.53 — i.e., an unusually high metalloid content. 
They show that no DRP model can give even moderate agreement with the scattering 
data from such glasses, while a form of irregular prismatic packing based on the Ru 2 B 
crystal structure gives good, though not perfect agreement of RDFs. 

The local coordination model has been developed into the chemical twinning model 
by Dubois etal. [1985a], who point out that trigonal prismatic coordination is common 
not only in metal-metalloid compounds but also in many metal-metal compounds in 
which the minor component has a radius 65 to 85% that of the major component. The 
model then can be applied to a wide range of glass types. In crystalline compounds 
trigonal prisms can be considered to result from chemical twinning; for example the 
cementite (Fe 3 C) structure is obtained in this way from the hep lattice. The smaller atoms 
of the minor component are on the twinning planes, and the frequency of these planes is 
adjusted to match the composition. In modelling glassy structures, regular twinning is 
permitted only over a short correlation length of the order of 1 to 2 nm. The type of 
twinning is chosen to match the counterpart crystalline compound. As shown by the 
dotted curves in fig. 26, the fits to PPDFS obtained in this way can be quite good. The 
chemical twinning model has the distinction of being perhaps the only structural model 
for metallic glasses which has led to the discovery of new glass-forming compositions. 
According to the model, glass-forming ability should be greatest when there are 
competing types of twinning in the alloy. This idea has been used in developing glass- 
forming compositions in aluminum-based alloys (Dubois etal [1985b]). 

As exemplified in fig. 7, most glass-forming alloy systems are able to form glasses 
over a range of compositions. In many such systems, physical properties vary substan- 
tially with composition. Structural models of the kind discussed above give some reason 
to expect that there could be clear discontinuities in some properties and fig. 29 gives 
some measured examples. 

It has been emphasized that pair distribution functions are limited in the information 
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Fig, 29. Instances of discontinuities in the variation of properties with composition in metallic glasses: (a) 
electrical resistivity and nearest-neighbor distance in (Mo* l6 Ru (l4 ) 1 _ J B ;t glasses (after Johnson [1982]); (b), 
density, d, and Vickers microhardness, H y , in Fe 0J , s _ y Si > B l)i i S glasses (after Al Boat et al. [1982]; (c) densities 
of Fe-B glasses (after Ray etal. [1977]). 
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which they provide. For this reason there has been much interest in more local probes of 
atomic environments. One such technique is EXAFS in which the fine structure of an X- 
ray absorption edge is analysed to obtain information on the local environment of 
absorbing atoms. Unfortunately a structural model is still required and the details of the 
model are important (so that the influence of atom positions and atom identities is hard 
to separate). Nonetheless, useful results have been obtained; generally they confirm the 
conclusion from PPDF studies that the nearest neighbor environment in a glass is well 
defined and similar to that in the corresponding crystal (Wong [1981], Gurman [1985]). 
We turn now to techniques which can provide some information on the angular distribu- 
tion of nearest neighbors. Nuclear magnetic resonance (NMR) and Mossbauer spectro- 
scopy, through the interaction of the nuclear quadrupole moment of the probe atom with 
the local field gradient, can provide information on the symmetry of the local environ- 
ment. The extraction of further information is unfortunately very model-dependent. NMR 
experiments in general show local symmetry similar to that found in the corresponding 
crystalline phases. In Ni-B glasses, for example, Pannisod etal. [1983] have shown that 
the variation of the environments around "B nuclei is modelled well by a combination 
of the environments in crystalline Ni 3 B and Ni 4 B 3 . Each of these phases has trigonal 
prismatic coordination of the boron. Specifically the environment of the boron in the 
glasses is not similar to that in crystalline Ni 2 B (which does not have trigonal prisms). 
Mossbauer spectroscopy is a less discriminating test of structural models, but again there 
is evidence for crystal-like structural units (Whittle et al. [1985]) and for local chemical 
order (Scholte el al. [1985]). 

Field-ion microscopy (FIM) permits the imaging of individual atoms on the surface 
of a needle tip, and in combination with an atom probe the identity of the atoms can be 
established. By field evaporation, atomic layers can be stripped from the tip, and in this 
way a catalogue of atomic dimensions in three dimensions can be built up. Even though 
this has not been fully achieved for a metallic glass, there have been significant results. 
For example, Piller and Haasen [1982] have detected that a Fe 40 Ni 40 B 2 o glass is 
inhomogeneous on a scale of 2 to 4 nm. One phase is boron-poor, the other contains 
- 25% B, and Piller and Haasen interpret the structure as arising from a competition 
between DRP and defined local coordination. De Hosson [1983] has determined planar 
RDFs from FIM images of the palladium atoms in Pd 77 Cu 5 Si 18 glass, and has shown that 
they are consistent with RDFs obtained from scattering experiments. Of greater import- 
ance is that he has determined the angular distribution of nearest-neighbor interatomic 
vectors (unobtainable from scattering experiments). This distribution has peaks at 60°, 
90°, 120° and 150°. The strong peak at 60° is evidence for tetrahedral packing, but the 
other peaks suggest that other polyhedra may be present. 

High-resolution transmission electron microscopy (HREM) is not a useful tool for 
studying the structure of amorphous materials at the nearest-neighbor level; the resolution 
is barely sufficient and there is a severe problem with superposition even in thin foils. 
On the other hand, HREM can be a useful probe of medium-range structure. Gaskell 
and Smith [1980] have shown that in a Pd-Si glass there are domains about 2 nm across 
in which atomic positions appear to be correlated. This is consistent with the correlations 
observed in PDFs and may lend support to the chemical twinning model discussed above. 
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4.2. Relaxation 

When a metallic glass is annealed in the vicinity of its glass transition temperature, 
many physical and mechanical properties change, in pursuit of metastable equilibrium.. 
(The equilibrium must be metastable because the equilibrium state in this temperature 
range is crystalline). All such changes entail atomic diffusion, which has been extensively 
studied in metallic glasses. The phenomenology and mechanism of diffusion in metallic 
glasses is fully treated in ch. 7, §9, and will not be repeated here. However, that 
treatment omits reference to one striking characteristic of diffusion in metallic glasses, 
i.e., the diminution in diffusivity when a metallic glass is annealed: in effect, the process 
of diffusion tends to choke off that process itself. This phenomenon is analogous to the 
gradual drop in diffusivity when a crystalline metal has been quenched from a high 
temperature so that excess vacancies are retained, and these vacancies gradually anneal 
out (§5.1, below). 

Figure 30 shows an instance of an anneal-induced fall in diffusivity, in this instance, 
of boron in a nickel-niobium glass (Kijek et al. [1986]). Very precise measurements of 
x-ray absorption (Chason and Mizoguchi [1987]) have shown (fig. 31) that the fall in 
diffusivity in such a glass is correlated with an increase in density, i.e., with a reduction 
in free volume, which is the “carrier” of diffusion. The drop in diffusivity saturates after 



Fig. 30. Diffusivity of “B in Ni^Nb^ glass. O unrelaxed; * relaxed 420 s at 878 K. (After Kijek etal. [1986]). 
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Fig. 31. Diffusion in a metallic glass as affected by a relaxation anneal, (a) Interdiffusivity, D, in a Fe-Ti 
amorphous multilayer at 160°C, plotted against change in atomic volume, AV/V . The solid line is a fit to the 
free-volume theory of diffusion. (After Chason and Mizoguchi [1987). (b) Relative change in the density of 
a Pd a2 Si 18 glass as a function of annealing time at 260°C, determined from changes in x-ray absorption. (After 
Chason etal [1985]). 


a time if the annealing temperature is not too low: this indicates that the free volume 
tends towards a metastable equilibrium level characteristic of the annealing temperature. 
This aspect of diffusion is further discussed by Cahn [1990]. 

The change in properties that accompanies an anneal of a metallic glass in the 
neighborhood of T g is genetically called relaxation. This is by no means specific to 
metallic glasses: for instance, the refractive index of an oxide glass, of the kind used for 
lenses, changes steadily as it is annealed close to its glass transition. 

Properties which have been pressed into service to examine the consequences of 
structural relaxation include, in particular, density or length changes, elastic modulus, 
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Table 5 


Structural relaxation in amorphous alloys: properties affected and types of relaxation. 


Property affected by relaxation 


Type of relaxation 1 


Selected references 


(i) 

(ii) 

(iii) 

(iv) 

(v) 


Density 

I 


Y 



Gerling etal. [1988], TOULOUI etal. [1985] 

Length changes 

D 


Y 



Huizer and vanderBeukel [1987] 

Thermal expansion coefficient 

D 





Komatsu etal. [1983] 

Enthalpy 

D 


Y 


Y 

Woldt [1988], Inoue etal. [1983] 

Young’s modulus 

I 


Y 

Y 

Y 

Koebrugge and vanderBeukel [1988], Chen 
etal [1983] 

Hardness/yield stress 

I 




Y 

Bresson etal. [1982], Deng and Argon [1986] 

Viscosity 

I 

Y 


Y 

Y 

Volkert and Spaepen [1989] 

Relaxation lime for stress relief 

I 

Y 


Y 

Y 

Williams and Egami [1976] 

Diffusivity 

D 




Y 

Akhtar and Misra [1986] 

Curie temperature 

I, D 

Y 

Y 

Y 

Y 

Greer and Leake [1979], Egami [1978a] 

Electrical resistivity 

I, D 


Y 



Kelton and Spaepen [1984], Balanzat etal. 
[1985] 

Temperature coefficient of 

D, I 




Y 

Kelton and Spaepen [1984], Balanzat etal. 

resistivity 






[1985] 

Internal friction 

D 





POSGAY etal. [1985] 

Strain at fracture 

D 


Y 


Y 

Inoue etal. [1983], Mulder etal. [1983] 

Saturation magnetization 

D 




Y 

Nishi etal. [1986] 

Magnetic after-effect 

D 


Y 

Y 

Y 

Guo etal. [1986] 

Induced magnetic anisotropy 



Y 

Y 


Guo etal. [1986] 

Structure factor (X-ray, neutron) 

Y 




Y 

CAcruflo etal. [1989] 

Extended X-ray absorption fine 

Y 





Baxter [1986] 

structure 







Positron lifetime 

D 





deVries etal. [1988] 

Superconducting transition 

D 


Y 



Johnson [1981] 

temperature 

M&ssbauer spectra 

Y 

Y 



Y 

StrOm-Olsen etal. [1988] 

Corrosion resistance 

I 





Masumoto et al. [1986] 

Density of states at Fermi level 

I, D 




Y 

Zougmore etal. [1988] 

Debye temperature 

I 




Y 

Zougmor£ et al. [1988] 

Magnetostriction coefficient 

I 




Y 

Cunat etal. [1988], Tarn6czi etal. [1988] 


“ The types of relaxation are: (i) monotonic relaxation; (ii) reversible relaxation near the glass transition 
temperature 7^; (iii) reversible relaxation well below T g \ (iv) memory effects; and (v) effects of production 
conditions. I and D indicate an increase or decrease, respectively, of the proper value in monotonic 
relaxation; Y indicates occurrence of that type of relaxation. 


viscosity, diffusivity (as we have seen), Curie temperature (for ferromagnetic glasses), 
resistivity, and a number of others as shown in table 5. 

One can make a distinction between monotonic relaxation and wholly or partially 
reversible relaxation. Fig. 32 shows some examples of observed relaxation kinetics. In 
this figure, (a), showing change of density deduced from a reduction in length at 623 K, 
is a good instance of monotonic relaxation: if the annealing temperature is raised above 
623 K, the length does not again increase. The elastic (Young’s) modulus, however, 







Fig. 32. Manifestations of relaxation of metallic glasses: (a) Irreversible length change of a Fe^NimjBa, glass; 
(b) short-term reversible changes in Young’s modulus of Co^FejNiujB^Sin glass cycled between 623 and 723 
K; (c) irreversible changes in Young’s modulus in the same glass at various temperatures, starting from the as- 
quenched state; (d) changes in the Curie temperature of FegoB^ glass as a function of holding time at various 
temperatures above the Curie temperature ... substantially reversible; (e) reversible changes of coercive field 
of the above-mentioned Co-rich glass, a composition of low magnetostriction, during cyclic annealing. 
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shows both monotonic behavior (graph (c)) and reversible behavior once the monotonic 
process has gone to completion (graph (b)). The ferromagnetic Curie temperature shows 
this same double behavior, though in fig. 32 there is only space to show the monotonic 
form. These two main types of relaxation are further subdivided in the table, according 
in particular to the temperature range in question: the meaning of the columns is 
indicated at the bottom of the table. It is striking that (column (iii)) even though the 
kinetics of structural rearrangement well below T g are so slow that monotonic relaxation 
cannot go to completion (i.e., to metastable equilibrium), yet reversible cycles of the type 
seen in fig. 32b are possible in this low temperature range. Egami [1978] first observed 
this with respect to the Curie temperature; he found cycles just like those in fig. 32b, 
cycling between 250° and 300°C. Egami proposed that there are two distinct types of 
atomic process during structural relaxation: topological short-range ordering (TSRO) 
involves a rearrangement of atomic sites and a reduction of free volume (increase of 
density); it is this process that is associated with normal relaxation near T g . The other 
process, compositional (or chemical) short-range ordering (CSRO), is a statistical 
rearrangement of chemically distinct species on fixed sites, and can occur at lower 
temperatures. This classification has been widely accepted as providing the best 
interpretation of the many observations of relaxation in metallic glasses. Egami [1986] 
published a critical examination of his hypothesis some years after he first put it forward. 
— It would provide useful confirmation of Egami’s central hypothesis, that reversible 
property changes are associated with changes in CSRO, to compare the magnitude of 
such changes in those glasses which are known to have substantial amounts of CSRO in 
equilibrium, like Cu-Ti for instance, and others, such as Fe-Ti, which have been found 
to have little or no equilibrium CSRO. No such comparisons have yet been performed. 
However, there is mounting evidence, when the magnitude of reversible property 
relaxation is studied as a function of composition for a series of glasses in a ternary 
system, that the magnitude is greatest for compositions which permit the maximum 
exchange of species, and this supports the identification of reversible relaxation with 
changes in CSRO (Greer [1993b], StrOm-Olsen et al. [1988]). 

Direct structural studies by XRD have provided only limited evidence. Changes in 
TSRO are so small that any changes in x-ray scattering measurements, like those shown 
in fig. 25, are so slight that they often vanish in the experimental scatter. However, 
Egami [1978b] did succeed in demonstrating significant changes in the x-ray diffraction 
pattern of a metallic glass, and showed that the densification on annealing involves a 
change in coordination number and not a reduction in the mean nearest-neighbor 
distance. The same features have been found for relaxation of amorphous silicon, where 
changes in the x-ray pattern are more substantial (Roorda [1990]). — Reversible CSRO 
changes have not been detected by XRD, but rather by Mossbauer measurements, which 
are highly sensitive to nearest-neighbor geometry ((StrOm-Olsen et al. [1988]). 

X-ray diffraction has, however, been successfully exploited in relation to another 
form of annealing — stress-induced directional ordering. (Hilzinger [1982]). This is 
closely related to field-induced directional ordering, in which a magnetic field is applied 
while the metallic (ferromagnetic) glass is heated: the glass so treated possesses an 
induced uniaxial magnetic anisotropy, and it has been well established since the 1950s 
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(Graham [1959]), for crystalline solid solutions which behave similarly, that this effect 
is caused by directional order, that is, a very slightly different proportion of chemically 
unlike nearest neighbor pairs measured parallel and perpendicular to the originally 
applied field. In fact, the rate at which the anisotropy is established provides a rough 
measure of diffusion rates in glasses (Luborsky [1980]). It appears that the stress- 
induced effect is more pronounced: Suzuki etal. [1987], in a striking experimental tour 
de force, were able to show directly, by energy-dispersive x-ray diffraction from a stress- 
annealed Fe 40 Ni 40 Mo 3 Si 12 B 5 glass, done successively with two orthogonal diffraction 
vectors, that directional short-range order (called “bond-orientational anisotropy” by 
them) was induced by the stress-annealing and decayed gradually on subsequent stress- 
free annealing. These investigators presumed that what they had observed was really a 
change in TSRO, with changes in the absolute numbers of bonds in the two directions; 
however, a later study from the same laboratory (Tomida and Egami [1991]) recognized 
that chemical as well as topological anisotropy must be involved in the phenomenon. 
These exact studies show how difficult it is to secure definitive information about TSRO 
and CSRO. 

There is no space here to discuss in any detail the effects of irradiation on TSRO and 
CSRO: The effects of neutron irradiation on density and associated changes in ductility 
(see first entry in table 5) are linked with changes in TSRO, while the studies of Hillairet 
and his group (e.g„ Hillairet etal. [1985]), using electrical resistivity as a marker, have 
generated good though indirect evidence of substantial changes in CSRO within atomic 
displacement cascades in glasses such as Ti 50 Be 40 Zr 10 . 

The kinetics of relaxation became a disputed problem when the crossover effect was 
discovered, in respect of the Curie temperature and elastic modulus (Greer and Spaepen 
[1981]; Scott and KurSumovic [1982], respectively). Another name for what was 
discovered is memory effect. What this involves can conveniently be illustrated by an 
older study of changes in refractive index of an oxide glass (Macedo and Napolitano 
[1967]). Figure 33 and its caption explain how such an experiment is done. If there were 
only a single relaxation process, the plot would simply show a horizontal line when the 



Fig. 33. Refractive index vs time in a “crossover" experiment on a borosilicate glass. The glass was held at 
temperature T, until the index reached a value corresponding to equilibrium at a second, higher, temperature 
T v The temperature was then increased abruptly to T 2 . instead of remaining constant, the index followed the 
curve shown, returning eventually to the T 2 equilibrium value. (After Macedo and Napolitano [1967]). 
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temperature was raised from T { to T 2 . Experiments like this show that there is not a one- 
to-one relationship between a property and the internal state of the glass, in the sense that 
the same value of the property can be associated with distinct structural states generated 
by different heat-treatment programs. There must be at least two distinct processes, but 
in fact with respect to metallic glasses, detailed analysis has shown (Gibbs etal. [1983], 
Leake etal. [1988]) that an entire spectrum of processes with distinct activation energies 
must be involved. This kind of experiment, again, shows the near-impossibility of 
associating different aspects of relaxation with precisely defined structural changes. 
Nevertheless, a number of investigators have attempted to interpret relaxation kinetics in 
terms of the reduction of free volume, and this kind of theory (e.g., van den Beukel 
and Sietsma [1990]) is linked with theories of the glass transition, which are also 
concerned with ideas of critical free volume. 

Some remarks about how structural relaxation influences the creep behavior of 
metallic glasses will be found in § 4.4, below. 

4.3. Crystallization 

As emphasized in §2.1, a metallic glass can be considered to be an undercooled 
liquid, and as such it is susceptible to crystallization. While there may be alloy composi- 
tions for which a metallic glass is more stable than any single crystalline phase, the glass 
always has a higher free energy than a combination of crystalline phases. Crystallization 
of metallic glasses has been widely studied, and a number of reviews are available (e.g., 
Koster and Herold [1981], KOster [1984], KOster and Schunemann [1993]). There 
are several reasons for interest. It is important to characterize the stability of metallic 
glasses in order that desirable properties may be preserved under service conditions. 
Crystallization of a glass (also known as devitrification) is very much slower than 
conventional solidification of a liquid, and it can readily be interrupted by lowering the 
temperature. This facilitates fundamental studies of the processes of nucleation and 
growth, in particular of nucleation kinetics difficult to study quantitatively in the liquid 
state. A glass is formed only if crystal nucleation and growth are substantially avoided 
during liquid quenching; thus crystallization studies are of relevance for understanding 
glass formation. Finally, it is increasingly recognised, and is now a matter of commercial 
practice, that useful partially or fully crystalline microstructures can be obtained by 
crystallizing metallic glasses; this processing route is analogous to that used for glass- 
ceramics. 

We compare first the basic kinetics of crystallization from the glassy state and 
solidification from the liquid state. Figure 34 is a schematic illustration, for a glass- 
forming alloy, of the variation of the homogeneous crystal nucleation frequency and the 
crystal growth rate over the temperature range from the liquidus down to below the 
glass-transition temperature T g . As the undercooling is increased, the thermodynamic 
driving force for transformation to the crystalline state increases and the atomic mobility 
decreases. The forms of the temperature dependences of homogeneous nucleation and 
growth are readily derived from standard expressions; both show maxima at intermediate 
temperatures because of the lack of driving force at high temperature and the lack of 
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Fig. 34. Schematic representation of the homogeneous nucleation frequency and the growth rate of crystals in 
an undercooled melt or glass. The melting temperature, T m , and the glass transition temperature, 7 g , are shown, 
together with (shaded) the regions for conventional solidification and for crystallization of amorphous alloys, 
The temperature ranges in which homogeneous or heterogeneous nucleation dominate are marked, indicating 
that either can dominate for crystallization of amorphous alloys. 


mobility at low temperature. The maximum rate occurs at a lower temperature for 
nucleation than for growth, because of the need for the driving force to overcome the 
solid/liquid interfacial eneigy. As seen in fig. 34, conventional solidification occurs at 
rather low undercooling with moderate growth rates on heterogeneous nuclei. Crystalli- 
zation of a glass occurs near T g at very low growth rates on nuclei which could be 
heterogeneous (predominant at lower temperature) or homogeneous. In conventional 
solidification the rate of latent heat production is matched by the rate of external heat 
extraction, but there can be significant temperature gradients. As liquids are undercooled 
further, nucleation may occur at temperatures for which the atomic mobility is still large; 
crystal growth is then rapid and the release of latent heat is more rapid than the heat 
extraction, leading not only to temperature gradients but to a rapid overall temperature 
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rise ( recalescence ). For crystallization of the glass, however, the crystal growth is so 
slow that the transformation can be isothermal, with negligible temperature gradients. 
The solidification processes considered in chapter 8 mostly occur close to the liquidus, 
at temperatures above the range in which recalescence is significant. The crystallization 
processes of concern in this chapter mostly occur near the glass transition temperature at 
temperatures below the recalescence range. While solidification and crystallization can 
be viewed within one framework as indicated in fig. 34, significant differences can be 
expected. Taking dendritic growth as an example, the rate-limiting processes in solidifi- 
cation are heat and solute redistribution at the growing tip, whereas in crystallization the 
rate is limited by the mobility of the crystal/ glass interface itself and by solute redis- 
tribution. 

Nevertheless, the common aspects of the processes in the two regimes have for 
example enabled heterogeneous nucleation in a metallic glass to be used to study particle 
inoculation (grain refinement) processes in conventional solidification processing. In such 
a study, Schumacher and Greer [1994] have embedded inoculant TiB 2 particles in an 
amorphous aluminum alloy. Crystallization of the alloy to a-Al occurs on the particle 
surfaces, and novel microscopical studies of the nucleation mechanism are possible 
because the crystallization proceeds so much more slowly than conventional solidifica- 
tion. 

Figure 35 shows a typical differential scanning calorimeter (DSC) trace obtained on 
heating a metallic glass. In the DSC the power necessary to heat the sample through any 



Temperature (t) 

Fig. 35. Differential scanning calorimeter record during the heating, at 20K/min, of Pd, 7 5 Cu 6 Si lt5 glass. 6 mg 
sample. The ordinate represents power released (exotherm, positive displacements) or absorbed (endotherms, 
negative displacements). T % is clearly differentiated from the first crystallization peak, 7),. 
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small temperature interval is automatically compared with the power necessary to heat 
a reference sample (undergoing no transformations) through the same temperature 
interval. As plotted in fig. 35, an upward deflection corresponds to heat release from the 
sample. In this example of a comparatively easy glass-former, the glass transition is 
evident and shows the increase in specific heat when the glass changes to liquid. Before 
the glass transition is reached, temperature cycling experiments can reveal irreversible 
heat releases corresponding to structural relaxation in the glassy state (§ 4.2). The glass 
transition is followed by crystallization. As shown in this case, there are often separate 
peaks in the DSC trace revealing distinct stages in the crystallization of the sample, the 
integrated area under each peak giving the associated enthalpy change. The different 
stages may correspond to successive crystallization of different parts of the sample 
(perhaps arising from composition changes induced by the crystallization processes 
themselves), or to transformations from metastable crystalline states towards the final 
equilibrium state. As the thermodynamic driving force for crystallization of the glass is 
very high and the atomic mobility is low, it is likely that there will be metastable phases 
(or combinations of phases, for example a metastable eutectic) before equilibrium is 
reached. 

In thermodynamic terms, three types of initial crystallization can be distinguished. In 
polymorphic crystallization the glass transforms to a single crystalline phase of the same 
composition. The crystalline phase is not normally a solid solution of the base element 
because glass formation would be difficult if such a crystallization were possible and 
because any crystallization which did occur would likely involve solute partitioning. The 
phase formed in polymorphic crystallization is normally an intermetallic compound. In 
primary crystallization a single crystalline phase is formed which has a composition 
different from that of the glass. Solute partitioning is involved and the crystal growth is 
often dendritic. Complete transformation to the initial phase may not be possible and the 
remaining glassy matrix of changed composition may undergo a distinct crystallization. 
The third type of crystallization is eutectic, in which two crystalline phases grow 
cooperatively. There is no overall composition difference between the glass and a 
eutectic colony. Within the colony straightforward lamellar structures are only rarely 
observed, and the microstructure is on a fine scale which does not vary strongly with 
crystallization conditions. Microstructures arising from the three types of crystallization 
are shown in fig. 36. Ternary eutectic crystallization is also known, in which three phases 
form cooperatively. The growth rate of the eutectic colonies is particularly slow in such 
cases, which are usually associated with very easy glass formation (Garcia-Escorial 
and Greer [1987]). Another form of decomposition of a glassy phases is separation into 
two glassy phases (Tanner and Ray [1980]). These phases can then crystallize separate- 
ly. The driving forces for the various forms of decomposition are illustrated schemati- 
cally in fig. 37. 

Experimental studies of crystallization can be based on isothermal anneals or on 
continuous heating of the kind used for fig. 35. In either case, DSC (which in effect 
measures enthalpy changes in the sample) or measurement of other property changes 
(notably electrical resistivity) can be used to derive the volume fraction jc transformed as 
a function of time t. Isothermal jc vs t measurements are often analysed according to 
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Johnson-Mehl-Avrami kinetics, x(t) - 1 - exp[-(A7)"], where K is a temperature-depend- 
ent rate constant and n is an exponent which can relate to the modes of nucleation and 
growth (see, e.g., ch. 28, § 3.2). It has been frequently pointed out, however, that such 
analyses are subject to errors of interpretation and must be supplemented by microscopi- 
cal studies. In particular, if the transformation in a sample differs from place to place, 
quite erroneous conclusions may be reached, as has been elucidated by Holzer and 
Kelton [1991], X-ray diffraction and related techniques are useful for identifying the 
crystalline products, but microstructural techniques are essential for understanding 
transformation kinetics. The scale of crystalline volumes developing on annealing a 
metallic glass can vary enormously, from a few nanometers up to as much as a milli- 
meter in a very good glass-forming alloy. Depending on the scale, transmission electron 
microscopy (TEM), scanning electron microscopy of polished and etched surfaces or 
optical metallography may be appropriate. Recently, intense radiation sources, such as X- 
rays from a synchrotron, have permitted structural studies with very fine time resolution. 
Time-resolved studies during crystallization can reveal transient phases and other 
important aspects of the process (KOster etal. [1991]). 

An example of a kinetic analysis is given in fig. 38 from the work of Greer [1982] on 
FegoBj,, glass which crystallizes eutectically to a mixture of a-Fe and a metastable boride, 
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Fig. 36. Transmission electron micrographs showing: (a) Polymorphic crystallization: Fe 76 B 24 glass annealed at 
400°C for 10 min (reproduced with the kind permission of U. KOster). (b) Primary crystallization: Fe sl 5 B U5 Si 4 
glass annealed at 445°C for 20 min, showing a -iron dendrites, (c) Eutectic colony in Fe^Ni^B^, glass, annealed 
1 h at 390°C, showing y-(Fe, Ni) + (Fe, Ni) 3 B. Marker= 10/tm. 




Fig. 37. Schematic ffee-energy curves for a binary alloy system, showing how the glass or amorphous phase 
can lower its free energy by undergoing: (a) separation into two amorphous phases; (b) polymorphic 
crystallization into the supersaturated solid solution a; (c) polymorphic crystallization to the metastable 
compound jS; (d) polymorphic crystallization to the equilibrium phases y, (e) primary crystallization of a; (0 
eutectic crystallization to a and metastable jS; and (g) equilibrium eutectic crystallization to a and y. 
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Fig. 38. Crystallization kinetics, Ax/ At, deduced from continuous DSC heating data, for Fe 80 B 20 glass 32 mm 
thick (solid curves), compared with fitted curves derived from a model. The lower peak corresponds to a 
preannealed sample. (After Greer [1982]). 


tetragonal Fe 3 B. The traces In fig. 38 are derived from DSC data and show the rate of change 
of transformed fraction during continuous heating. The data (main peaks) can be well matched 
by a numerical model which assumes eutectic growth from a fixed number of quenched- 
in nuclei. The population of these nuclei can be determined from the numerical fitting 
and was found to be of the order of 10 18 /m 3 , in agreement with direct TEM observations. 
It was suggested that these nuclei most likely arose by homogeneous nucleation in the 
melt during quenching. The higher-temperature small peaks in fig. 38 arise from a thin 
layer on one side of the melt-spun ribbons which contains no quenched-in nuclei. 

Nucleation kinetics in metallic glasses can be determined by direct counting of the 
numbers of crystals, by inference from overall transformation kinetics, or by measuring 
the crystal size distribution in a partially transformed sample. Examples of the latter are 
shown in fig. 39. The types of nucleation behavior which have been observed are 
illustrated schematically in fig. 40. Nucleation can occur during an anneal and can be 
homogeneous or heterogeneous, showing site saturation (see ch. 15, §2.1). In either case 
transient effects may be evident. Alternatively, in some cases annealing merely leads to 
crystal growth on pre-existing nuclei. Homogeneous nucleation is perhaps most clearly 
shown in Fe 4 oNi 40 P 14 B 6 glass, in which nucleation kinetics of type (b) (fig. 40) are found. 
Even for this well studied glass, however, a fully quantitative fitting of the kinetics to 
nucleation theory has remained elusive. The steady-state homogeneous nucleation 
frequency I s (nT 3 s -1 ) should be of the form (Kelton [1991]): 
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Fig. 39. "typical histogram of crystal diameter distribution in a partially crystallized metallic glass: polymor- 
phous crystallization in Fe 65 Ni 10 B 2J . (After KOster and SchOnemann [1993]). 



Time 

Fig. 40. Schematic variation in the number of nuclei with time in an isothermal anneal for the following 
nucleation types: (a) steady-state homogeneous; (b) transient homogeneous; (c) steady-state heterogeneous; (d) 
transient heterogeneous; (e) quenched-in active nuclei. (After Greer [1988]). 
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where A G v is the free energy driving force (J/m 3 ) for crystallization, a is the crystal/ 
glass (or liquid) interfacial energy (J/m 2 ), rj is the glass or liquid viscosity (Pa s), k B is 
Boltzmann’s constant, and A is a prefactor. Of the three main materials parameters in this 
expression, rj can be measured, A G v can be estimated by various methods, but cr is not 
known. As reviewed by Kelton [1991], all reasonable assumptions about the form of 
the temperature dependence of i ], A G v , and a, when substituted in the above equation, 
fail to fit the full set of measured steady-state nucleation frequencies, shown in fig. 41. 
If a is independent of temperature, then a plot of Pt) vs \/TAG vl should be straight, but 
as can be seen in the figure this is so only over the central portion of the data. The value 
of A derived from fitting the central portion is reasonable for homogeneous nucleation. 
It is possible to fit the entire data set by assuming a temperature-dependent a, but then 
the fitted value of A is much reduced and characteristic of heterogeneous nucleation 
(GrAnasy etal. [1994]). Thus, even for this well studied glass the steady-state nucleation 
kinetics cannot be interpreted with certainty. While the crystallization of metallic glasses 
offers great potential for a quantitative test of nucleation theory in metallic systems, the 
potential remains largely unrealised, particularly because of problems with determining 
the materials parameters. Some aspects are already clear, however. The non-linear 
variation of free energy of crystallization with undercooling (arising from the excess 
specific heat of the liquid, as discussed in §2.1) must be taken into account to explain 
quite basic experimental findings, for example that the activation energy for nucleation 
is typically greater than that for crystal growth (Greer [1994]). 

In addition to Fe 40 Ni, )0 P 14 B 6 there are other metallic glasses for which the nucleation 
kinetics suggest homogeneous nucleation. For some glasses (for example the aluminum- 



Fig. 41. Crystal nucleation in amorphous Fe 40 Ni 40 P 14 B 6 ; the graph points are from two experimental studies. The 
plot of log/ 5 ?}, vs 1/TAGv 2 (where P is the homogeneous nucleation frequency and 7} is the viscosity) should 
be linear for a constant (i.e., temperature-independent) solid/liquid interfacial energy. 
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rare-earth-transition metal glasses discussed later in this section) the population density 
of nuclei, up to 10 25 /m 3 , is so high that homogeneous nucleation seems the only 
plausible hypothesis. In yet other cases (for example Pe^B^ as shown in fig. 38) 
crystallization is by growth on a large population of quenched-in nuclei. The large 
population of such nuclei and its strong dependence on quench rate suggest transient 
homogeneous nucleation during the rapid quenching of the alloy. Analysis of the 
transient effects suggests that in some cases, inhibition of nucleation is essential if glass 
formation is to be possible (Kelton and Greer [1986]). This is illustrated for the case 
of Au 8 ,Si 19 in fig. 42. The dashed line in the lower figure indicates the low level of 
volume fraction transformed (KT 6 ) which is usually taken as the condition for successful 
formation of a (metallic) glass; it can be seen that this level is achieved with transient 
nucleation for a quench rate of 10 5 K/s, whereas for steady state nucleation the unrealis- 
tic rate of 10 8 K/s would be needed. Thus, crystallization studies and their analysis 
reveal that for the formation of some metallic glasses from the liquid-high quench rates 
are absolutely necessary. 

In marked contrast, there are other metallic glasses for which crystallization is clearly 
dominated by heterogeneous nucleation, the homogeneous nucleation frequency being 
very small. Examples are the good glass-formers Pd 40 Ni 40 P 20 (Drehman and Greer 



Fig. 42. (a) The number of nuclei and (b) the transformed crystal fraction calculated for quenching molten 
Au81Sil9 at various rates, assuming either steady-state or transient homogeneous nucleation. (After Kelton 
and Greer [1986]). 
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[1984]) and Pd 775 Cu 6 Si 16J (Kiminami and Sahm [1986]. That the predominant nucleation 
is heterogeneous can be exploited, as processing under very clean conditions can permit 
bulk glass formation at very low cooling rates (as low as ~ 1 K/s for Pd 40 Ni 40 P 20 
(Drehman and Greer [1984]). 

Surfaces are often preferred sites for crystallization of a metallic glass. The main 
reason is modification of surface composition, for example by oxidation. There are clear 
possibilities to manipulate surface composition to enhance or impede nucleation, or to 
select crystallizing phases different from those which would be formed in the bulk. 
Comprehensive work in this area has been by KOster [1984], Surface crystallization can 
be used to improve bulk properties of a metallic glass (particularly magnetic properties) 
by inducing a compressive in-plane stress in a thin ribbon (Ok and Morrish [1981]). 
Wei and Cantor [1989] made a detailed study of surface crystallization of Fe^Ni^B^ 
and found that it is enhanced if the alloy is first phase-separated by relaxation and then 
abraded at the surface, or else the surface is enriched in nickel by electroplating and the 
glass is then annealed. Sometimes, surface oxidation can have the opposite effect of 
inhibiting crystallization at the surface: thus, with Pd^Ni^Pjo, a thin NiO layer protects 
the glass from local loss of phosphorus; such loss is here the prime cause of preferential 
surface nucleation. Removal of the original surface can drastically modify crystallization 
behavior: thus, Ni 66 B 34 glass (which normally crystallizes from the surface with a strong 
accompanying [100] fiber texture), loses the quenched-in nuclei responsible for this if the 
surface is etched off (KOster and SchOnemann [1993]); their chapter can be consulted 
for more detailed information concerning crystallization at surfaces. 

Finally in this section we consider crystallization of metallic glasses as a route to 
desirable microstructures. This concept is well known for glass-ceramics. For metallic 
glasses it has been comparatively neglected, but is receiving increasing attention. It may 
seem strange to adopt an extreme processing route (such as melt-spinning) to avoid 
crystallization and form a glass, only then to crystallize it. The important point is that the 
crystalline product made from the glass can have a very fine and uniform microstructure, 
fine because the high effective undercooling favors nucleation, and uniform because of 
the uniformity of the glass itself and the lack of macrosegregation during crystallization. 
Devitrification offers a particularly attractive way of making nanocrystalline material 
(with grain size less than 100 nm) cheaply, controllably, and in comparative bulk without 
the need for consolidation. 

The first serious work on the exploitation of devitrification was by Ray [1981] on 
alloys of Fe, Ni, Al, Cr, Mo, Co and W in multiple combinations, with 5 to 12 at.% of 
B or other metalloids as aids to glass formation. Similar alloy developments have been 
made by Das et al. [1985] and Vineberg et al. [1985], Ni-Mo-B and Ni-Al-Ti-X-B 
and later other Ni-Mo-B alloys with added Cr were made by melt-quenching, 
comminution and consolidation by extrusion or HIPping. During the processing, ordered 
phases including Ni 4 Mo, Ni 3 Mo, Ni 2 Mo and Ni 3 (Al,Ti) are precipitated from the 
crystallized matrix, together with stable boride precipitates. This family of alloys is now 
manufactured commercially under the trade name Devitrium® (Vineberg etal. [1985]), 
and the best examples have impressive high-temperature properties, exceeding high-grade 
tool steels. Work in this area is continuing; for example, Arnberg et al. [1991] have 
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developed a range of tool steels by crystallizing Fe-Cr-Mo-C-B or Fe-Cr-Mo-C-V 
glasses. 

The most important industrial application of metallic glass crystallization has been in 
the production of the modem family of permanent magnets based on the high-coercivity 
phase Nd 2 Fe 14 B. These materials can be made either by sintering an alloy powder, by 
melt-spinning the alloy to form a glass and then devitrifying, or by direct quenching to 
a nanocrystalline structure. The melt-quenching method was introduced by Croat et al. 
[1984], and has been fully commercialized. The process has been reviewed by Buschow 
[1986]. Figure 43 provides an overview of the production possibilities by direct 
quenching or by heating a glass. The hard magnetic phase is desired, the alternative soft 
magnetic phase must be completely avoided, and a fine grain size is desirable for high 
coercivity. In the materials of most interest, grain sizes range from 14 nm to 50 nm. 

Interest in nanocrystalline magnetic materials now extends to the opposite extreme: 
very soft magnets. Glassy alloys based on Fe-B-Si are well known as soft magnetic 
materials. If the composition is modified by addition of copper (apparently to enhance 
crystal nucleation) and niobium (to impede crystal growth), crystallization of such alloys 
leads to an isotropic microstructure with a very fine grain size of 10 to 20 nm 
(Yoshizawa et al. [1988]). At this scale the local magnetocrystalline anisotropies are 
averaged out, and soft magnetic properties result. The materials, now marketed under the 
trade name Finemet®, are of interest in comparison to glassy soft magnetic alloys because 
they can have very low magnetostriction; the magnetostrictions of a-Fe crystallites and 
of the residual amorphous phase in the microstructure are of opposite sign and substan- 
tially cancel (Herzer [1991]). 

Nanocrystalline materials made by devitrification of metallic glasses may also be of 
interest for their mechanical properties. Aluminum-based alloys with rare earth and late 
transition metal additions up to a total of 10 to 15 at. % can either by direct quenching 



Fig. 43. Schematic time-temperature-transformation diagram to rationalize optimum conditions for producing 
Nd-Fe-B permanent magnets. (After Warlimont [1985]). 
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or by devitrification yield microstructures — nanometer scale — with exceptional 
strength, discussed further in §4.4.1. Consolidation of glassy or partially devitrified 
material may lead to a fully crystalline product, but one in which a fine-scale 
microstructure with desirable properties is retained. A particular property of interest is 
superplasticity. An alloy of the aluminum - rare earth - transition metal type has shown 
superplastic elongations of more than 500% at strain rates as high as 1/s (Higashi etal. 
[1992]). Even after consolidation and deformation, the mechanical properties (for 
example, a tensile strength ~ 900 MPa) are superior to those of conventional high- 
strength aluminum alloys. 

4.4. Plastic deformation and fracture 

The principal plastic characteristics of metallic glasses are, first, their great strength 
(the flow stress is typically as much as 1/50 of the shear modulus); second, their fracture 
toughness, which in the normal state is very much greater than that of comparably strong 
fibrous materials such as graphite; and third, the mechanisms of flow and fracture which 
are entirely distinct from those found in crystalline materials. 

Table 6 compares the flow stresses of a number of ductile metallic glasses, both on 
an absolute basis and normalized for density, with those of some other ultra-high-strength 
wires or fibres. Metallic glasses can be seen to be among the strongest practical materials 
known (the adjective is to distinguish them from single-crystal whiskers). 

Metallic glasses deform plastically by two entirely different regimes. Under tension, 
at high stresses near their fracture limit, they deform extremely inhomogeneously: shear 

Table 6 


Tensile properties of various continuous filaments and ribbons. 


Material 

Yield 

stress 

tr/GPa 

Relative 

density 

d 


Young’s modulus 


<r y (T/GPa 

E/GPa 

E (T'/GPa 

S-glass 

(Si0 2 -Al 2 0 3 -MgO) 

5.0 

2.5 

2.0 

85 

34 

C fibre (high-yield 

PAN type)’ 

3.2 

2.6 

1.7 

490 

153 

B filament (on W core)” 

2.5-4.5 

2.6 

1.0- 1.7 

380 

146 

SiC microcrystalline 

filament" 

3.5 

2.6 

1.4 

200 

77 

Kevlar fibre (organic 

polymer)" 

2.8 

1.5 

1.9 

135 

90 

High-C steel wire 

4.2 

7.9 

0.55 

210 

27 

Fe so B 20 met. glass" 

3.6 

7.4 

0.5 

170 

23 

Ti 50 Be 40 Zr 10 met. 

glass 

2.3 

4.1 

0.55 

105 

26 

TifljBe^Sij met. glass" 

2.5 

3.9 

0.65 

110 

28 

Cu 30 Zrj 0 met. glass" 

1.8 

7.3 

0.25 

85 

12 


" Indicates heat-resistant materials. 

" Indicates materials with some ductility. 






Ch. 19, §4 


Metastable states of alloys 


1797 


bands form on planes close to the orientation of maximum resolved shear stress. These 
are very thin (typically 10-20 nm thick according to Donovan and Stobbs [1981] and 
high-speed cinematography and acoustic emission indicate that the bands form extremely 
rapidly. Neuhauser [1978] found a formation time for individual bands in Pd 80 Si 2 o < 2 
ms.) At low stresses, there are no shear bands and deformation is by homogeneous creep. 
A schematic deformation map separating the two deformation regimes is shown in fig. 
44. The ductility in inhomogeneous deformation is usually only about 1% in tension, 
whereas many glasses can be rolled to reductions exceeding 50% if sufficiently hard and 
stiff miniature rolls are used. Creep rates in most creep experiments have been so low 
that total creep strains in reasonable times are less than 1%, but it is possible at 
temperatures close to T g to hot-press a metallic glass foil such as Fe 40 Ni 40 P 14 B 6 to strains 



Fig. 44. Deformation map for an idealized metallic glass. T s is the glass transition, T c the crystallization 
temperature, T, the liquidus temperature, t is the shear stress, fi the shear modulus, y is shear rate. (After 
Spaepen [1977]). 
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of the order of unity (Patterson etal. [1978]). This approach has been used in Japan 
(Masumoto [1994]) to shape minute but complex objects such as a micro-gearwheel 
from a metallic glass. 

The mode of ductile fracture of a metallic glass is unique. Fracture following 
inhomogeneous flow always takes place by separation along a previously active shear 
band — i.e., it is strictly a shear failure — and the fracture surface shows a “river 
pattern” of fine ridges which can be exactly mimicked by simultaneously shearing and 
pulling apart a pair of glass slides coated with vaseline. Accordingly, it has been 
proposed that the material in a shear band which is about to fail behaves in a liquid-like 
manner. Shear along one of the shear bands clearly generates additional free volume and 
this lowers the viscosity locally and can eventually lead to separation along the shear 
band. The creation of free volume can be deduced from the measured decrease 0.15%) 
in density after extensive inhomogeneous shear (Cahn et al. [1984]). The complex 
fracture mechanics of metallic glasses are reviewed by Kjmura and Masumoto [1983]. 
There is no record of the appearance of a fracture in creep; apparently glasses subject to 
slow homogeneous flow undergo perfect viscous flow with no structural deterioration that 
could lead to fracture. 

When a metallic glass is stressed at a high temperature, as remarked, it undergoes 
homogenous flow by a purely viscous process. As creep proceeds, free volume is 
progressively annealed out and since the viscosity is directly linked to free volume, the 
creep rate under constant stress progressively diminishes. The relation between viscosity, 
rj, stress, a, and strain rate, e, is: 17 =(1/3) a/ e. Instead of a constant-stress creep test, 
one can also measure stress relaxation at constant strain or perform a bend-stress 
relaxation experiment (a ribbon is elastically bent and held at a fixed radius while it is 
annealed, and then released to see how far it springs back). A brilliant series of 
experiments to study these processes, mostly with Pd 82 Si 18 glass, was performed by 
Spaepen and Taub and reviewed by them (Spaepen and Taub [1983]). 

According to the Stokes-Einstein relation, 17 =kT/Db, where D is a self-diffusivity 
and b is a characteristic distance of the same order as the interatomic distance 
(Nachtrieb [1976]). The rise in viscosity on relaxation due to reduced free volume 
should thus be associated with a fall in diffusivity. However, there have been increasing 
doubts as to the applicability of the Stokes-Einstein equation to metallic glasses, and a 
number of experiments claimed to show that the equation breaks down have recently 
been critically discussed by van den Beukel and Sietsma [1994], Van den Beukel has 
recently proposed that, for metallic glasses, 17 D 2 is constant and has advanced reasons for 
preferring this relationship (Duine et al. [1992, 1993]). 

The free volume model of flow and viscosity, originated by Cohen and Turnbull, can 
be outlined as follows: Each atom is confined in a cage defined by its nearest neighbors; 
that cage can vary in volume, and the free volume is that part of a cage in which the 
atom can move without an energy change. Normally the atom can only oscillate, but if 
there is a density fluctuation there is a computable probability (dependent of course on 
temperature) that the atom will jump to a neighboring cage if the fluctuation generates 
an instantaneous local free volume exceeding a critical amount. Such jumps define both 
viscosity and creep rate under stress. Relaxation diminishes the average free volume and 
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Fig. 45. Rate of change of viscosity of Fe 40 Ni 40 P 14 B 6 glass with time as a function of temperature, as measured 
by three different experimental techniques. (After Spaepen and Taub [1983]). 


thus increases the viscosity, linearly with time as it turns out. Figure 45 shows how the 
(linear) rate of increase of viscosity varies with temperature for a metal-metalloid glass. 

Attempts to interpret such relaxation-induced changes quantitatively are still beset by 
some difficulties, notably the need to assume that the concentration of flow defects 
(unlike free volume collectively) to some extent changes reversibly with temperature. A 
’’flow defect” here is a particular configuration of free volume of a kind which favors 
atomic jumps, as sketched above. (The notion that the reversible element in creep 
behavior is linked, as suggested for other aspects of relaxation, with changes in CSRO, 
does not seem to have been seriously analysed.) For details of the theoretical consider- 
ations involved, the reader is referred to Spaepen and Taub’s review, and to other 
detailed reviews by Chaudhari et al. [1983] and Spaepen et al. [1986]. There is a 
surprising measure of reversibility in the viscosity (a value characteristic of a given 
temperature T, can be attained directly by annealing a quenched sample at that tempera- 
ture or by annealing at 7j a sample previously equilibrated at a lower temperature 
(Volkert and Spaepen [1989])). It is undoubtedly difficult to define in an acceptable 
operational manner what is meant by a defect in an amorphous structure. Very recently, 
Spaepen [1994] has proposed regularities governing the evolution of a population of 
defects (density fluctuations) in a metallic glass of different sizes (singlets ... i.e., 
effectively a single missing atom . . . doublets, triplets and quadruplets ), with a range of 
shifts from one to another among these categories. The idea is that singlets govern 
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diffusion, doublets govern plastic flow, triplets can annihilate partially and quadruplets 
are large enough to collapse and disappear entirely. This model is consistent with the 
apparent inapplicability of the Stokes-Einstein relationship and the fact that different 
regularities describe diffusion and flow. 

The figures cited in table 6 indicate that there is scope for exploiting the very high 
strength of metallic glass ribbons or wires in composites (especially in view of their 
usually excellent corrosion resistance), but surprisingly not much has been done to pursue 
this. Hagiwara et al. [1985] shown that iron-based INROWASPed glass wires, 
strengthened by additions of niobium, tantalum, chromium or molybdenum, combine high 
yield strengths with particularly good fatigue strengths. At about the same time, Ohnaka 
[1985], the main originator of the 1NROWASP process, suggested the use of wires made 
by this process as motor tire reinforcement, and subsequently an American company 
showed that Fe-Si-B, with an appropriate coating, fulfil the requirements of tire 
reinforcement, which are: a high tensile strength, good adhesion to rubber, and excellent 
resistance to fatigue and corrosion (Ogino [1986]). A recent study (Vaidya and 
Subramanian [1990]) has shown the potential of metallic glass ribbons made by melt- 
spinning to reinforce glass-ceramics (i.e., devitrified oxide glasses). The melt-spun 
ribbons are assembled with the oxide glass precursor in powdered form; the assembly is 
sintered at 400-450°C, to full density, and sintering is followed by devitrification of the 
powder without devitrification of the metallic glass. Large increases of strength and 
toughness were achieved with as little as 1 vol% of reinforcing ribbons. 

Apart from the good strength of uncrystallized (undevitrified) metallic glasses, recent 
researches on aluminum-base glasses have revealed the remarkable properties that can be 
attained by partial devitrification. This topic is covered in the next section. 

4A.1 . Partially devitrified (nanostructured) metallic glasses 

As emphasized in the previous section, metallic glasses can have very high yield 
strengths, approaching the ideal strength (that is, the strength shown by a dislocation-free 
material). In steels, because of the microstructural changes associated with the ferrite- 
austenite transition, conventional materials can also reach a significant fraction of the 
ideal strength. However, for light alloys based on Al or Mg such a transition does not 
exist, and the hardening based on precipitates or dispersoids tends not to be so effective. 
For light alloys, therefore, the very high specific strengths (that is, strength/ density) of 
amorphous alloys are of particular interest. As outlined in §2.2, there are amorphous 
alloys based on aluminum with rare earth and transition metal additions and these alloys 
are ductile. Their flow strengths can be as high as 1250 MPa (Blank-BewersdOrff 
[1991]), about twice as high as the best precipitation-hardened conventional alloy, and 
immensely harder than the commercial purity element. 

Partial or complete devitrification usually embrittles a metallic glass. In some cases, 
however, the ductility can even be improved (Hillenbrand etal. [1982]). In the case of 
the Al-based glasses, partial devitrification leads to increased tensile strength — as much 
as a 50% increase on the corresponding amorphous alloy — while ductility is retained. 
The partially devitrified materials, introduced in § 4.3, have a fine dispersion of 3- to 
10-nm Al crystallites in an amorphous matrix. They are obtained by quenching at below 
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the critical rate for full glass formation or by annealing an initially fully glassy material. 
The devitrification can also be induced by mechanical working. These materials, 
developed in Japan (Kim etal. [1990]) and the United States (Chen etal. [1991]) show 
optimum tensile strength for a crystalline volume fraction of ~ 25%, and the values are 
quite exceptional — as high as 1560 MPa for Al 88 Ni 9 Ce 2 Fe, (Inoue etal. [1992]). The 
strengths of the devitrified materials are compared with other aluminum alloys, including 
amorphous alloys, in fig. 46. This shows that as the microstructural scale of aluminum 
alloys is decreased the strength increases. The reasons why a nanometer-scale dispersion 
of crystallites should give such a pronounced hardening effect are not yet clear. 
However, the aluminum crystals appear to be too small to contain any dislocations. 

4.4.2. Thermal embrittlement 

Many metallic glasses are capable of a moderate degree of deformation at room 
temperature: the process is heterogeneous and involves the formation of shear bands 


nm 


pm 


mm 
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Fig. 46. The tensile strengths of aluminum alloys compared as a function of microstructural scale: (a) 
commercial purity aluminum, (b) the strongest conventional precipitation-hardened aluminium alloys, (c) 
amorphous, or partly amorphous, aluminum-rare earth-transition metal alloys, consolidated and crystallized 
(Higashi etal. [1992], Ohtera etal. [1992]), and (d) amorphous aluminum-rare earth-transition metal alloys 
partially devitrified (either during quenching or by annealing) to give nanometer-scale crystallites (Chen et al 
[1991], Inoue etal. [1992], Kim etal. [1991]). The diameter of an aluminum atom, and the wide range of 
measured tensile strengths for fully amorphous Al-TM-Ln alloys are shown for comparison (TM = transition 
metal, Ln = lanthanide). 
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approximately along a plane of resolved shear stress. A ductile metal ribbon can 
generally be bent sharply back on itself without fracturing. Some glasses, mostly those 
based on iron (one might call these “glassy steels”) show (1) a ductile-brittle transition 
temperature, and (2) the phenomenon of thermal embrittlement , which is marked by an 
increase in the ductile-brittle transition temperature on annealing well below the 
crystallization temperature. Figure 47 exemplifies these phenomena. The mechanism of 
this embrittlement has given rise to much investigation and argument. 

Spaepen [1977] first put forward the idea that shear band formation (morphologically 
similar to slip bands in crystalline metals) is to be attributed to a multiplication of free 
volume at an incipient band, which reduces the flow stress locally so that the shear band 
develops preferentially. As mentioned above, Cahn etal. [1984] showed that the overall 
density reduction in a Pd-Cu-Si glass due to a 40% rolling reduction was 0.14%, which 
implied a much larger density reduction within the narrow shear bands themselves. This 
notion was further analyzed by Argon etal. [1985], and they concluded, from measured 
overall density changes in heavily deformed glasses (rolled or wire-drawn) and the 
observed shear-band geometry, that within a shear band itself the local dilatation due to 
free volume can be as large as 50%. Cahn et al. [1984] further showed that the excess 
free volume in the shear bands was dissipated during a relaxation anneal, while 
Krishnanand and Cahn (1975) had earlier shown that after such a relaxation anneal 
and renewed plastic deformation, the new population of shear bands was not in the same 
sites as the old, i.e., the old shear bands had been locally “healed” by the relaxation 
anneal. 

It can thus be taken as well established that free volume plays a central role in both 
homogeneous (creep) and heterogeneous plastic flow in metallic glasses. 

The theories to explain thermal embrittlement fall into two categories: (1) the 
hypothesis that annealing removes excess free volume and (2) the hypothesis that the 
homogeneous glass structure separates into two distinct amorphous phases. The basis of 
the first hypothesis is clear from the foregoing: further, Wu and Spaepen [1986] have 
assembled detailed evidence on thermal embrittlement consistent with this model. The 
second hypothesis leans on observations such as that due to Walter et al. [1976], who 
by Auger electron spectroscopy established an enrichment of phosphorus at fracture 
surfaces of a Fe 40 Ni 40 P 14 B 6 glass, and Piller and Haasen [1982] who used field ion 
microscopy to demonstrate the formation, during relaxation annealing of Fe^Ni^jg, of 
minute zones enriched in boron to 25 at.%. However, tests by X-ray small-angle 
scattering of some other glasses have failed to demonstrate phase separation of this kind, 
whereas other experiments have given contrary results. The literature is quite extensive. 

A possible marriage between the two types of hypothesis has been proposed in an 
interesting paper by Yavari [1986]. By analysing the volume per metal atom in Fe-B 
glasses of different compositions, he shows that a local enrichment of B will need to 
“suck” free volume from the adjacent matrix, reducing the free volume concentration 
there. He suggests, therefore, that the phase separation hypothesis resolves itself, 
indirectly, into the free volume hypothesis. According to Yavari, the fact that some 
additives (e.g., Ce) delay embrittlement to higher temperatures, whereas others such as 
Sb promote it, can also be interpreted on the basis of his model. — Another observation 
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Fig. 47. Thermal embrittlement: (a) Strain at fracture in a bending test as a function of temperature for a 
Fe^jBe^jSijoCoj glass, annealed for 2 h at 350°C after melt-spinning. T BD is the ductile-brittle transition 
temperature, (b) Change of ^BD with isochronal anneals at progressively increasing temperatures. (After 
Spaepen et al. [1986]). 
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(Yamasaki et al. [1985]) to the effect that Fe-based glasses whose compositions fall 
close to an equilibrium eutectic composition embrittle much more sluggishly, if at all, is 
attributed to a reduced tendency to phase separation for such compositions. 

The clearest demonstration to date of the central role of free volume in determining 
the incidence of thermal embrittlement comes from an important series of publications 
by Gerling et al. [1985, 1988, 1989, 1990] relating to Fe 40 Ni 40 B 20 glass. Briefly, they 
have established in circumstantial detail how this glass is embrittled by the loss of free 
volume on annealing: for instance, the embrittlement behavior varies with ribbon 
thickness, because the quenched-in free volume varies also. They have further shown that 
a glass which has been thermally embrittled can be reductilized by neutron irradiation, 
which creates fresh free volume. The restored ductility can in turn be removed by a 
second relaxation anneal. 

4.5. Other properties of metallic glasses 

Apart from their mechanical properties, some metallic glasses are beginning to be 
exploited for their magnetic, chemical and diffusion properties, and as precursors for 
crystallization (devitrification). Ferromagnetic (mostly iron-based) glasses, as used 
nowadays in transformers, are treated in chapter 29 and will not be discussed here. 
Precursors for devitrification are treated in §§4.3 and 4.4.1, and this includes a brief 
account of permanent magnets made by this route. There is space here only for a 
summary account of the other properties. 

Many metallic glasses have exceptionally good corrosion resistance, even in quite 
strong acids, and this is clearly due to their structural homogeneity, which obviates the 
creation of the electrolytic microcells that enhance corrosion of crystalline metals. For 
some uses, metallic glasses can be used “as is”: thus, Fe-Cr-P-C ribbons were used for 
the active element in electromagnetic filters to remove rust from water (Kawashima et 
al. [1985]); since the field is strongest at the edges of ribbons, narrow ribbons were used. 
Their high corrosion resistance ensured a long life for the filter elements. 

For other purposes, it is necessary to put a glassy coating on a crystalline metallic 
substrate; this can be done by either laser-glazing or sputter-deposition. Diegle and 
Hashimoto [1988] and Hashimoto [1993] have shown that deposits made by sputtering 
can provide quite remarkable corrosion protection: thus, a-Cu-Ta (anywhere from 20 to 
80 at% Ta) passivates spontaneously even in 12N HC1, and behaves much better than 
pure crystalline tantalum, itself notably resistant to acids. 

Hashimoto [1993] has also surveyed the use of metallic glass coatings in electro- 
catalytically active electrodes, for instance, for use in chlorine manufacture by electroly- 
sis. However, for lack of space the reader must be referred to Hashimoto’s excellent 
review. There has also been extensive research, published in the chemical literature, on 
the use of metallic glasses (in the devitrified condition) as catalysts for gas-phase 
reactions such as the hydrogenation of carbon monoxide: here again, the reader is 
referred to an excellent review by Molnar et al. [1989], as well as to Diegle and 
Hashimoto’s conference proceedings. 

Finally, metallic glasses have found an exotic use as diffusion barriers in microelec- 
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Fig. 48. Diffusivity of gold in amorphous and in polycrystalline Ni 5J Nb 43 . The righthand ordinate shows the 
diffusion distance in a period of one year. (Afityer Doyle et al. [1982]). 


tronic circuits, especially for those to be used at temperatures of several hundreds of 
degrees Celsius, for which they are superior to most crystalline alloys (Cahn [1986], de 
Reus [1995]). This works because in the temperature range of interest, atomic transport 
in polycrystalline metals is predominantly by grain-boundary diffusion, which is absent 
in glasses. Figure 48 shows this: the diffusion of gold in a-Ni 55 Nb 45 is seen to be seven 
orders of magnitude slower than in the same alloy after it has been devitrified (which 
however requires higher temperatures to do). Ta-Ir glass, which crystallizes at over 
900°C, has been effectively used to make diffusion barriers between silicon substrates 
and Y-Ba-Cu-O ceramic superconductors layers (De Reus et al. [1988]). 

5. Rapid-solidification-processed (RSP) and other metastable 
crystalline alloys 

5.1. Age-hardening of light alloys 

Before we treat the properties of RSP alloys, it is appropriate here to remind the 
reader that metastable crystalline states are by no means restricted to alloys which have 
undergone extreme cooling rates and often indeed, solid-state quenching is all that is 
required. The two such states that have been known and studied for longest are 
martensite in steels and the age-hardened state in light alloys, Al-Cu in particular: both 
these are products of solid-state quenching. Both were cited as archetypes in the first 
paragraph of this chapter. Martensite is discussed in chapters 16 and 17; here we devote 
some paragraphs to the key phenomenon of age-hardening, since the structural changes 
(as distinct from the hardening mechanism) are not treated elsewhere in Physical 
Metallurgy. 

Age-hardening was discovered accidentally in 1906 by a German engineer, Wilm, 
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who examined an aluminum alloy containing 3.5 wt% copper, 0.5 wt% magnesium and 
0.5 wt% manganese, and his findings were eventually published in 1911. His alloy, 
quenched from 500°C at which temperature it had been homogenised, hardened substan- 
tially over a period of days at room temperature. This process came to be called age- 
hardening. The original paper is not cited here, though it contains much of interest; it is 
too difficult to locate. Wilm’s paper (in translation) and extracts from many other of the 
early key papers have been reproduced in a most useful book about age-hardening by 
Martin [1968], which also includes an extended commentary about the meaning of the 
early discoveries. Wilm did not understand why his alloy hardened and this was left to 
three American metallurgists, Merica, Waltenburg and Scott, who in 1919 established the 
shape of the solvus curve for copper in aluminum, establishing that the solubility 
increased sharply with temperature, and they were the first to suggest that the precipita- 
tion of the equilibrium phase, CuA 1 2 , was the basis of the hardening, although it could 
not be seen in the microstructure. (In fact, the age-hardening of Wilm’s alloy, which he 
called “duralumin”, was later found to be due not only to the copper but also to the 
precipitation of a form of the phase Mg 2 Si, the silicon being an unintended impurity in 
the alloy). 

Understanding did not develop further until in 1938 two independent notes appeared 
side by side in the journal Nature, by Guinier [1938] and Preston [1938], soon 
followed by more substantial presentations by these same authors. They showed by Laue 
or oscillating-crystal x-ray diffraction photographs that age-hardening of Al-Cu alloys 
was associated with a pre-precipitation process: thin “zones” parallel to [100] became 
enriched in copper, without change as yet in crystal structure, and the lattice spacing was 
concurrently locally reduced because copper atoms are smaller than aluminum atoms. 
They also indicated that these zones (which later came to be called Guinier-Preston, or 
G-P, zones) developed into better defined intermediate crystal structures at a later stage 
of ageing. These later, distinct, phases in Al-Cu came to be called 8", 8' and 8 (all 
tetragonal), and all coherent with the matrix on [100]. 6 is in fact the equilibrium CuA 1 2 
phase; 8' and 8 are formed only when ageing is done above room temperature. The 
association of the successive stages of hardening with the development in the pre- 
precipitation structures was soon demonstrated, first by Gayler in 1940. Research on both 
the structural and the hardening stages proceeded intensively in the two following 
decades: an important synthesis was by Silcock et al. [1953-54], Soon after this, the 
age-hardening stages began to be examined by transmission electron micrography, and 
such matters as forms of heterogeneous nucleation of zones and transition phases were 
examined. A definitive review of what had been discovered about these stages came a 
few years later (Kelly and Nicholson [1963]). This review covered not only Al-Cu 
alloys but also others, such as Al-Ag in which the initial age-hardened zones are 
spherical and Al-Mg-Si alloys in which they are rod-shaped. After that, the fast pace of 
research in this field slackened, but x-ray studies, especially by small-angle scattering 
(see ch. 12, § 5.2) have continued. An up-to-date outline of age-hardening in light alloys 
(aluminum, magnesium and titanium base) has been published by Polmear [1995]. 

Figure 49 shows the metastable solubility curves for the various intermediate phases 
as deduced from the researches in the 1950s and 1960s. GP zones and 8" nucleate 
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Fig. 49. Solubilities of copper in aluminum in the presence of various, mostly metastable phases as a function 
of temperature. 


homogeneously, 6' forms preferentially at dislocations and the equilibrium phase, 0, at 
grain boundaries. The succession of stages follows Ostwald’s (and Turnbull’s) “rule of 
stages”: the zones or particles of each stage undergo reversion (i.e., re-solution) before 
the next phase is formed; the initial presumption that each stage formed by atomic 
rearrangement of the earlier form proved to be mistaken. Reversion was studied in detail 
by experiments in which an age-hardening treatment at one temperature was interrupted 
by a shift to a higher or lower temperature (e.g., KOster and Kam [1938]); the sudden 
shifts of hardness, later supplemented by structural studies, showed that structural 
reversion took place. 

The kinetics of aging were studied not only by x-ray diffraction and hardness 
measurements but also by resistivity measurements, conducted by Turnbull and his 
associates in the late 1950s on Al-Cu alloys. The. resistivity is closely linked to the 
process of formation of GP zones. These systematic studies were of great importance 
because they proved unambiguously that the rate of zone formation (or clustering, as they 
called it) at room temperature was orders of magnitude faster than would have been 
predicted by extrapolating measured diffusivities of copper in aluminum (Desorbo el al. 
[1958]), and the same thing was found for the age-hardening kinetics of Al-Ag (Turn- 
bull and Treaftis [1957]) and later of Al-Zn. The activation energy of the process 
clearly is that for vacancy migration, showing that there was a metastable excess of 
quenched-in vacancies which accounted for the rapid aging. The striking effect of these 
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Fig. 50. Fractional change in resistivity vs aging time at 0°C for an Al-2at% Cu alloy after the indicated 
quenching sequences. (Holding times were 1 h at 0°C, 2 min at 200°C). (After Turnbull et al. [I960]). 


excess vacancies emerges from fig. 50, taken from a later study by Turnbull et al. 
[I960]: here it is seen that a reversion treatment at 200°C, interspersed with aging at 
0°C, greatly slows down aging kinetics because such reversion quickly gets rid of a large 
fraction of the excess vacancies (the authors believed that this happens because the 
reversion treatment creates dislocation loops which act as efficient vacancy sinks). 
Another feature of these studies was a comparison of aging after quenching into different 
fluids: the faster the quench, the more rapid the ultimate aging kinetics. This series of 
studies constitutes one of the great classics of physical metallurgy. (See also ch. 7, § 8.1). 

Turnbull’s experiments stimulated, after a long interval, some very interesting 
investigations of concentrated Al-Mg alloys, by van Rooyen et al. [1988]. Using 
scanning differential calorimetry, these authors compared, using both melt-quenched and 
solid-quenched specimens, the kinetics of both the formation of GP zones at room 
temperature, and their subsequent redissolution during heating. (The melt-quenching took 
place by melt-spinning, producing 30 jj.m thick ribbons, the solid quenching was carried 
out by reusing these same ribbons). It was shown that in the solid-quenched specimens, 
both GP zone formation and GP zone redissolution were much faster than in the melt- 
quenched specimens. This was attributed to the greater efficacy of the solid-state quench 
in “freezing” in excess vacancies. That, in turn, is no doubt due to the greater effective- 
ness of a quench in water in maintaining a fast cooling rate down to quite modest 
temperatures, whereas melt-spinning leads to very fast solidification but slow cooling in 
contact with air after solidification is complete. This reinforces the message, conveyed 
in § 3.1, that a quenching method must take into account the measured cooling rate in the 
temperature range of interest, and is in harmony with the observation cited there, that in- 
rotating-water wire quenching is more effective in maintaining fast cooling in the solid 
state than is melt-spinning. 

In recent years, research has been focused on the Al-Zn alloys, mostly by means of 
the small-angle scattering of x-rays or neutrons (see also ch. 12, §5.2). These alloys 
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decompose by a spinodal mechanism (see ch. 15, §3.1) below a well defined spinodal 
temperature, though the similarity of aging kinetics above and below this temperature 
(Hennion etal. [1982]; Guyot and Simon [1982]) indicate that the spinodal singularity 
does not play any substantial part in the decomposition kinetics. Many other relevant 
references, as well as a detailed explanation of spinodal decomposition as observed in 
Al-Zn alloys, can be found in a book chapter by Binder [1991], 

5.2. General characteristics of rapid-solidification-processed (RSP) crystalline alloys 

The principal changes which can be brought about in crystalline alloys consequent on 
RSP have already been outlined in § 1.1. They include: 

(1) Extension of solid-state solubility; 

(2) refinement of grain size, with possible modification of grain shapes and textures; 

(3) reduction or elimination of microsegregation; 

(4) formation of metastable phases; 

(5) achievement of high point-defect concentrations. 

All of these have been surveyed in a general review by Honeycombe [1978], and 
various aspects, grain refinement and metastable phase formation in particular, are treated 
elsewhere in this chapter. 

We have already seen, in §2.1, how thermodynamic and kinetic considerations, the 
variation of T 0 with composition in particular, affect solute trapping — that is, metastable 
extension of solid solubility. Aluminum alloys have been far more studied than any 
others, especially those with transition-metal solutes, and fig. 51, taken from Midson and 
Jones [1982], summarizes available results. It is plain that the solubility enhancement 
factor is in general much larger for solutes with a small equilibrium solubility. (This is 
reminiscent of the inverse correlation between intensity of segregation at grain bound- 
aries and equilibrium solubility, see ch. 13, §4.2.) Midson and Jones analyze these 
results in terms of the morphological criterion and Jones has also sought to interpret the 
lattice parameters of some greatly supersaturated solid solutions in terms of rival physical 
models (Jones [1983]). An early but substantial collection of experimental data on 
extended solubility in many different solvents was assembled by Anantharaman and 
SURYANARAYANA [1971]. 

Wood and Honeycombe [1981] review the available information on extension of 
solubility in iron-base alloys. Their figures together with some additional ones assembled 
by Cantor [1982] are collected in table 7. In addition, C solubility in martensites is 
enhanced up to 2.0 wt% and in austenites up to 3.5 wt% before cementite is detected; 
but results are very variable and dependent on the measurement technique used. Boron, 
which is almost insoluble in equilibrium in pure a-iron 50 ppm) was forcibly 
dissolved by RSP up to 1 wt %; as with aluminum alloys, it appears that the least soluble 
elements have the highest enhancement factor. 

Extended metastable solid solubility of course leads to greater volume fractions of 
pre-precipitation structures (GP zones) and consequently enhanced hardening. Much of 
the work on this has been purely empirical and there is rather a dearth of basic studies 
of pre-precipitation and precipitation mechanisms in RSP alloys. The rejection of solute 
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Fig. 51. Solubility extensions of various solutes in aluminum. The black bars indicate the increase beyond 
equilibrium room-temperature solubilities achieved by rapid solidification processing. (After Midson and Jones 
[1982]). 

from the matrix and the identity and density of zones and phases need to be studied and 
compared with those for solid-quenched alloys of the same type. The rejection of solute 
from supersaturated RSP Al-12 at% Cu alloy has been studied by Krishnanand and 
Cahn [1976] and the decomposition products of RSP Al-17 at% Cu by Scott and 
Leake [1975]. The behavior of supersaturated RSP Al-8 wt % Fe alloys was examined 
by Jacobs etal [1974], Al-Fe-Cr alloys by Yearim and Shechtman [1982], while a 
range of supersaturated Al-Zr alloys were examined in detail by Sahin and Jones [1978] 


Table 7 


Extensions of solubility for substitutional solutes in rapidly solidified binary iron-base alloys. 


Solute 

Max. equil. 
solubility 
(at %) 

Extended 
solubility 
(at %) 

Cu 

7.2 

15.0 

Ga 

47.0 

50.0 

Ge 

20.0 

25.0 

Mo 

26.0 

40.6 

Rh 

50.0 


Sn 

9.8 

20.0 

Ti 

9.8 

16.0 

W 

13.0 

20.8, 18.0 
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and Hori etal. [1982], 

It is important to emphasize that to obtain the full benefits of improved mechanical 
properties from RSP of alloys, it is necessary to develop special solute-rich compositions 
and not to restrict oneself to standard compositions used for normal casting. 

One of the most common consequences of RSP is refinement of microstructure, 
including such features as dendrite arm spacings, eutectic lamellar spacings, particle 
dispersions and grain size. When there is solute partitioning, rapid solidification leads to 
shorter diffusion distances and thereby to a refined structure. Even without partitioning, 
however, for example in pure metals, there is refinement of grain size. Sizes in the range 
1 to 5 /urn are quite common, and in splat-quenched foils can be as small as 0.01 /un. 
The refinement of grain size by RSP may have a number of distinct origins (quite apart 
from refinement brought about by the addition of inoculant particles). The possible 
origins have been reviewed by Greer [1991]. As shown in fig. 34, solidification at high 
undercooling should favor homogeneous nucleation and yield finer grains. Boswell and 
Chadwick [1977] attempted thus to rationalize micrograin sizes in aluminum. In 
general, however, recalescence precludes grain refinement occurring in this way, except 
in glass-forming alloys near T g . After rapid quenching, for example by melt-spinning, the 
grains are not necessarily equiaxed. Melt-spun ribbons have a have a fine equiaxed chill 
zone at the surface next to the quenching wheel, but this is followed by columnar 
growth, with column width increasing through the thickness of the ribbon because of 
competitive growth of differently oriented grains. The initial grain size at the wheel-side 
of the ribbon is governed by the heterogeneous nucleation frequency and the crystal 
growth rate. A simple analysis based on the temperature dependence of these quantities 
suggests that the grain size should be inversely proportional to cooling rate, in good 
agreement with experiment (Greer [1991]). Rapid solidification also occurs in highly 
undercooled bulk melts, and it is known that grain-refined structures are obtained for 
undercoolings beyond a critical value (as seen, for example, in the work of Kattamis 
and Flemings [1970]) (see also ch. 8, § 11.2). There has been much investigation of this 
phenomenon, and it appears that it is not due to copious nucleation in the melt, but rather 
to a break-up of the initial dendrites (Schwarz etal. [1994]). 

Consequences of a fine grain size include substantial “Petch-strengthening” at ambient 
temperature (Duflos and Cantor [1978]) as well as an enhanced liability to super- 
plasticity in RSP alloys (e.g., Cheese and Cantor [1980]), in the sense that super- 
plasticity persists to higher strain rates. Quite generally, fine grains enhance hot- 
formability and thus RSP alloys are easier to shape as well as often having better 
mechanical properties after forming and heat-treatment, than slowly solidified analogues. 

Fine grains also have consequences for martensitic transformations in steels and for 
creep in superalloys; these are discussed below. 

Few studies have been reported of preferred orientations in as-cast structures 
produced by RSP. Blake et al. [1982] found remarkably perfect alignment of (0001) 
planes of Zn and Cd parallel to the surface of melt-spun ribbons; several other metals 
formed weaker textures of various types. The texture of high-purity (99.999%) zinc was 
found to deteriorate with time, and this was tentatively attributed to selective grain 
growth at ambient temperature. Kan et al. [1982] have melt-spun ribbons of highly 
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alloyed Fe-Si (6.5 wt% Si, unlike the usual 3 wt%) and examined the preferred orien- 
tation. Grains with (100) [0W] orientations grew preferentially to give a sharp planar 
texture, leading to much reduced alternating current losses. The texture-sharpening is no 
doubt an instance of surface-controlled grain growth (ch. 28, §4.4.2). 

The general technological implications of the combination of extended solubility, 
microcrystallinity and reduced segregation have been lucidly reviewed by Grant [1978]. 
KOster and SchOnemann [1993] provide a broad-based overview of phase transform- 
ations in rapidly solidified alloys. 

5.3. RSP light alloys 

A particularly important practical objective is the long-standing requirement of 
substantially increasing the elastic stiffness of aluminum alloys for airframe applications. 
Elastic moduli are not structure-sensitive and so this objective is best approached by 
introducing large amounts of stiff intermetallic compounds; initially, those based on 
manganese were preferred. When this was first attempted (Dudzinski etal. [1947-48]), 
the segregation associated with the slow freezing used led to coarse, uneven particle 
distributions and the alloys were hopelessly brittle. Once RSP methods had become 
established, numerous investigators recognized that this approach was the answer to the 
original difficulties. It should be noted here that while there is evidence that melt- 
quenching traps fewer excess vacancies than does solid-state quenching (§ 5.1), yet melt- 
quenching is more effective in permitting large supersaturations of solute; this is helped 
by fast cooling through the freezing temperature. 

One company began using a form of planar flow-casting to produce alloy ribbons 
which were then chopped into small fragments and compacted by hot extrusion or by one 
of several rapidly acting high-temperature processes, such as impact forging or explosive 
consolidation, which restrict or eliminate structural coarsening during compaction (Adam 
and Lewis [1985]). These authors, for instance, give elastic properties of hot-extruded 
Al-(8-12)Fe-2V alloys: the Young’s modulus for pure aluminum is 70 GPa, and the 
alloying enhanced this value by up to 37% without substantial embrittlement. The 
enhancement is due to a combination of enhanced solubility and fine dispersion of 
intermetallic compounds. Since the 1985 account, development of this kind of alloy has 
continued and further improvements of properties has been achieved (Das and Froes 
[1993]). Thus, silicon has been added to make Al-Fe-V-Si alloys, in which the chief 
dispersoid is Al n (Fe, V) 3 Si. Figure 52 shows the high-temperature strength of different 
members of this family, with different contents of this dispersoid. The top plot refers to 
an alloy with a stiffness almost 40% up from that of pure aluminum, though its fracture 
toughness is only moderate; the bottom curve refers to a somewhat weaker alloy which 
is highly formable; it has been drawn into wire. 

Enhanced stiffness is a special requirement of modem airframe aluminum alloys, but 
so is strength, notably at elevated temperatures, because of aerodynamic heating in 
flight). Figure 53, from Adam and Lewis’s 1985 review, shows the success which had 
already been achieved 10 years ago in attaining this objective. All these advanced alloys 
contain iron: ever since Thursfield and Stowell’s 1974 study, this has been the primary 
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Fig. 52. Tensile properties of RSP and consolidated AI-Fe-V-Si alloys as a function of temperature. (After Das 
and Froes [1993]). 

solute of choice (iroh has a small equilibrium solubility and a large enhancement of 
metastable solubility through RSP). In the figure, 70775-T6 and 2219-T851 are non-RSP 
high-grade alloys, and the “AFWAL program goal” was a military objective of that time. 
It can be seen that, even then, the new alloys came close to matching the state-of-the-art 
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Fig. 53. Elevated-temperature strengths of advanced aluminum alloys. (After Adam and Lewis [1985]). 


aeronautic titanium alloy, Ti-6A1-4V. A notable feature of such alloys, arising from their 
microstructural homogeneity, is their good corrosion resistance. 

For the range of alloys based on Al-Li, which has received intense attention in the 
past decade because of the prospect of much enhanced specific stiffness (lithium reduces 
density and enhances stiffness), RSP has been extensively explored since the early work 
of Sankaran and Grant [1980], and at present there is still much dispute as to the 
relative benefits of standard ingot processing and RSP. It has been shown that RSP 
allows larger lithium contents than does ingot metallurgy (5 wt% as opposed to 2.5) 
without causing embrittling coarse dispersoids (Meschter et al. [1988]). Also, RSP 
alloys show less anisotropy than do those made by ingot metallurgy. However, the 
economics of the process may rule it out. A recent overview of Al-Li-based alloys 
(Grimes [1990]) asserts that early worries about the adequacy of the ingot route and 
claims that powder metallurgy was essential, had largely vanished by 1990. 

Das and Froes’ overview of 1993 also covers the striking progress that has been 
made with RSP magnesium-base alloys: these alloys are always hovering on the verge 
of large-scale application and then do not make progress, largely because of corrosion 
problems. RSP alloys have improved corrosion properties as wfcll as improved strength, 
and their prospects may now improve. The overview also covers the considerable work 
done with RSP titanium-base alloys. The review must be consulted for further details. 

5.4. Steels 

The quite extensive early work on microstructure of RSP steels has been fully 
reviewed by Wood and Honeycombe [1981]. Extension of solid solubilities in iron was 
outlined in §5.2. The findings concerning the precipitation morphology of various 
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carbides, both in simplified “model” steels and in commercial compositions, do not lend 
themselves to summary and the review should be consulted. 

The principal driving force for research in this field has been the desire to improve 
the characteristics of tool (“high speed”) steels. Much of the research has centred on 
simplified model alloys such as Fe-W-C. The most striking finding (see for instance 
Sake and Honeycombe [1978] and Rayment and Cantor [1981]) is the fact that for 
the fastest quenches and for solute-rich steels, 5-iron is retained; it is only at intermediate 
quench rates that austenite is retained in such steels, and at conventional cooling rates, 
martensite is the major phase. The kinetics of hardening during heat-treatment are not 
much affected though the pattern of carbide precipitation is refined and, in particular, 
carbide segregation, a serious problem, is minimised (Kasak and Dulis [1978]). The 
peak hardness is however raised for optimum treatments (Rayment and Cantor [1978]). 
Practical developments based on RSP of tool steels are surveyed by Jones [1982], 

In scientific terms, one of the most striking sets of findings concerns the modification 
of the martensitic transformation by RSP. Inokuti and Cantor [1976] found a variation 
of microhardness values for a series of RSP Fe-Ni alloys, as shown in fig. 54a; values 
for the solid-quenched alloys are shown for comparison. This was traced to a very 
pronounced change in martensite start (M s ) temperatures on cooling (fig. 54b) This curve, 
and also the a -» y start temperatures on reheating, were obtained calorimetrically: the 
martensitic alloy was heated slowly in a DSC until it had all transformed back to y and 
then slowly cooled again; M s (y — » a) was marked by a pronounced exothermic peak, 
a — » y by an endotherm (Inokuti and Cantor [1982]). The striking feature was that the 
depressed M s caused by the original rapid quench from the melt does not disappear on 
repeated cycling between a and y; only a prolonged y anneal restores the alloy to normal 
behavior (labelled “solid quench” in fig. 54b). It is therefore not the rapid cooling per se 
which reduces the Ms, so it can only be the austenite structure resulting from that 
quench. In fact, various earlier studies not involving RSP have shown a relationship 
between austenite grain size, d, and M s (e.g., for Fe-32% Ni, AM S = 40-100 K for d 
reduced from 200 to 10 /im). In the RSP alloys, d is only 4-5 fim and this leads to a 
drop in M s of up to 200K for Fe-17% Ni. Thermal cycling, via the reverse shear 
transformation, restores the fine austenite grains, and only a long y anneal gives the y 
grains a chance to grow substantially. Such austenite grain growth seems often to be 
particularly sluggish in RSP steels (Suga etal. [1980]); a micrographic study suggested 
that stable, extremely fine sulphide dispersions at austenite grain boundaries in RSP 
steels are responsible for such retardation of grain growth. (Kelly and Vander Sande 
[1980]). A slight reduction in the cooling rate (substituting melt-spinning for piston-and- 
anvil quenching) generated massive transformation instead of martensite (Hayzelden 
and Cantor [1984-85]). A study of Fe-Ni-C alloys, with lower martensite start 
temperatures (Hayzelden and Cantor [1982], using a scanning calorimetry approach, 
similarly showed M s depressed by a very fine grain size generated by RSP. 

Figure 29a shows that pure iron can also form martensite. The structure, formation 
conditions and crystallography of this martensite were studied by Duflos and Cantor 
[1978]; hardness goes up with martensite/ ferrite ratio, and for 100% martensite, the 
microhardness of pure iron is 600-700! A Hall-Petch plot (ch. 17, §5.1) for pure iron is 
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Fig. 55. Microhardness vs (grain size )' 1/2 for pure iron. (After Duflos and Cantor [1978]). 

shown in fig. 55, taking the thickness of a martensite lath as “grain size”. It can be seen 
that the remarkably high hardness of martensitic iron can be largely, or even wholly, 
attributed to “grain-size strengthening”. Recently, the limit of applicability of the 
Hall-Petch straight-line plot for very small grain sizes, because of insufficient space for 
dislocation pileups, has been discussed by Smith et al. [1995]; clearly, a grain size of 
about 0.35 /tm is still within that limit. 

A study by Mawella et al. [1982] of a melt-spun ultra-strong steel indicated that the 
considerable strength increase generated by melt-spinning was primarily due to refine- 
ment of the martensite structure. It has also been established that high-carbon cast iron 
(2.4-3 wt% C) following RSP by centrifugal atomization consists largely of retained 
austenite. Subsequent annealing produces a microcrystalline (a + carbide) structure. Even 
at 650°C, this microcrystalline structure is so stable that the material behaves super- 
plastically (Eiselstein et al. [1983]). The conversion of brittle cast iron into a 
superplastic material is a particularly striking indicator of the changes in metallurgical 
characteristics obtainable by RSP. 


5.5. Superalloys 

Apart from tool steels and aluminum alloys, the other principal category of alloys to 
which RSP has been extensively applied is that of the superalloys, both those used for 
turbine discs and those destined for blades. The two Reston Conferences, in 1977 and 
1979, are the principal source of information up to now on these studies; some other 
publications are by Tracey and Cutler [1981] and by Smugeresky [1982], Jones 
[1982] also provides a lucid summary. 

Most investigators up to now have used atomized powders, especially those made by 
centrifugal atomization in helium. However, Duflos and Stohr [1982] have demon- 
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strated that melt-spinning can provide higher quench rates and thus comminuted melt- 
spun ribbon provides an alternative material for subsequent hot consolidation, most 
commonly done by hot isostatic pressing. The benefits of RSP include the possibility of 
forcing more solute into the y and y' phases, producing alloys fine-grained and homo- 
geneous enough to permit easy hot-working, even superplastic forming in extreme cases, 
and uniformity of structure throughout a large piece. The microstructural implications of 
a change from gas atomization to melt-spinning of superalloys have been examined in 
some detail by Liebermann etal. [1983], Shohoji etal. [1982] have also examined the 
structure of melt-spun superalloys and estimated cooling rates. 

Inoue etal. [1983b] melt-quenched a number of superalloys consisting of y' only. 
They found that these RSP alloys, instead of being almost perfectly ordered with large 
antiphase domains, were imperfectly ordered with APDs well below 0.1 mm in diameter, 
and were free of grain-boundary segregation. These structural modifications lead, in 
several alloys, to greatly enhanced ductility and some increase in strength. However, this 
much-cited research has recently been called into question, because the authors used 
melt-spun ribbons for microstructural examination and INROWASP-wires for mechanical 
testing, presuming that both forms had similar microstructures. Chiba and Hanada 
[1996] have now demonstrated that the wires had quite a different microstructure from 
the ribbons, apparently because of slower cooling through the freezing-temperature. 

There is, however, a major problem. The fine grain sizes produced by RSP, while 
they facilitate forming, leave the alloy very liable to creep at high temperatures but low 
stresses. This can be overcome by a process of directional recrystallization of the RSP 
alloy after it has been consolidated and formed: the process consists of a slow passage 
through a steep thermal gradient, to form large elongated grains rather like those 
generated by directional solidification. However, as Smugeresky [1982] indicates, the 
prolonged period spent at high temperatures during consolidation, forming and directional 
recrystallization may damage the desirable phase structure generated by RSP, and this 
problem is attracting further attention. The technique is currently applied primarily to 
oxide-dispersion-strengthened superalloys like MA 6000 (which contains 1.1 wt% Y 2 0 3 ). 
Such alloys cannot be turned into coarse or single crystals by directional solidification, 
and thus directional recrystallization is the solution of choice (Roberts [1993]). The 
mechanism of this process has been examined in detail by Baloch and Bhadeshia 
[1990], and computer simulation has recently cast further light on it (Godfrey and 
Martin [1995]). 
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Further reading 

Numerous international conferences have been devoted to the field of rapidly solidified alloys since 1970, 
recently at 3-yearly intervals; initially the conference title referred to “rapidly quenched metals”, latterly to 
“rapidly quenched and metastable materials". The proceedings of most of these are cited in the Bibliography, 
in relation to specific papers, as are other important proceedings such as those of the Reston Conferences on 
Rapid Solidification Processing and other ‘one-off’ meetings. Most of these will not be listed separately here, 
but it should be recognized that these proceedings are prime sources in this broad field. 

A few selected overviews are listed below. Others are recommended in the body of the chapter. 

T. R. Anantharaman and C. Suryanarayana, Review: A decade of quenching from the melt, J. Mater. Sci. 6 
(1971) 1111. 

R. W. Cahn, Metallic Glasses, Contemp. Phys. 21 (1980) 43. 

F. E. Luborsky, in: Ferromagnetic Materials, vol. 1, ed. P.E. Wohlfahrt (North-Holland, Amsterdam. 1980), ch. 
6, p. 451, (Detailed quantitative data on magnetic properties, densities, etc., of many metallic glasses are 
included). 

H. Herman, ed., Ultrarapid Quenching of Liquid Alloys, Treatise on Materials Science and Technology, vol. 
20 (Academic Press, New York, 1981). (The chapter by Li, on mechanical properties, tabulates T g , T x , 
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Crystal structure, see “Structure” 

Curie law, 2502 

Curie temperature, 2503, 2509 

Curie-Weiss law, 2504 
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Darken-Guny plot, 161 
Darken’s equations, 609 ff 
Daltonides, 206 
Dauphin^ twin in quartz, 868 
Debye- Waller factor, 1104 
-, “static”, 1133 

Decagonal symmetry, 378, 382 ff 
Defect structures, 186 ff 

Deformation 

- bands, 2427 ff, 243 1 ff (see also “Transition 

bands”) 

-, cyclic, 2336 ff 

- mechanism, in iron, 1584 ff 

- mechanism maps, 1268 ff, 1887, 1958 ff, 

2001, 2002, 2379 

- (of) polymers, 2692 ff 

- textures, evolution, 1943 ff 

— : Taylor model, 1943 
twinning, see “Twinning” 

Dendrite 

cell-to-dendrite transition, 748 ff 

- formation in solids, 1421 ff 
microsegregation around dendrites, see 

“segregation”, below 
primary arm spacing, 741 ff 
— , comparison with cell spacings, 745 
secondary arm spacing, 746 ff 

- segregation, 749 ff 

— and solid-state diffusion, 752 
-tip radius, 732 ff, 1429 ff 
Dendritic growth, 731 ff, 739 ff 
-, anisotropy, 737 ff, 742 
branches, 746, 1430 
-: coarsening, 746 ff 
computer modelling, 755 

- in eutectics, 765 
instability in solids, 1421 ff 

— : dendritic growth in solids and in 
liquids compared, 1425 
examination of nonmetals, 739 
interdendritic fluid flow, 789 ff 
-: theories, new, 755 

- velocity, 736 ff, 751, 813 

- with peritectic solidification, 776 ff 
Density of (electron energy) states, 66, 74 ff 

-; transition metals, 83 ff 
Diamond structure, 4, 6, 11, 25, 99, 283 ff 
-, hardening, 2038 ff 
-, nanocrystalline, 920 
-, the ultimate polymer, 2700 
Diatomic molecule 
-, heteronuclear, 61 
-, homonuclear, 61 


Differential scanning calorimetry, 1786 ff, 1790, 
2401 ff, 2722 ff 
Diffraction theory 

-dynamical, 1044 ff, 1082 ff 
-, kinematical, 1094 ff 

Diffuse scattering of radiation, 1 1 18 ff, 1 1 34 ff , 

1 1 39 ff, 1145, 1 148 ff 
Diffusion, 536 ff 

-: activation volume, 558 
-: amorphous alloys, 643 ff, 1731 ff, 1778 ff, 
1804 ff 

-, anomalous, 573 
-: Arrhenius behavior, 1661 ff 
-, chemical, 541 ff, 607 ff 
— in ternary alloys, 611 

- coefficient, see “Diffusion coefficient” 

-, collisional, see “Ion-beam mixing” 

-: complex mechanisms, see “Fast diffusion” 

-: concentrated alloys, 595 ff 
-, correlation effects, 548, 550 ff, 598, 621 
-: correlation factor, 543, 550, 591 
-creep, 1268 ff, 1988 ff 
critical slowing down, 610 
-; Darken’s equation s, 609 ff 

- dilute alloys, 542 ff, 582 ff 

— : diffusion in terms of jump frequencies, 
584 

— : linear response method, 586 ff 
— : solute diffusivity as a function of 
solute concentration, 588 
— : solute and solvent diffusivities, ratio, 
591 ff, 594 

— : standard model, 583 
-, dislocation-, 621 

divacancies, role of, 538, 579, 591 
-: Einstein relation, 546 
-: electromigration, 612ff, 616 ff 
-: exchange mechanism, 536 
-, extrinsic temperature region, 558 
-: (anomalously) fast diffusion, 593 ff, 1187, 
1746, 1807 ff 
-: Fick’s first law, 542 
-: Fick’s second law, 545 
— , limitations of, 552 
-, grain-boundary, 620, 623 ff 
— , atomic model, 624 ff 
— : effect of impurity segregation, 1255 ff 
— : role in diffusion creep, 1268 ff 
interstitial diffusion, 592 ff 
-: interstitial mechanism, 537 
— : dumbell mechanism, 592 ff 
— : Zener formula, 582 
-: intrinsic diffusion region, 558 
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irradiation-enhanced, 635 ff 
- isotope effect, 558 ff 
— , reversed, 594 
jump frequency, 547 ff 
Kirkendall effect, 608 ff, 1625 
macroscopic theory, 539 ff 
Manning’s random alloy model, 596 ff 
Matano plane, 546 
-, mechanisms, 536 ff 
mixed mechanisms, 538 
molecular dynamics approach, 560 
Monte Carlo method, 561, 597 
Nemst-Einstein relation, 550 
non-equilibrium defect concentrations, 
effect of, 633 ff 
numerical simulation, 559 ff 
ordered (long-range) alloys, 599 ff 
— , with B2 structure, 602 ff 
— , with LI 2 structure, 604 ff 
— , with other structures, 606 ff 
— : use of spectroscopic methods, 601 
— : six-jump cycle model, 601 
— , variation with temperature for CuZn, 
2079 

-, pipe-diffusion, 619 ff, 621 ff 
pressure effects, 558 
quenched-in vacancies, effect of, 633 ff 
-, radiation-enhanced, 638 ff 
-, random-walk theory, 546 ff 
-, self-, 572 ff 

— , prediction, 581 ff 
— , empirical relationships, 582 
-, short-circuit, 539, 619 ff 

— : short-circuit networks, 622 
short-range order, effect of, 598 
solute-vacancy binding energy, 592 
-, surface, 626 ff 

— : experimental results, 630 ff 
— : effect of contaminants, 632, 1254 
Thermomigration, 611 ff, 615 
vacancy mechanisms, 538 ff 
— , relaxation mechanism, 538 
— , theories, 554 ff 
— : vacancy aggregates, 538, 579 
— , vacancy concentration, 553 
— : vacancy jump frequencies, 589 ff 
— : vacancy wind term, 585, 589, 6 10 ff 
Varotsos formula, 582 
Zener formula, 582 
Diffusion coefficients 

activation energy, 557 
anelasticity approach, 566 ff 


-, anomalously high, 593 ff, 1187, 1 746, 

1807ff 

- at infinite dilution, 542 

-, chemical (interdiffusion), 544, 608 ff 
— : experimental methods for measuring, 
563 ff 

-, classification, 543ff 
Darken’s equations, 609 
empirical prediction methods, 582 
— : Keyes relation, 582 
— : Nachtrieb relation, 582 
— : Van Liempt relation, 573 
— : Varotsos formula, 582 
— : Zener formula, 582 
experimental methods, 562 ff 
frequency factor, 557 

- in dilute alloys, 542 ff, 582 ff 
-, interdiffusion, see “chemical” 

-, intrinsic, 540 

-, phenomenological, 540 

— , Onsager reciprocity relation, 540 
pressure variation, 558 
relaxation methods, 565 ff 
— , Gorsky effect, 568 
— , magnetic relaxation, 569 
— , Snoek relaxation, 567 
— , Zener relaxation, 567 ff 

- self-, 544, 

— , in iron, changes due to phase 
transformation, 1560 

— , studied by inelastic neutron scattering, 
1187 ff 

— , in pure metals, 572 ff 
— , tabulation, 575 ff 
-, solute, 544 

spectroscopic methods (NMR and 
Mossbauer), 570 ff 

— : quasielastic neutron scattering, 572 
-, surface, 629 

vacancy wind effect, 610 ff 
Diffusion creep (diffusional flow), see “Creep” 
Diffusion-induced grain-boundary migration, 

623 ff, 1461 ff, 2447 

-, attributed to elastic coherency stress, 1463 ff 
Diffusion-induced recrystallization, 1467 
Diffusional processes (in solid-state changes), 

1371 

Directional short-range ordering, 2535 ff, 2553 ff 
Discommensurations, 1550 
Dislocation(s), 1832 ff 

- activation volume, 2180 
“atmosphere” drag, 1867 ff 
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-: attractive junctions, 1863 

- (in) body-centred cubic metals, 1845 
bowout, 1842, 1855 ff, 1860 ff, 2114ff 
Burgers vector, 1832 

Burgers vector density, 2241 

- cell formation, 1978 ff, 2134 ff 

— : flow stress in relation to cell size, 
1839, 1922, 1930 

-climb, 1863 ff, 1866, 1960, 2040 ff, 2186 ff 
— : climb resistance, 2186 ff 
— : general climb model, 2187 

- core energy, 1 845 

- core structure, 1844ff, 2084 ff 

— , planar and nonplanar in intermetallics, 
2084 ff, 2089 
Cottrell atmosphere, 1867 

- created by moving grain boundaries, 897 
critical velocity in a solid solution, 1971 

- density 

— : changes during creep, 1985 
— (in) deformed iron, 1590 ff 
-- (in) metal-matrix composites, 2584 
— , relation to yield stress, 1925 

- diffusion, 

- dipole, 1848, 2307 

— : loop patches, 2306 
-, edge, 1 832 ff 

- elastic field, 1834 ff 

- etch pits, 1921, 1926, 

— , solid solutions, 2014 
fatigue structures, see “Fatigue” 

Fisher mechanism, 

Fleischer-Friedel mechanism, 1903, 2018, 
2187 

-, forest, 1862 

— : cutting, 1903 ff, 1926 ff 
— : flow stress in relation to forest 
dislocation density, 1 838 ff 
-, Frank partial, 1848 

friction stress, see “Solute drag” 

- (in) gallium arsenide, 1 849 ff 

-, geometrically necessary, 2124, 2358 
Granato-Lticke internal friction theory, 

1856 

-, image strain (stress), 1840, 1881, 2120 
initiation of precipitation, 889 
interaction between dislocations, 1837 
internal stresses (dynamic) 1984 ff 
-, intrinsic (in interface), 1527 

intrinsic resistance to motion, 1895 ff 
— : interplanar resistance, 1895, 1913, 1937 
— : intraplanar resistance, 1895, 1913, 1937 


-jog, 1853 ff 

— , extended, 1854 
— : production, 1904 
— : supetjog, 1854 
-jog drag, 1 865 ff 
-: kinks, 1844 

— : motion, 1854ff 

— types, 1844 

- line tension, 184 Iff 

— (of) a bowed segment, 1842 

- locking mechanisms, 2016ff 

— : chemical locking, 2016ff 
— : elastic locking, 2017 
— : electrostatic locking, 2017 
— : stress-induced order-locking, 2017ff 
— : superimposition of different locking 
and drag mechanisms, 2020ff 
-: Lomer-Cottrell barrier, 1847, 2015 

- loop analysis in the electron microscope, 

1064ff 

- loop formation, 1063, 2121 

- loop lattice, 893 ff 

- mechanics in relation to continuum 

mechanics, 1947 ff 

- mesh-length (link-length), 1839, 1923, 2417 

— : principle of similitude, 1923, 1928, 
1981 

— : relation to yield stress, 1923 
-, misfit, 2145 

-: (dislocation) microstructure, 1920 ff, 1972, 
1975 ff 

— , solid solutions, 2014ff 

- motion at high homologous temperatures, 

1 863 ff 

- motion at low homologous temperatures, 

1854 ff 

-: Mott-Labusch mechanism, 2018 
-node, 1834, 1839 
— , extended. 1848 

- (in) ordered phases, 1 850 ff 

— : core structure, 

— , slip systems, 1853 
— : superdislocation, 1 850 ff 
-: Orowan relation, 1869 
-: osmotic climb forces, 1 863 ff 
-, partial, 1846, 2081 

-: Peach-Koehler force, 1 836 ff, 1864ff, 1867 
-: Peierls barrier (stress), 1 843 ff 

- pile-up, 1858 ff, 2195, 2198 
-pinning, 1855 ff, 1897 ff, 2044ff 

— : direct observation, 2047 ff, 2189 

— in alloys, 1860 ff 
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— : particle bypassing, 2046, 2119 
— : particle shear, 2044, 2048, 2116, 

2194 ff, 2201 

— : (effect of) particle size, 1901 ff 
— : thermally activated penetration, 1893 ff 
plastic punching, 2594 
point forces acting on, 1855 

precipitate interaction, see “pinning” 
prismatic loop, 2121 ff 
Schwarz-Labusch mechanism, 2192 
-, screw, 1832 ff, 1845 
-, secondary, 

-(in) semiconductors, 1849 ff, 1855 
-, sessile, 1926 

— in Ni 3 Al, 2089 

-, Shockley partials, 1847, 2181 

short-range order destruction by dislocation 
motion, 2021 

slip systems, 1852 ff (sec afro “Slip") 

— in body-centred cubic crystals, 1852 

— in face-centred cubic crystals, 1852 

— in hexagonal close-packed crystals, 1852 
small-angle scattering, 1 178 ff 

-: solute drag and locking, 1 866 ff, 2016ff, 
2018 ff 

— : microcreep, 2023 
— : superimposition of different 
mechanisms, 2020 ff 
— : thermal activation, 2021 ff 
-sources, 1857 ff 

-: stacking-faults associated with, see 
“Stacking-faults" 

-: stair-rod partial, 1847 
stair-rod dipole, 1854 
-storage, 1920 ff 
-, super-, 1850 ff, 2056 ff, 2081 ff 

— , motion at high temperature, 2061 ff 

— (and) planar faults, 2081 ff 
-, superpartials, 2081 

-, surface, elastic field near, 1839 ff 
-: threshold stress for detaching a dislocation 
from a dispersoid, 2188 
-: Thompson tetrahedron, 1846 
- tilt boundary, 1078, 2413 
-: transmission electron microscopy, 1056 ff 
-: width of core, 1843 
Dispersed-phase alloys, 1 897 ff, 2106 ff 
-: coherency loss, 2144 

creep, 2134 ff, 2154 ff, 2183 
-: high-temperature behavior, 2133 ff 
-: internal stress, 2128, 2136, 2138, 2155 


- (with) large particles, 2124 ff 

— : misorientation of matrix near particles, 
2125 ff 

-, particle bypassing, 2119 
recovery, 2127, 2134 
recrystallization, 2158 ff (see also 
“Recrystallization”) 

-: subgrain formation at high temperatures, 
2134 ff 

-: tensile properties, 2111 ff 
-: threshold stress for detaching a dislocation 
from a dispersoid, 2188 
Dispersion strengthening, 1897 ff, 2106 ff 

- distinguished from precipitation hardening, 

1899 

Displacements, atomic, in crystals, 

-, thermal, 1102ff, 1133 
static, 1105, 1133 
Displacement cross-section, 1649 
Displacement spike, 1684 
Diplacement threshold energy, 1648 ff 
Di-vacancy, 1643 

Dodecahedral symmetry, 378, 391, 400 
Doolittle equation, 1732 
Dorn equation, 1964 
Double diffraction, 1038 ff, 1166 
Droplet emulsion technique, 693 ff, 698 ff 
DSC lattice, 847 

Ductile-brittle fracture transition, 1259ff, 2280 ff 
-: grain-size effect, 2281 
-: strain-rate effect, 2281 
-: transition temperature, 2280 
Dumbbell atoms, 1659 

Duplex structure, microstructural change in, 878 ff 
Duwez gun, 1748 

Dynamic recovery, 1924, 1929, 2003 
Easy glide, 2029 

Edge dislocations, see “Dislocations” 

Effective interplanar spacing, 1238 
Einstein relation (random walk theory), 546 
Elasticity and anelasticity, 1879 ff 

anelastic deformation, 1880 ff, 2132 
— : isomechanical scaling laws, 1999 ff 
-: elastic properties of metal-matrix 
composites, 2581 ff 

-: elastic strains developing during plastic 
deformation, 1923 ff 
rubberlike elasticity, 2735 ff 
Electrochemical effect, 147 
Electrochemical measurement of activity, 467 ff 
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Electrodeposition, study by scanning tunneling 
microscopy, 978 

Electromigration, 611 ff, 616 ff, 632, 886 
-, use for purification, 618 
Electron band formation, 63 ff 
Electron concentration, 107 ff, 147 ff, 325 ff 
Electron-beam microanalyser, 970 ff, 989 
Electron channelling patterns, 968 ff 
Electron energy bands, 50 ff 
Electronegativity difference, 108, 1 14 ff, 147, 161 
Electron energy loss spectroscopy, 1087, 1091 ff, 
1217 

Electron irradiation, 1648 ff 
Electrons-per-atom ratio, see “Electron 
concentration" 

Electron phases, 108 ff, 111, 166 ff, 225 
-, hexagonal, 170 
Electron probe techniques, 992 ff 
Electron theory of metals and alloys, 48 ff 
Electronic specific heat, 173 ff 
Elements 

-, crystal structure, 2, 12 ff 
Elinvar alloys, 2541 
Ellingham diagrams, 429 ff, 1294 ff 
Ellingham line, 430 ff 
Ellipsometry, 960 

Embedded atom method (EAM), 2247, 2256 
Embrittlement 

-, hydrogen, 2217 ff, 2282 ff 
-, liquid-metal, 1386, 2286 
Energy band 

volume dependence, 72 ff, 84 ff 
Energy-dispersive X-ray analysis, 970 ff 
Energy gap, 70 

Energy levels of atoms, see “Atomic energy 
levels” 

Enthalpy of formation, see “Heat of formation” 
Enthalpy, 416, 499 
Entropy, 415 

-catastrophes, 1731 
-, configurational, 436 
-: measurement, 419 ff 
ESCA, 989 

Eshelby’s model of misfit strain, 2581 ff 
Etching, metallographic, see “Metallography” 
Eutectic, see “Phase diagrams” and “Solidification” 
Eutectoid coarsening, 1458 ff 
Eutectoids, lamellar spacings in, 1 460 ff 
Eutectoidal decomposition, 1451 ff, 1468 ff 
Ewald sphere, 1 101 ff 
Extended X-ray absorption fine structure 
(EXAFS), 1183 


Fast diffusion, 593 ff, 1187, 1807ff 
Fatigue, 2294 ff 

- (in) age-hardened alloys, 2340, 2354 ff 
anisotropy factor, 2343 ff 

-: bicrystals, 2343 ff 

chemical environment, 2374 ff 
Coffin-Manson law, 2295, 2303 
copper-aluminum alloys, 2347 ff 
-: crack initiation 

— (in) ductile metals, 2362 ff 

— (at) grain boundaries, 2372 ff 
— , mechanisms, 2369 ff 

— : role of PSBs, 2363 
-: crack propagation, 2376 ff 

— : elasto-plastic fracture mechanics, 2378 
— : short crack growth, stage I, 238 1 ff 
— , stage II, 2385 ff 
-: crack-tip blunting, 2389 
-: cyclic (plastic) deformation 

— compared with monotonic deformation, 
2336 ff 

— (of) polycrystalline metals, 2338 ff 

-: cyclic hardening in fee metals, 2295, 2297, 
2300 ff 

— in bcc metals, 2333 ff 

-: cyclic softening, 2295, 2300 ff 
-: cyclic stress-strain curves (CSSC), 2295, 

— for single crystals (orientation 
dependence), 2309 ff 

-: defect structure studied by small-angle 
neutron scattering, 1181 
-: deformation mechanisms, 23 12 ff (see also 
“rapid hardening, models”) 

-: dislocation cell structure, 2324 ff 
-: dislocation dipoles, 2307 
-: dislocation patterning (structures), 2308, 
2361 

— (in) copper-aluminum alloys, 2350 ff 
— : loop patches, 2306, 2311, 2317, 2321 
— : low-energy dislocations (LEDs), 2327, 

2332 

— : maze structure, 233 1 ff 
— : transition from loop patches to PSBs, 
23 15 ff 

— walls (dipolar), 2320, 2333 
-: environmental effects, 2374 ff 

— , in vacuo, 2375 
-: extrusions and intrusions, 2363 ff 
— : formation mechanism, 2371 ff 

- failure boundary maps, 2379 

-: grain-boundary migration during high- 
temperature fatigue, 2447 ff 
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grain-size effects, 2340 ff 
hysteresis loops, 2314, 2348 
history of phenomenon, 2294 ff 

- life, 2303 ff 

- limit, 2294 

- (and) linear elastic fracture mechanics, 2296 
metal-matrix composites, 2606 ff 
non-linear (dislocation) dynamics, 2360 

- (of) oxide-dispersion-strengthened alloys, 

2189 ff 

Paris curve, 2296 
persistent LUders bands, 2347 
persistent slip bands (PSB), 2043 , 2295 , 
2313, 2316 ff, 2321 ff 
— , models of dislocation behavior in, 
2326 ff 

— , non-uniform strain in, 2322 
— , nucleated at (annealing) twins, 2317 
— : demonstration of strain concentration 
at, 2323 ff 

plateau stress, normalized, 2305 
— , models, 2329 
point-defect emission, 2331 
protrusions (bulging), 2324, 2364 
rapid hardening, 2304 
— : models, 2313 ff 
recovery, 2408 

saturation stress , 2305, 232 1 ff 
S-N curve, 2294, 2304 
slip irreversibility, 2370, 2376 
solid solutions, 2043 

-, stainless steel, 2354 ff 

steady-state (saturation) stress amplitude, 
2303 

strain bursts, 2312 ff 
strain localisation, 2304, 2321 ff 
striations (ductile), 2387 ff, 2390 
Taylor lattice, 2314ff, 2317 
testing methods, 2297 ff 
— : constant amplitude stress tests, 2297 ff 
— : constant plastic strain amplitude tests, 
2298 ff 

— : increasing stress amplitude tests, 2298 
— : results compared, 2339 
■ — : variable amplitude tests, 2299 ff 

- (and) texture, 2342 ff 
threshold for crack growth, 2381 

— , metal-matrix composites, 2606 
(annealing) twins, stress-concentrating 
effects, 2342 
Wohler machine, 2294 
Fermi energy, 66 


Fermi sphere, 65, 108 

-, distorted, 109, 153, 171, 176 
Fermi surface, 71 

- and charge-density waves, 1549 
Ferrimagnetism, 2503 

Ferrite, 1568, 1570ff 

- morphologies, 1571 

- solid-solution hardening (and softening), 

1593 

strength, 1589ff 

Ferromagnetism, 123ff {see also “Magnetism”) 

Fibers, polymer, 2700 ff 

Fibonacci sequence, 377 ff 

Fick’s first law, 542 

Fick’s second law, 545 

Fictive temperature, 2723 

Flory-Huggins equation, 2684 

Flow stress, see “Yield stress” 

Flux-line lattice 

neutron scattering, 1181 
Fractals, 866 
Fractography, 2213 ff 
Fracture, ( see also Cracks”) 
amorphous alloys, 1798 
-, brittle, in practical situations, 2275 ff 
Charpy test, 2280 
-, chemically enhanced, 227 1 
crack shielding, see “Cracks” 
critical Griffith stress, 2237 
-, ductile, 2220, 2277 ff 

— at interfaces, 2269 ff 
ductile-brittle transition, 1259ff, 2280 ff 

— : grain-size effect, 2281 
— : strain-rate effect, 2281 
— : transition temperature, 2280 
-: (and) grain-boundary impurities, 1259ff 
-: grain-size effects, 2277, 2281 
Griffith criterion, 2236 ff 
HRR crack-tip field, 2242 ff 
hole growth, 2278 ff 
hydrogen embrittlement, 2217 ff, 2282 ff 
-, ideally brittle, 2220 
intergranular, 2270 ff 
-: liquid-metal embritdement, 2286 

- mechanics approach, 2276 ff 

— in fatigue, 2378 

- (of) metal-matrix composites, 2596 ff, 

2604 ff 

- modes, 2212 ff, 2223 ff, 2380 
-, models, limitations, 2244 ff 
-: necking, 1949 ff 

R-curve, 2278 ff 
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stress intensity factor, 2222 ff 
summary of concepts, 2272 ff 
temper-brittleness, 1270 ff, 1281, 1612, 

2285 ff 

toughness concept, 2213, 2238 

- toughness parameters, 2236 

— of metal-matrix composites, 2604 ff 
transformation-toughening, 2286 ff 
-, work of, 1 259 ff 
Frank partial dislocation, 1848 
Frank-Kasper phases, 225, 237, 306 ff, 392 
Frank-Read dislocation source, 1 857 ff 
Frank-Van der Merwe model, 1222 
Free-electron approximation, 64 ff 
Free energy, 

-, Gibbs, 416 
-, Helmholtz, 416 

- of mixing (Gibbs), 436, 439, 475 

— , ideal, 445 
-, standard, 426 
Free volume, 1731 ff, 2699 
Freezing, see “Solidification” 

Frenkel defect (pair) 1648ff, 

- concentration, 1654 

effect on electrical resistivity, 1655 
-: formation enthalpy, 1666 

- production by irradiation, 1683 ff 
Friedel sampling length, 1900 
Friedel-Fleischer theory, 1903, 2018, 2187 
Fusion welding, 803 ff 

(jadolinium 

-: allotropy linked with magnetic changes, 33, 43 
Gallium 

- crystal structure, 22 ff, 34 

Gibbs adsorption isotherm, 453 ff, 458, 1205 ff, 
1249, 1252 

Gibbs-Duhem equation, 439 
Gibbs energy of fusion, 492 
Gibbs free energy, 416 
Gibbs phase rule, 450 
Gibbs-Thomson effect, 683, 733, 760 
-, for lamellae, 1453 
Gibbs-Thomson equation, 1423 
Gibbs- Wulff theorem, 1381 
Glass 

Doolittle equation, 1732 
-: free volume, 1731 
-, polymer, 2720 ff 
-: thermodynamics, 1734 

- transition, 649, 1729ff, 1733, 2720 ff 


Glassy reaction layers at interfaces, 863 ff 
Gold-silver alloys, 1 1 52 ff 
Gorsky effect, 568 
Grain aspect ratio, 2170 
Grain-boundary 

- allotriomorphs, 1571 

-: boundary periodicity, 849 
-: broken bond model, 850 ff 

- character distribution, 866 

- cohesion 

— : effect of solute segregation, 1 258 ff. 
1262, 1 270 ff 

-: coincidence models, 847 ff 
-: coincidence site lattice, 844 ff 
-: computer simulations, 858 

- design, 866, 1282 ff 

-: diffusion, see “Diffusion” 

-: DSC lattice, 847, 1870 ff 
-: dislocation model, 853 ff, 1869 ff 
-, doped, in nanocomposites, 925 ff 
-embrittlement, 1259, 1270 ff, 2270 ff 

- energy, 

— , in terms of bond density, 852 
— , in terms of dislocation models, 853 ff, 
1879 

- engineering, 2463 

- enrichment factor, see “Segregation” 

- fracture, 1259 ff 

- microchemistry, see “Segregation” 

- migration, 2440 ff 

— : acceleration by vacancies, 2450 ff 
— : ‘Beck approach’, 2442 
— , defects created by, 896 ff 
— , diffusion-induced, 623 ff 
— , impurity drag, 2440 ff, 2443 ff 
— : Kronberg-Wilson rotation, 2440 
— : low-angle boundaries, mobility, 2446 
— : misorientation effect, 2445 ff 
— : particle drag, 889 
— : (in) primary recrystallization, 2440 ff 
— : (effect of) recovery, 1588 
— : segregation effects, 1248 
— : special orientations, 2448 ff 
— : strain-induced migration, 2435 ff 

- models, limitations of, 856 ff 

- (in) nanociystalline materials, 911 ff 

— , atomic structure, 916 ff 
-: O-lattice theory, 846 

- pinning, 1009, 2159, 2467 ff 
-: planar structure factor, 85 1 
-: polyhedral unit models, 855 

-: quasiperiodicity in boundaries, 850 
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secondary dislocations, 1076 

- segregation, see “Segregation” 

-sliding, 1960, 1992 (see also “Creep”) 

— : during creep, 1993 ff 
— (of) individual grain boundaries, 1 995 
— : Lifshitz-type, 1992 
— : Rachinger-type, 1992 
Sigma (2) value, 845 ff, 2462 
small-angle, 2446 
-, special, 848 

structural unit models, 848 ff, 1077 
symmetry model (Pond), 1871 
tilt boundary, 1078, 2413 
transmission electron microscopy of, 1 075 ff 
-, vacancies in, 2450 ff 

- (as) vacancy sinks, 2632 
Grain growth, see “Recrystallization” 

Grain (orientation) clusters, 865 
Grain refinement, 8 1 0 ff , 1811 

critical supercooling, 8 1 1 
-, energy-induced, 8 14 ff 
inoculation methods, 812 ff 
Grain size 

- aspect ratio, 2170 

- (prior) austenite, 1604 ff 
creep rate, effect on, 1 99 1 ff 

-, determination, 1 006 ff 

- distribution, 1008 
effects in fatigue, 2339 ff 

-effects in nanocrystalline materials, 918 ff 

- in solidification, 700, 8 10 ff 

yield stress, effect on, see “Hall-Petch 
relationship” 

Granato-Liicke internal friction theory, 1856 ff 
Graphite structure, 288 
Grassfire transformation, 1015 
Grazing-incidence X-ray scattering, 858 ff 
Greninger-Troiano orientation relationship, 1514 
Griffith crack and criterion, 2236 ff 
Growth of precipitates, 1393 ff 

-.diffusion-controlled, 1402ff, 1404 ff 
-, dual martensitic and diffusive, in 
aluminum-silver alloys, 1407 
growth instabilities, 1421 ff 
— : absolute instability, 1424 
— : relative instability, 1424 
-, interface-controlled, 1402 ff 
interface velocity, 1399 

- involving long-range diffusion, 1400 ff 
-: kinetics, 14 1 5 ff 

-, (with) ledges, 1396, 1405 ff, 1409 ff 
— : computer simulation, 1410, 1412 ff 
— : ledge formation, 1415 


linear growth models, 1 427 ff 
— : needle-like crystals, 1427 ff 
-: massive phases, see “Massive 
transformation” 
metastable phases, 1398 
mixed control, 1402 
rates, 1415ff 
solute drag, 1396ff 

Growth of solid from liquid, see “Solidification” 

Growth steps, see “Growth of precipitates, (with) 
ledges” 

Guinier approximation, 1163ff 

Guinier-Preston zones, see “Pre-precipitation” 

Hafnium 

-: polymorphism, 20, 24 

Hagg phases, 225 

Hall-Petch relationship, 1008 ff, 1589 ff, 1605, 
1811, 18 15 ff, 1859, 2168 ff 

- and fracture, 2277 

Hamiltonian, 59 

Hardening (see also “Yield stress”) 

-: diamond structure, 2038 ff 
-, fee solid solutions, 2011 ff 
-, forest, 2133 
-, latent, 2133 
-, magnetic, 

NaCl structure, 2038 ff 
-, order-, 2055 ff, 2192 

— : maximum at intermediate order, 2060 
— : quench effects, 2062 
— : temperature effects, 2063 ff 
— : theory, 2059 ff, 2195 ff 
-, precipitation-, 2043 ff, 2106 ff, 2141 ff 
— : A1 alloys, 642, 1 805 ff. 2049 ff 
— , classification, 2141, 2192 
— : deformation modes, 2147 
— : dislocation pinning, 1861, 1897 ff 
— : hardening mechanisms, 2 1 47 ff 
— (under) high stress, 2144ff 
— , iron-carbon alloys, 2052 ff 
— , kinetics, 634 
— : reversion, 1807 

- (see also “Pre-precipitation” and 

“Superalloys”) 

-. quench-, 2062 ff 

- (due to) rapid solidification, iron, 1594 
-: short-range order, 2017, 2021, 2061 ff 
-, solid-solution, 1593 ff, 201 Iff 

— , bee solid solutions, 2034 ff 
— , fee solid solutions, 2023 ff, 2143 
— , hep solid solutions, 2032 ff 
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— : plateau hardening, 2024 ff 
— : stress equivalence, 2022 
— , theory, 2016 ff (see also “Dislocations, 
locking mechanisms”) 

Heat capacity, 417 ff 

Heat of formation, simple metal phases, 141 
Heat transfer in solidification, 670 ff 
Helmholtz free energy, 416 
Henry’s law, 442 

Herring-Nabarro-Coble creep, 1988 ff 
Heterogeneous nucleation, see “Nucleation” 
Heusler alloys, 194, 272 
High-resolution electron microscopy, 1035, 

1079 ff, 1110, 1112 

- applied to amorphous alloys, 1777 

- applied to grain interfaces, 858 ff 
image reconstruction, 1084 
optical transfer function, 1081 ff 
(of ) quasicrystals, 372, 389, 399 
Scherzer focus, 1083 

weak-phase object approximation, 1083 ff 
High-strength low-alloy steels, see “Steels” 

Holes, octahedral and tetrahedral, 277 ff 
H5nl correction, 1121 
Homeotect structures, see “Polytypism” 
Homogeneous equivalent medium, 2182 
Homogeneous nucleation, see “Nucleation” 

Hot isostatic pressing, 2579, 2644 

- maps, 2647 ff 

sensors for measuring compact dimensions 
in situ, 2649 

technological considerations, 2648 ff 
Hot pressing, 2644 ff 

— : densification models, 2645 ff 
— : densification stages, 2645 
Hot-salt corrosion, 1 3 17 ff 
Huang scattering, see “X-ray scattering" 
Hume-Rothery phases, see “Electron phases” 
Hume-Rothery rules, 142 ff 

- and strain in solid solutions, 162 
Hydrogen 

-, atomic energy levels, 53 

- diffusion, 593, 1187 

- embrittlement, 2217 

-, migration in stress gradients, 895 

heats of solution in metals, calculation, 118 

- in iron, 1253, 1279, 1615 

- in niobium, 1384 

- solubility in Laves phases, 177 
Hydrogen embrittlement, 2217ff, 2282 ff 
Hypercooling, 1756 


Image analysis, see “Quantitative metallography” 
Incommensurate phases, 1549 
Incommensurate-to-commensurate transformations, 
1550 ff 

Inelastic scattering, 1 126 ff 
Icosahedral symmetry, 378, 384, 391 ff, 396 ff 
hypercubic phases, 395 ff 
Ingot structure, 781 ff 
chill zone, 78 1 ff 
columnar zone, 782 ff 
columnar to equiaxed transition, 786 ff 
computer modelling, 783 ff 
equiaxed zone, 785 ff 
inclusions, 794 ff 
lnoculants, 8 12 ff 

Interatomic pair potential, 95 ff, 98, 121 
Interface 

-, adsorption at, 1203, (see also “Segregation”) 
— , thermodynamics, 1205 ff 
-, coherent, 1396, 2107ff 
-cohesion, 1258 ff, 1262, 1270ff 
coincidence model, 844, 847 

- -controlled growth of precipitates, 1402 
-, curved, 458 ff 

-, diffuse, 707 

- energy, 850 ff, 1210ff, 1395 

— as affected by segregation, 1249 ff 

- enrichment factor (ratio), 1209, 

- (and) fracture, 2269 ff 
Frank-Van der Merwe model, 1222 

-, glissile, 1524 

— : conservative motion, 1526 
heterophase, see “interphase” 

-, incoherent, 2108 

- instability in solid-solid transformations, 

1421 ff 

-, interphase, 859 ff, 1078 ff 

- kinetics, 700 ff 

-, ledged, 1405 ff, 1409 ff 
-, martensite-parent, 1524 ff 

- microchemistry, 1202 ff (see also 

“Segregation”) 

— and materials design, 1280 ff 

— : methods of measurement, 1209 ff 
moving, causing transformation, 1451 ff 
-: segregation, see “Segregation” 

-, semi-coherent, 864, 1379, 1524 ff, 2108 
-, solid-liquid, see “Solidification, liquid-solid 
interface” 

-: thermodynamics, 1205 ff, 1228 

- transmission electron microscopy, 1075 ff 
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Interfacial process (in solid-state changes), 1371 
Interference-layer contrast, 957 ff 
Interferometry in optical microscopy, 960 
Intermediate phases, 166 
-, homogeneity range, 490 
-, solid solubility in, 137 ff, 151, 166, 178 , 
490 

Intermetallic compounds 

binary, electron per atom ratio, 107 
binary, relative size fac tor, 107 
binary, stability of structure, 102 ff 
commonest structure types, 323 
-, congruently (or incongruently) melting, 491 
coordination number, 23 1 
— , ratios, 228 
coordination polyhedra, 229 
— , as building blocks, 238 ff 
crystal-chemical relationships, 263 
crystal structures, 206 ff, 2141 ff 
— , data bases and books, 264 ff 
— , representation, 214 ff 
— : statistical distribution of types, 315 ff 
cubic structure types, 343 
definition, 206 

derivative and degenerate structures, 247 
“gazetteer” of structures, 355 ff 
-t Gibbs energy of formation, 492 
-, ideal and approximate formulae, 211 
-, identifying symbols, 209 ff 
- in phase diagrams, 489 ff 
interstitial structures, 249 ff 
isotypic and isopointal, 221 
lattice complexes, 217 ff 
Laves’s stability principles, 326 ff 
layer stacking sequence, 23 1 ff, 246 
mechanical properties, 2076 ff 
-, non-stoichiometric, 501 
-, order in, 193 ff, 248 
oxidation, 1309 

recombination structures, 260 ff 
reduced strain parameter, 334 ff 
-, site occupation formulae, 213 
-, solid solubility in, see “Intermediate phases” 
-, space-filling principle, 326 ff, 331 ff 
-, stability, 317 ff 
-, stacking symbols, 233 ff 
-, stoichiometric ratios, 317 

strength as function of homologous 
temperature, 2077 

-, structural notations, alternative, 241 ff 
structure families, 247 ff, 265 ff 
structure prediction, 345 ff 


-, structure types, 220 ff 

— : atomic-environment classification, 
342 ff 

— , systematic description, 264, 267 ff 
superstructures (superlattices), 248 ff 
-, ternary, 507 

— : structure distribution, 321 
-, type names, 224 ff 
vacant sites, ordered, 248 
Internal friction, 

Bordoni peak, 1857 
Granato-Liicke theory, 1856 
Niblett-Wilks peak, 1857 
Internal oxidation, 2108ff 
Internal stresses 

dispersed-phase alloys, 2128 ff 

- during creep, 1984 ff 
Interphase boundaries, 859 ff 

chemistry, 862 ff 
crystallographic structure, 864 

- with reaction (intermediate) layers, 862 ff 
Interstitial (self-) 

- agglomeration, 1673 ff 

— : cluster size, 1674ff 
-configuration, 1 663 ff 

- created by dislocation intersection, 1904 

- diffusion, one-dimensional, 1706 

- dumbbell configuration, 1140, 1659ff, 

1673 ff 

-: dynamic properties, 1658 ff, 1 672 ff 

- entbalpy of formation, 1656ff, 1665ff 
-: enthalpy of migration, 1666ff 

-: experimental approach, 1663ff 

- -free steels, 1594ff 

-: ion-channeling method, 1680 

- lattice, 894 

mechanical relaxation method, 1681 

- mechanism of diffusion, 594 ff 
-: Mossbauer effect, 1681 

-, multiple, 1662ff 
-: phonon coupling, 1184 

- position, see 'Interstitial position” 
-production, 1647ff 

- properties 

— , calculation, 1 654 ff, 1657 

- relaxation volume, 1663 

- saddle-point configurations, 1656ff 

- solid solutions, 139 

— , thermodynamic analysis, 501 

- -solute interaction, 1676ff 
-, split, 1659ff 

-, trapping by solutes, 1678ff 
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- -vacancy interaction, 1651 ff 

— : close pairs, 1653 
— : spontaneous recombination, 1651 ff 

- X-ray scattering from, 1136 
Interstitial positions 

body-centred cubic structure, 1562 
Invar alloys, 2540 ff 
Inverse melting, 1734 ff 
Ion-beam mixing, 637 
Ionic bond, 61 
Ionicity, degree of, 61, 137 
Ion microprobe analysis, see “Secondary-ion 
microscopy" 

Ion-probe techniques, 989 ff 
Ion-scattering spectroscopy, 1214 ff 
Iron, 1556 ff 

- allotropy, 20 ff, 30 ff, 1416 

— : effect of substitutional solutes, 1566 ff 
— : property changes at phase change, 

1560 

— : thermodynamics, 1558 
— : role of entropy of demagnetization, 
1558 ff 

-carbides, 1563 ff 

— precipitate microstructure, 2053 

- -carbon phase diagram, 771, 1565 

- -carbon solid solution, 1561 ff, 2035 

— : precipitation hardening, 2052 ff 
— : discontinuous yield, 2052 ff 

- -chromium-cobalt permanent magnet alloys, 

2517 ff 

-, cleavability, 2217 

diffusion rates of interstitial and 
substitutional solutes compared, 1563 

- dislocation density in deformed iron, in 

relation to flow stress, 1590 ff 
fatigue behavior, 2334 ff, 2382 
-: interstitial alloys, 1561 ff, 

— : fatigue, 2334 ff 
— : flow stress, 2035, 2037 
-: interstitial plus substitutional alloys, 1568 ff 
-nitrides, 1563 ff, 1571 

- -nitrogen solid solution, 1561 ff, 2037 
-: octahedral and tetrahedral voids, 1562 

-, phase transition, 1416 {see also “allotropy”) 
-, phosphorus in, 1208, 1214, 1215, 1237, 
1272, 1582, 2271 

-: properties of pure element, 1 557 ff 
solubility of elements in, 1556, 1563 
-: strength of ferrite, 1589 ff 
-: substitutional alloys, 1566 ff 

— : effect on form of gamma-field, 1566 ff 


-, sulphur in, 1224, 1582 
-: vacancies in ar-iron, 1558 
-: yield stress, in dependence on temperature 
and grain size, 1583 ff 
Iron aluminides, 2078 

- as soft magnetic materials, 2533 ff 
Iron-chromium alloys, 2035 
Iron-oxygen-sulphur system, 1312 ff, 1316 
Iron-silicon steels, see “Silicon steels” 

Irradiation (effects) 

-: amorphization, 1758 ff 

- (in) amorphous alloys, 1804 
atom redistribution, 640 ff, 1708 

-: biased point-defect annihilation, 1697 
cavities, electron microscopy of, 1066 
-: defect clusters, 1689 ff 
dislocation wall lattice, 1703 

- effects, miscellaneous, 1 682 ff 

— : collision (displacement) cascade, 

1684 ff 

— : displacement spike, 1684 
— : intracascade defect reactions, 1688 ff 
-: electron, see “Electron irradiation” 

-, fast heavy-ion, 1 690 ff 

- -enhanced diffusion, 635 ff 

- -induced creep, 1700 ff 

- -induced Guinier-Preston zones, 1709 

- -induced phase transformation, 643, 1709 

- -induced precipitation, 640 ff 

- -induced segregation, 640 ff, 1708 
loss of order, 1687 

-: swelling, 1695 ff 
— , reduction, 1698 ff 
void formation, 1695 ff {see also “cavities”) 
void rearrangement, 1706 ff 
-: void lattice, 1701 ff 

Janecke coordinates, 518 
Jellium, 861 
Jogs, 1853 ff, 1904 

Johnson-Mehl-Avrami-[Kolmogorov] (JMA[K]) 
kinetic equation, 1435 ff, 1788, 2421, 2674 
-: relation to soft impingement, 1435 ff 
necessity for a spatially uniform driving 
force, 1436 
Jominy test, 1579 

Jones theory of solid solubility, 1 5 1 ff 

ICagomd net, 234, 246 
Kauzmann paradox, 1731 
Kerr effect, 957 
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Kikuchi lines, 969, 1040 
Kinematical diffraction theory, 1094 ff, 1117 ff. 
Kinking, 1912 ff 
Kirkendall effect, 608 ff, 1625 
-, inverse, 1709 
Kossel patterns, 969 
Kronberg-Wilson rotation, 2440 
Kurdjumov-Sachs orientation relationship, 1 57 1 

Labusch’s theory of hardening in solid solutions, 
2019 ff 

Langevin law, 2502 

Langmuir adsorption isotherm, 456 ff, 1252 
Langmuir-McLean theory, 1 2 1 9 ff 
Lanthanides 

- crystal structures, 28 ff, 39 ff, 100 

-, dependence of properties on atomic number, 
351 ff 

Laplace equation, 2631 
Laser surface treatment, 1760 ff 
Lattice complex concept, 217 ff 
Lattice strain in solid solutions, 161 ff 
Lattice spacing, 

- in primary solid solutions, 180 ff 

- in ternary alloys, 181 ff 

Lattice stability, see "Structure stability” 

Laves phases, 176 ff» 3 10 ff 
heats of formation, 117 

Lead 

unusually large atomic radius, 25 
Ledges, see “Growth of precipitates” 

Lever rule, 473, 506, 715 
Liquid-solid interface, see “Solidification” 

Line compounds, 206 
Lifshitz-Slyozov- Wagner theory, 873 
Liquid crystals, 2680 
Liquids 

-, fragile and strong, 1733 
specific heats, 1733 
Liquid simple-metal alloys 
heats of formation, 116 
Liquid-metal embrittlement, 1386, 2286 
Liquid-solution calorimetry, 2402 
Liquidus, 472 

Local density functional, 50, 90, 101 
Long-period superlattices, 195 ff, 894, 1544 ff 
Lomer-Cottrell barrier, 847, 2015 
LUders bands, 1586, 2023 
-, persistent, 2347 

- (in) polymers, 2695 ff 


IVIackay icosahedron (cluster), 395, 406 

Macrosegregation, 789ff 

Magnesium 

- -aluminum alloys, 1457 

- -cadmium alloys, 2032 ff 

- crystal structure, 16 

-, solid solutions based on, 183 

- -zinc alloys, 2033 
Magnetic 

- aftereffect, 2507 

- anisotropy, 2505, 2509, 2512 

— , amorphous alloys, 2551 ff 
— , directional-ordering, 2535ff, 2553 ff 
— : shape anisotropy, 2512 
— , slip-induced, 2535 
— , thermomagnetic, 2535 

- annealing, 2535 

- coercivity, 2507 

— in relation to microstructure, 2513ff, 
2521 

curling, 2513 

-: defects and domain- wall pinning, 2514 

- domain wall(s) 

— pinning, 2514 

— thickness, 2512 

- domain(s), 2505 ff 

--, nucleation and growth, 2512, 2514 

— reversal, 2510 ff, 2520 

— reversal in relation to microstructure, 
2513 ff 

— rotation, 2510 ff 

- force microscopy, 976 

- 'hardening’ in relation to mechanical 

hardening, 2514 

- materials, see “Magnetic materials” 

-: maximum energy product, 2507 

- measurements, 2507 ff 

— : Hall-effect probe, 2508 

- permeability, 2506, 2527 

- properties of materials, 2501 ff 

— , fundamental, 2502 ff 

- relaxation, 2556 

- scattering of neutrons, 1123 ff 

- structure factor, 1124 

- susceptibility, 2502, 2506 
Magnetic materials, 2501 ff 

-, amorphous, 2543 ff 
— : anisotropy, 2551 ff, 

— , anisotropy, induced, 2535 ff, 2553 ff 
(see also “Directional short-range 
ordering”) 
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— : core loss, 2557 
— : Curie temperature, 2546 ff 
— : low-field properties, 2555 ff 
— : magnetostriction, 2553 , 2555 
— : preparation, 1748 ff, 2544 ff 
— : saturation magnetization, 2546 ff 
— : temperature dependence of 
magnetization, 2549 ff 
-, permanent, 251 Off 

— , cobalt-platinum, 2523 ff 
— , cobalt-rare earth, 2519 ff 
— : crystal-anisotropy materials, 25 1 9 ff 
— : effect of plastic deformation, 2518 
— : electrodeposited rod-shaped materials, 
2516 

— : hard ferrites, 2522 ff 
— , iron-rare earth, 2521 ff 
— : list of properties, 25 1 1 
— : manganese-aluminum-carbon (non- 
magnetic constituents), 2523 
— : shape-anisotropy materials, 2525 ff 
— : spinodal alloys, 2516ff 
— , two-phase (ferromagnetic plus 
paramagnetic), 2517 

- (for) recording heads, 2543 
-, soft, 2524 ff 

— : high-permeability alloys (permalloy, 
supermalloy), 2536 
— : invar alloys, 2540 ff 
— : iron-aluminum-(silicon) alloys, 

2533 ff 

— : iron and low-carbon steels, 2525 ff 
— : iron-cobalt alloys, 2541 ff 
— : iron-silicon alloys, see “Silicon steels” 
— : nanocrystalline alloys, 2542 ff 
— : nickel-iron alloys, 2534 ff 
— : square-loop alloys, 2539 
Magnetic measurements in metallurgy, 2558 ff 
-: hysteresis loop, applications, 2559 
-: thermomagnetic analysis, 2558 
-: magnetic anisotropy, 2559 
Magnetically modulated structures, 260 
Magnetism 

- and lattice parameters, 184 ff 
-: core loss, 2510, 2528, 2557 
-: demagnetizing field, 2509 

-: diamagnetism, 2502 
-: directional short-range ordering, 2535 ff, 
2553 ff 

-: eddy-current loss, 2507 
-: exchange for ces , 2503 
-: hysteresis curve, 2507 


-: residual magnetization, 2507 
-: saturation magnetization, 2507, 2546ff 
-: skewed-loop alloys, 2540 
-: square-loop alloys, 2539 
-: superparamagnetism, 2513 
Magnetocrystalline anisotropy energy, 2505 
-: anisotropy constants, 2509 
Magnetometer, vibrating-sample, 2508 
Magnetostriction, 2505, 2510, 2553, 2555 
Manganese 

- -aluminum-carbon magnetic alloys, 2523 
-: crystal structures, 20, 27 ff 

Maraging steels, 1 607 ff 
Martensite 

- aging, 1580ff 

-, crystal structure, 274 

- growth, 1524ff 

- -like structures in rapidly soldified pure iron, 

1594 

-, low-carbon, 1603 

- (to) martensite transformation, 1543 ff 
-morphology, 1510ff, 1522 ff, 1576 

— , banded, 1522 
— , butterfly, 1525 
— : laths, 1522 ff, 1526, 1576 
— : midrib, 1524 
— : needle shape, 1522 
— , thin-plate, 1525, 1576 

- nucleation, 1530 ff 

- -parent interface, 1524 ff 

— , dislocations in, 1522 ff 
-plates, 15 10 ff 
-: premartensitic state, 1550 
-: semicoherent interfaces, 1524 

- (in) steels, 1572ff 
-strength, 1602ff 

--as function of carbon content, 1603 
--as function of tempering, 1606 
-, stress-induced, 1540, 1912 

- substructure, 1517 ff, 1522 
-: surface martensite, 1522 

surface relief, 1510 ff 

- temperature, 1509 

- tempered, 

— strength and ductility, 1604 ff 
-tempering, 1579ff 

- variants, 1538 ff 

Martensitic transformation, 1508 ff, 1572ff 
-, athermal and isothermal, 1531 

Bain distortion and correspondence, 1512, 
1515, 1520 

-: butterfly morphology, 1525 
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critical stress, 1535 ff 

crystallographic theory (phenomenological, 

1 5 14 ff 

— : complementary shear, 1514 
— : dilatation parameter, 1521 
— : lattice-invariant deformation, 1514, 
1526, 1531 

— , mathematical description, 1518 ff 

- (as) displacive transformation, 1532 ff 
driving force 

— , chemical, 1532 ff 
— , mechanical, 1533 ff 
Greninger-Troiano orientation relationship, 
1514 

habit plane, 1511 ff, 1515, 1517, 1521 
hysteresis, 1527 
inhomogeneous shear, 15 17 ff 
invariant-line strain, 1514, 1520 
-: invariant-plane strain, 1511, 1520 
mechanical effects, 1531 ff 
— , M a temperature, 1 536 
M s temperature, 1509, 1535, 1572, 1574 ff 
— : effect of precipitation on, in steels, 1574 
orientation relationships, 1 5 1 2 ff, 1516, 

1571 

oxides, 1544 
pseudoelasticity, 1541 ff 

- (in) rapidly solidified steels, 1815 ff 
-, reverse, 1527 

rigid-body rotation, 1513 
shape-memory effect, 1538 ff (see also 
“Shape-memory effect” 

-: shape strain, 1510 
-: stabilization of austenite, 1575 
-, thermoelastic, 1527 ff (see also 

“transformation-induced plasticity") 

-, thermodynamics, 1529 ff, 1533 
-: transformation-(induced) plasticity, 1532, 

1 536 ff 

-: twinning, 1517 ff 

Massive transformation, 1393, 1398, 1417, 1577 
Matano method, 546 

Maximum resolved shear stress law, see “Schmid 
law” 

Maxwell element, 2726 ff 
Mechanical alloying, 2109, 2167 
Mechanical milling, 1766 ff 
Mechanical threshold, 1886 
Mechanical properties of single-phase crystalline 
materials, 1 878 ff. 1957 ff (see also “Elasticity", 
“Plastic deformation” and “Creep”) 
Mechanochemical reactions, 923 


Melt, transient conductance measurement, 1761 
Melt-extraction, 1749 
Melt-spinning, 1749 

Melt subdivision method of studying nucleation, 
693 ff 
Melting 

-, inverse, 1 734 ff 
-, surface, 978 
Mendeleev number, 102, 211 
Mercury 

- crystal structures, 22, 32 
Mesotextures, 2460 ff 

-: grain-boundary character distribution, 2462 
grain-boundary misorientation distribution, 
2460 ff 

-: Rodrigues method, 2460 
Metal-ceramic interfaces, 859 ff 
Metallic character, criteria, 149 
Metallic glasses, see “Amorphous alloys” 
Metallography, 

-, definition, 944 
etching, 950 ff 
— : anodic oxidation, 952 
— : interference-layer contrast, 952 ff 
— : ion-etching, 951 
-: grinding, 947 

-: image analysis, see “Quantitative 
metallography” 

-: polishing, 948 ff 
— , chemical, 948 ff 
— , electrolytic, (electrochemical) 948 ff 
— , thermal, 948 
— : ultramicrotoming, 949 
-, quantitative, see “Quantitative 
metallography” 

-: replica techniques for TEM, 950 
-: specimen preparation, 945 ff 
-: specimen sampling, 945 ff 
-: stereology, see “Quantitative metallography” 
Metal-matrix composites (see also “Composites”) 

- by solidification, 824 ff 
Metal recycling, 1283 ff 
Metastability (in alloys) 

-: categories 

— , compositional, 1727 ff 
— : configurational freezing, 1728 
— , kinetic, 1727 
— , morphological, 1727 ff 
— , structural, 1727 ff 
-, methods for achieving, 1725 ff 
-: microstructural manifestations, 1724 
-, nature of, 1726 ff 
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Metastable phases by undercooling, 699 ff. 

Metastable structures, 192 ff, 771, 1562, 1569, 
1724 ff 

Metastable equilibrium at melt-solid interface, 

684 

Metastable phase diagrams, 684 ff, 701, 772, 1735 

Microchemistry of grain boundaries and surfaces, 
1202 ff 

Microhardness, 961 

Microscopy 

-, acoustic, 980 
-, analytical electron, 1086 ff 
-, atomic-force, 974 ff 
— , applications, 977 ff 
-, atom-probe field-ion, 982 ff 
— , applications, 983 ff 
-, Auger-electron (scanning), 986 ff 
-, electron-channeling, 968 ff 
-, field-electron, 983 
-, field-ion, 981 ff, 1626 
— , applications, 983 ff 
-, fluorescence, 988 
-, high-resolution electron, see 

“High-resolution electron microscopy” 

-, optica], 945 ff 

— , confocal, 958 ff 
— , high-temperature, 959 
— : illumination, 955 ff 
— : interference contrast, 958 
— : interferometry, 960, 1211 
— , near-field (scanning), 959, 

--: phase contrast, 957 
— (with) polarized light, 956 ff 
— : resolution and depth of focus, 955 
— , scanning, 958 ff 

orientation imaging microscopy, 865, 969 ff, 
2462 

-, photo-electron emission, 985 ff 
-, quantitative television, 

-, scanning Auger electron, 986 ff 
-, scanning electron, 961 ff 
-, scanning transmission electron, 

— : contrast modes and detectors, 964 
— : contrast, atomic-number, 967 ff 
— : contrast, backscattered electron mode, 
962 

— : contrast, cathodoluminescent, 971 
— : contrast, electron-channeling, 968 ff 
— : contrast, magnetic, 970 

contrast, secondary-electron mode, 962 
— : contrast, topographic, 967 


— : depth range, 963 
— : images, 965 
— : signal processing, 963 
— : specimen preparation, 966 
— : stereomicroscopy, 97 Iff 
— : X-ray mapping, 970ff, 1217 
-, scanning acoustic, 979 ff 
— , applications, 980 
-; scanning techniques, various, 976ff 
-, scanning thermal wave, 979 ff 
-: scanning tunneling, 973 ff 
— , applications, 977 ff 
-, surface, 943 ff 

-, thermal wave, see “scanning thermal wave 
microscopy” 

-, transmission electron, see “Trans- mission 
electron microscopy” 
tunneling spectroscopy, 976 
— , applications, 979 
-, X-ray, 987 ff 

Microsegregation, 726, 749, 1204 (see also 
“Segregation”) 

Microstructural transformations, 866 ff 
-: coarsening by Brownian motion, 882 
-, driven by interfacial energy reduction, 

870 ff 

- due to electric fields, 886 

- due to magnetic fields, 885 

- due to stress fields, 885 ff 

- due to thermal cycling, 884 

in presence of temperature gradients, 883 ff 
-, experimental techniques for studying, 1372 ff 

- initiated by moving dislocations, 889 
Microstructure, 844 ff, 944 ff 

characterization, 865 ff, 
definition (constituent elements), 844, 944 
development, 870 ff 
-, self-organized (periodic), 890 ff 
superalloys, 2076 

Miedema’s model for heats of formation, 1 1 1 ff, 
141, 349 ff 
Miscibility gap, 478 
-, liquid-liquid, 483 
-, solid-solid, 478 
Misfit strain 

- from differential thermal contraction, 2584 
-: Eshelby’s model, 2581 ff 

Mixing energy (Gibbs), 475 
Mohr diagram, 2129 ff 
Mossbauer effect 

interstitial atoms, 1681 
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Molecular dynamics simulations 

- (of) crack structure, 2246 ff 

- (of) irradiation effects, 1685 ff, 1691 
Molybdenum-rhenium alloys, 2038 
Monotectic, 483, 771 ff 

Monotonic Laue scattering, 1145 
Morse potential, 1624 
Mosaic structure of crystals, 1132 
Motional narrowing (in NMR), 570 
Mott (metal-insulator) transition, 81 
Mott-Labusch mechanism, 2018 
Mould-metal system 
air gap, 673 

-, computer-modelling, 680 ff 
freezing at mould wall, 676 ff 
heat transfer, 673 ff 

Multiphase alloys, mechanical properties, 2106 ff 
Mushy zone, 672, 792 


i i abarro-Herring-Coble creep, 1988 ff 
Nanocomposites, 923 
Nanocrystallinc materials, 908 ff 
catalytic properties, 935 
-: consolidation, 916 ff 

- (with) doped grain boundaries, 925 ff 
-: generation methods, 914 ff 

giant magnetoresistance, 932 ff 
-: grain growth in, 2479 ff 
-: luminescence from nanocrystalline porous 
silicon, 933 ff 

- (for) magnetic recording, 932 
-: soft magnetic, 930, 2542 ff 

technological applications, 928 ff 
Nanoglasses, 921 ff 

Nanostructured materials, 900 ff, 1800ff 
-: magneto-caloric cooling, 93 1 ff 
Nearly-free electron approximation, 64, 151 
Neel point, 2504 
Neodymium 

- crystal structure, 39 
Nemst-Einstein relation, 550 
Neutron 

absorption coefficient, 1 1 20 f f 

- radiation, 1 1 19 ff 

- sources, 1 128 ff 
Neutron scattering, 1 1 1 6 ff 

aluminum-r alloys, 1140 
-: Bragg peaks, see “X-ray” 

-, diffuse near Bragg peaks, 1 134 ff 
-, diffuse between Bragg peaks, 1 139 ff 
diffusive motion, 1 1 87 ff 


-: inelastic, 1126ff 
-: isotope replacement, 1145, 1155 
-, magnetic, 1 1 23 ff, 1179 
-: order (short-range), 1144ff 
-, small-angle, 1161 ff (see also “Small-angle 
Scattering”) 

Niblett-Wilks peak, 1857 
Nickel, recovery from deformation, 2403 
Nickel-aluminum alloys (mainly NijAl), 1 173, 
1178, 1180, 1186, 1218, 1241, 1260, 1261, 
1308, 1391 ff, 1426, 1441, 1447, 1488, 1853, 
2046, 2076 ff, 2084, 2146, 2452, 2473 
-: plastic deformation and the flow stress 
anomaly, 2085 ff, 2195ff 
— : catalogue of features, 2086 
— : creep, 2196 
— : models, 2086 ff 
— : particle shear, 2201 
NiAl, mechanical properties, 2091 ff 
Nickel-base high-temperature alloys, 2171 (see 
also “Superalloys”) 

-: micromechanisms of plasticity, 2190ff 
Nickel-chromium alloys, 1 157 ff 
Nickel-cobalt alloys, 2015 
Nickel-manganese alloys, 2059 
Nickel-oxygen-sulphur system, 1316 
NiO band structure, 80 ff 
Niobium alloys 

-, hydrogen in, 1384 
-: oxidation, 1309 

phase distortions due to solutes, 1141 ff 
-superconducting Nb-Ti, examined by small- 
angle neutron scattering, 1 174 ff 
Nitrogen in iron, 1561 ff 
Noble metals 

-: crystal structures, 21 
-, lattice spacings in solid solutions of, 180 ff 
Nondestructive testing, 2276 
Nowotny phases, 258 ff 
Nuclear magnetic resonance, 570 ff 
Nucleation 

- alloys, solidification, 695 ff 

- (in) amorphous alloys, 1784 ff 

- and growth transformations, 1369 ff, 1374 ff 
-, cavity, 1265 

-• critical radius, 688 ff 

- (of) disorder, 1766 

-, experimental methods, 693 ff 
-: experimental findings, 1389 ff 
— : orientation relationships, 1389 

- from the melt, 687 ff 

- (at) grain boundaries, 1807 
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- heterogeneous, 689, 697 ff, 1378, 1385 ff, 

1389 

— (at) dislocations, 1387 
-- (at) GP zones, 1387 

— (at) grain boundaries and edges, 1386 ff 

— (at) grain boundaries, with lattice 
matching, 1388 ff 

— : test of theory, 1393 ff 
-, homogeneous, 689, 1374 ff, 1389 ff, 1756f 
-, metastable, 1389 (see also “Pre- 
precipitation”) 

- (in) primary recrystallization, 2425 ff 
-, pure metals, solidification, 693 ff 

- rate, 691 ff 

-: strain effects, 1 384 ff 

- theory, 1374 ff 

— : critical radius, 1375 

experimental tests, 1390 ff 
— : nucleation rate, 1376 
Nusselt number, 1756 


V/ctagonal symmetry, 378, 381 ff 
O-lattice theory, 845 ff 
Omega phase formation, 1546 ff 
Optical microscopy, see “Microscopy” 
Orbitals, 4ff, 51 ff, 59 
Order in solid solutions, 121, 193 ff, 198 ff, 
252 ff 

antiphase domains, see “Antiphase 
domains” 

creep, 2064 ff, 2078, 2080 
destruction by irradiation, 1687 
diffraction pattern, 1039 ff 
diffusion in ordered phases, 599 ff 
dislocations in ordered phases, see 
“Dislocations” 
flow stress, 2059 ff 

- hardening, see “Hardening” 
lattice parameter change, 2060 ff 

-, long-range, 198 ff 
magnetic field effects, 

-, magnetic, in relation to chemical SRO, 
1 158 ff 

mechanical properties, 2059 ff 
neutron scattering, 1144 ff 

- parameter, 198 
recrystallization, 2471 ff 

-, short-range, 198 ff, 

— : computer simulation, 1 149 ff 
— , directional, 2535 ff, 2553 ff 
— , in liquids, 501 


— : kinetics, 570 
— parameters, 1145 ff 
— : quasichemical theory of, 450 
— , studied by diffuse scattering of X-rays 
and neutrons, H44ff 
superdislocations, 1 850 ff, 2056 ff, 2081 ff 
-, vacancies in, 1646ff 
X-ray scattering 
— , short-range order, 1 142 ff 
Order-disorder transformations, 251 ff, 494, 1544 ff 
Ordered crystal structures, 252 ff 
-, electron microscopy of, 1039 ff 
-, stability, 121 
Ordering 

- and clustering, thermodynamics of, 437 
-, continuous, 1370, 1490 ff 

-energy, calculation, 1 19 ff 
Orientation function (parameter), 2680 ff 
Orientation distribution function, 2456 
Orientation imaging microscopy, 865, 969 ff, 2462 
Orowan loops, 1893, 1900, 1948, 2115 ff 
Orowan mechanism, 2114 ff, 2148 
Orowan stress, 2046, 2185 
Orthogonal plane-wave method, 73 
-: repulsive contribution from, 73 
Ostwald ripening, 460, 873 ff, 1437 ff, 2144 

- (at) early stage of precipitation, 1444 ff 

-: inhibition by solute segregation to particle 
interfaces, 1274 ff 
-: late-stage coarsening, 1442 ff 
-: radius distribution, 1438, 1441 
— , effect of this on kinetics, 1439 
scaling laws, 876 
stability against coarsening, 877 
technological applications, 878 
Overshoot in slip, 2029, 2056 
Oxidation 

- (of) alloys, 1306ff 
-, cyclic, 1328 

-: dissociative mechanism, 1301 

- (of) intermetallics, 1309 
-, internal, 1309 

-: internal stress, measurement, 1330 ff 
-: kinetics, 1297 ff 

--, measurement, 1325 ff 
--, parabolic, 1299 
— : Wagner’s theory, 1299 ff 
-: life prediction modelling, 1338 ff 
-, mechanism, 1298 ff, 1328 ff 

- (of) metallic materials, 1292ff 

- (in) multicomponent atmospheres, 13 1 1 ff, 

1335 ff 
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preferential, 1294 ff 

- (in a) solution, thermodynamics, 449 ff 
selective, as a function of alloy 

composition, 1306 ff 

-, surface, inhibition by segregants, 1279 ff 
thermodynamics, 1293 ff 
-, transient, 1308 
Oxide layers 

-, diffusion in, 1303 ff 
electrical properties, 1303 
macrodefects, measurement, 1334 ff 
mechanical properties, 1304 ff 
scale adhesion, 1309 ff 
scale failure, detection, 1333 ff 
spallation, 1 305 ff 

stress generation and relief in, 1305 ff 
Oxide-dispersion-strengthened alloys, 1310, 2107, 
2184 ff, 2187 

high-temperature fatigue properties, 2189 ff 
-, recrystallization, 2203 ff 
Oxides 

-, amorphous, 1296 

-, crystalline, non-stoichiometry, 1296 ff 
Oxide stability, 1293 ff 

- (of) mixed oxides, 13 14 ff 

P acking densities (atomic) in elements, 12 
Pair distribution function, 1769 ff 
Particle drag on grain boundaries, 889, 1443 
Particle hardening, see also “Dispersion- 

strengthening” and “Hardening, precipitation-” 
macroscopic behavior, theory, 2182 ff 
particle shearing, 2044, 2048, 116, 2194 ff, 
2201 

threshold stress, 2185 ff 
Pauli exclusion principle, 48 
Peach-Koehler force, 1836 ff 
Pearlite, 1564, 1570ff, 1600 
Pearson (structure type) symbol, 223 
Pcclet number, 714ff, 733 
Peierls barrier (stress), 1 843 ff, 1894 ff 
Pencil glide, 1585 

Penrose tiling, see “Quasiperiodic tilings” 
Peritectic, see “Phase diagrams” 

Peritectoid, see “Phase diagrams” 

Permalloy, 2536 

Periodic table of the elements, 14, 54 ff 
Persistent slip bands, see “Fatigue” 

Perturbed y-y angular correlation, 1636, 1638 
Phase (interphase) boundary, 453 ff 
-, limiting slope, 488 
metastable, 699 ff 


Phase diagrams, 472 ff 
binary, 472ff 

calculation from thermodynamic input, 495 ff 
— , optimization of phase boundaries, 496 ff 
— , ternary and multicomponent systems, 
516ff 

classification, 482 ff, 524 ff 
compilations, 530 
computer-coupled analysis, 495 
-, constant-composition section, see 
“temary-isopleth (section)” 

-, eutectic systems, 480ff 
— , ternary, 507 
extension rule, 493 
gaseous phase in, 503, 519ff 
interdiffusion, use of for measuring, 529 
invariant reactions, nomenclature, 515 
-, iron-carbon, 1565 
-: law of adjoining phase regions, 513 
-: measurement techniques, 525 ff 
-, metastable, see “Metastable phase 
diagrams” 

-: miscibility gaps, 478 ff 
-, monotectic, 483 
-, multicomponent, 514ff 
-: peritectic, 483 ff 
— , ternary, 508 
-: peritectoid, 493 
-, with potentials as axes, 5 1 8 ff 
quenching techniques, 528 
-, syntectic, 485 
-, ternary, 503 ff 

— : isopleth (section), 512 
— : isothermal sections, 509 ff 
— : polythermal projection, 506 
-: thermal analysis, 526 ff 
-: thermodynamic interpretation, 443 ff, 474 ff 
-: tie-lines, 473 
-, topology of binary, 492ff 
-, topology of ternary, 5 1 1 ff 
-: two-phase fields, extrema in, 477 
-: zero phase-fraction lines, 515 
Phase equilibria, 472 ff 

-: equilibrium constant, 426 
-, heterogeneous and activity measurement, 464 
- in a one-component system, 422ff 
stability diagrams, 434 
-: triple point, 424 
Phase morphology, 866 
Phase rule, 450 

components, 45 1 
-: degrees of freedom, 450 
-: species, 45 1 
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Phase stability, 140 ff, 434 
calculation, 142 ff 

Phase transformations, see ‘Transformations” and 
“Solidification” 

Phonon modes, 1141 
Phonon spectra 

by inelastic neutron scattering, 1183 ff 
Kohn anomalies, 1185 
Phonon wind, 883 

Phonons, role in diffusion theory, 555 ff 
Photon probe techniques, 994 ff 
Pilling-Bedworth ratio, 1305 
Piobert-Litders band, see “Liiders band” 
Piston-and-anvil quenching, 1749 
Pitsch orientation relationship, 1572 
Planar flow-casting, 1749 ff 
Plastic deformation (see also “Deformation”, 
“Dislocations” and “Slip”) 
activable cluster, 1887 ff 
activation area for dislocations, 2180 
activation parameters for plasticity, 1891 ff, 
2180 

activation time, 1884 

activation volumes, apparent and true, 2180 
amorphous alloys, 1796 ff 
-, asymmetric, bcc metals, 
athermal stage, 2180 
cyclic and monotonic deformation 
compared, 2336 ff 

critical resolved shear stress for glide, 

1885 ff 

- in presence of diffusion, 1957 ff 
instability in tensile deformation, 1949 ff 
jump experiments, 1892 

kinking, 1912 ff 

- resulting from dislocation glide, generalities, 

1881 ff 

stress-strain curves, see “Stress-strain 
curves” 

thermally activated, 1887 ff 
Plasticity 

-, continuum (phenomenological), 1946 ff, 
2698 ff 

— : Mohr diagram, 2129 ff 
— : von Mises condition, 1946, 2590, 

2698 

Plutonium 

allotropy, 34, 44 
Pnictides, 36 
Point defects, 1622 ff 
-clusters, 1 1 80 ff 
condensation, 896 


- created by intersecting dislocations, 1904 

- created by moving grain boundaries, 896 ff 
effect on precipitation, 894 ff 

emission during fatigue, 2331 
lattice, 894 

-, small-angle scattering from clusters, 1 180 ff 
-, X-ray scattering by, 1 136 ff. Point 
compounds, 206 
Poisson’s ratio, 1880 
Polarized-light microscopy, 956 ff 
Polysynthetic twinning, 2096 ff 
Pole figures, 2456 ff 

Polishing, metallographic, see “Metallography” 
Polycrystals, plastic deformation of, 1940ff 
Polygonization, 2410 ff 
Polymer science, 2663 ff 

alloys (blends), 2682 ff 
— : critical solution temperatures, 2685 
— : entropy and enthalpy of mixing, 

2683 ff 

— : polymer-polymer miscibility, 2684 ff 
-: amorphous polymers, 2665 ff 

— , chain conformations (structures), 

2730 ff 

- — : chain conformations and solvent 
effects, 2733 ff 
— : chain statistics, 2732 ff 
— , textures in, 2677 ff 
— : viscoelasticity model, 2729 ff 
-: annealing of polymers, 2671 
-: chain folding, 2670 ff 
-: concept of crystallinity with respect to 
polymers, 2668 ff 
-: conjugated polymers, 27 1 3 ff 
-: copolymers, 2689 ff 

— , block, , 907 ff, 2689 ff 
— , random, 2691 ff 
crazing, 2707 ff 

— : anisotropy of craze initiation, 2710 
— : craze criteria, 2707 
— : environmental effects, 2710 
— , microstructure and mechanisms, 271 Off 
— , propagation, 2709 ff 
-: crystal thickening, 2671 
-: crystals, single, of (poly)diacetylene, 2672 
crystallinity, percentage, 2670 
-: crystallization, sluggishness of, 2668 
-: deformation (plastic) of polymers and 
metals compared, 2692 ff 
-: director, 2679 

drawing of polymers, 2697 ff 
— : natural draw ratio, 2697 
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equilibrium diagrams, see “phase diagrams 
of polymeric systems" 
electrical conduc tion, 27I2ff 
— , conjugated polymers, 27 1 3 ff 
— , applications, 2718 ff 
fibers, 908, 2700 ff 
— , conventionally drawn, 2703 ff 
— , high-performance, 2705 
— , Kevlar, 2672 
— , microstructure, 2704 
— , theoretical axial modulus, 2700 ff 
fibrils, 2673 
glass transition, 2720 ff 
— , control, 2724 ff 
— , interpretation, 2725 
— : melt or rubber?, 2725 ff 
liquid-crystalline polymers, 2705 
lyotropic phases (systems), 2667, 2687 
naming of polymers, 2668, 2669 
non-periodic layer crystals, 2692 
phase diagrams of polymeric systems, 

2684 ff 

(poly)acetylene, 2713 ff 
— , band structure, 2714 
— , polarons, 2717 
— , solitons, 27 1 5 ff 
(poly)ethylene 

— : modification of crystal morphology, 
2672 ff 

— : relationship to diamond structure, 
2702 

polymer-solvent systems, 2686 ff 
relationship to physical metallurgy, 2664 ff 
rubberlike elasticity, 2735 ff 
— : affine deformation of a network, 2735 
— bond rotation in real chains, 273 1 ff 
— : dependence of entropy on strain, 

2737 ff 

— : entropy spring concept, 2736 
— : high-strain anomaly, 2739 ff 
— : stress-strain curve, 2738 ff 
rubbers, 2725 ff 
— , structure, 2734 ff 
— , vulcanization, 2726 
semicrystalline, 903 ff 
— : spherulitic crystallization, 905, 2673 
textures of polymers, 2676 
— : orientation functions (parameters), 
2680 

— : rolling textures, 2680 ff 
— : texture (strength) parameter, 2679 ff 


-: thermoplastics, 2655 ff 

— , amorphous (non-crystalline), 2665 ff 
— , drawing, 2696 
— , liquid-crystalline, 2667 
— , semicrystalline, 2666 ff 
-: thermosets, 2665 

thermotropic polymers, 2667 
viscoelasticity, 2726 ff 
Polymorphism, 10 ff 
Polytypism, 7 f f, 257, 286 ff, 310ff, 384 
Porod approximation, 1165 
Porosity, 793 ff 

- and gas in melt, 793 ff 

- and sintering, see “Sintering” 

Porous silicon, 933 ff 
Portevin-Le Chatelier effect, 2042 
Positron-annihilation spectrometry 

- and the Fermi surface, 175 

- and interstitials, 1681 

- and vacancy concentrations, 1633, 1636 ff 
Powder metallurgy, see “Hot Pressing”, “Hot 

isostatic pressing" and “Sintering” 

Powder solidification, 679 ff 
Praesodymium, crystal structure, 42 
Precipitate^) 

-, coherency, 2107, 2109 
-: -dislocation interaction, see Hardening, 
precipitation-” 

- dissolution, 143 1 ff 

equilibrium shape, 1380 ff, 1405 ff, 1426 

- -free zones, 895 
grain-boundary migration, 

- growth, 1393 ff (see also “Growth”) 

- growth instability, 

- growth under stress, 1465 ff 

-: imaging in the electron microscope, 1067 ff 
-, incoherent, see “Interface” 

- lattice, 892 

-: needle morphology, 1396 

-, plate-like, see “Widmanstatten precipitates" 

- reversion, 1433 ff 

-: segregation to interfaces, 1274 ff 
-, semicoherent, see “interface” 

-, shearable, 1898 

solute pileup at growing precipitates, 

1244 ff 

- stress (in and around), 

-, Widmanstatten, see Widmanstatten 
precipitates” 

Pre-precipitation, 1140, 1143, 1 1 55 ff, 1 166 ff, 
1369, 1385, 1485, 1709, 1806 ff, 1861, 2360 
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Precipitation 

- aided by moving dislocations, 889 

- combined with coarsening, 1444 ff 
competitive growth 

— : early stages, 1435 ff 
competitive dissolution of small 
precipitates, before precipitation is 
complete, 1443 ff 
-, discontinuous, 1456 ff 
-, driving forces for, 1365 ff 
-, enhanced by point defects, 894 ff 
growth, see “Growth” 

- hardening, see “Hardening” 

- in nanoporous materials, 9 1 5 

- (of an) intermetallic phase, 

thermodynamics, 1367 
-, irradiation-induced, 896 

Johnson-(Avrami>-Mehl kinetics, 1435 ff, 
1788, 2421 

nucleation, see “Nucleation” 

soft impingement effect, 1426, 1435 ff 

strain energy effects, 1383 ff 

- thermodynamics, 1366 ff 
Preferred orientations, see ‘Textures” 
Premartensitic effect, 1550 

Primary solid solutions, solubility in, 150 ff 
Principle of similitude, 1923, 1928, 1981 
Protection of metallic materials, 1343ff 
Protective coating technology, 1343 ff 
diffusion coatings, 1345 ff 
future trends, 1354 ff 
laser processes, 1351 
overlay coatings, 1348 ff 
physical vapor deposition, 1349 ff 
spraying processes, 1350 ff 
Protective coatings 

-, mechanical proerties, 1 353 ff 
-, oxidation and hot-salt resistance, 1 35 1 ff 
-, thermal stability, 1352 ff 
Pseudoelasticity, see “Shape-memory effect" 
Pseudopotential (empty-core), 73 ff, 95, 150 

^Quantitative metallography (quantitative 
microstructural analysis), 989 ff 
image analysis, 997 ff 
— , automation, 999 
— , instrumentation, 1001 
mathematical morph ology, 1014 ff 
stereology, 1001 ff 
— : applications, passim, 1001 ff 
— : arrangement parameters, 1013 ff 


— : contiguity, 1010 ff 
— : curvature, 1012 ff 
— : grain size, 1006 
— : interface density, 1004 ff 
— : mean intersect area, 1006 
— : orientation of interfaces, 101 Off 
— : particle size distributions, 1007 ff 
— : planar features in relation to three- 
dimensional variables, 1002 
— : shape distributions, 1008 , 1011 ff 
— : topological parameters, 1012 
— : volume fraction analysis, 1003 
Quantum numbers, 51 
Quasicrystals (quasiperiodically ordered 
structures), 372 ff 

approximants, 373, 379 ff, 385 
external; facets, 400, 405 
higher-dimensional approach, 376 ff 
hyperatoms, 385, 392, 403, 407 ff 
-, one-dimensional, 380 
-, orientational order in, 375 
-, structure, 379 ff, 395 ff, 404 
superspace groups, 385, 398 
-, symmetry, 378 
tiling, 375, 390 
-, two-dimensional, 380 

X-ray structure analysis of an alloy of 
decagonal symmetry, 388 
Quasi-elastic neutron scattering, 572 
Quasiperiodic tilings, 374 ff 
Quasiperitectic equilibria, 508 

dial (electron) probability density, 56 ff 
Radiation effects, see “Irradiation effects” 
Radius of gyration, 1164 
Rafting, see “Superalloys” 

Random walk motion 

- in a crystal, 546 ff 

- in a glass, 649 ff 
Raoult’s law, 436 

Rapid quenching from the melt, see 
“Solidification, rapid” 

Rapid solidification processing (RSP), see 
“Solidification, rapid” and “RSP crystalline 
alloys” 

pseudo-RSP, 1758 

Rare earth metals, see “Lanthanides" 

Rayleigh instability, 880 
Reaction equilibrium in solutions, 447 ff 
Read-Shockley equation, 1870, 2412 
Reciprocal lattice, 68, 1043, 1097 ff 
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Recovery from deformation, 2401 ff 
aluminum, 2403 ff 

annealing textures, effect of prior recovery 
on, 2418 

cell formation, 1978, 2412 ff 

cell evolution, 1980, 2418 

cell size in relation to flow stress, 1981 

- (in) copper... does it exist?, 2402 ff, 2407 
(role in) creep, 1973 ff 

dislocation density reduction, 1978 
-, dynamic, 1924, 1929, 2003, 2030 ff, 2127, 
2408 

— : dynamic secondary recrystallization, 
2486 

fatigue-strain enhanced, 2408 
impurity influence, 2403 
iron alloys, complete recovery, 2405 ff 
kinetics, 2405 ff 
— , theories of kinetics, 2417 ff 
-, (of) mechanical properties, 2405 ff 
-, meta-, 2408 ff 
-, ortho-, 2410 

polygonization, 241 Off 

- (in) steels, 1587 

stored internal energy and its recovery, 

2401 ff 

-, stress-enhanced, 2406 ff 
Recovery of electrical resistivity after irradiation, 
1692 ff 

Recovery of electrical resistivity after quenching 
, 1634 ff 

resistivity per interstitial, 1654 ff 
resistivity per vacancy, 1 629 ff 
stage I, 1 667 ff 
stage II, 1674 ff 

stage III controversy, 1622 ff, 1636 ff, 1640, 
1670ff 

-, use to determine volume of vacancy 
formation, 1628 

- (and) vacancy concentrations, 1634 
Recrystallization, 2419f 

annealing textures, see “Annealing textures" 
classification of phenomena, 2400 

- diagram, 2421 ff 

-, directional, 1818, 2205 
-, dynamic, 1999ff, 2453 ff 

grain-boundary migration, see “Grain 
boundary” 

grain growth, 870 ff, 2474 ff 
— , epitaxial, 2491 ff 
— : grain-size distribution, 2476 
— : impurity influence, 2475 


— kinetics, 2476ff 

— : mechanism, 2474 ff 

— (in) nanocrystalline materials, 2479ff 

— (and) pores, 2642 ff 

— : second phase influence, 887 ff, 2476 

— (and) sintering, 2492ff, 2642ff 
— : stagnation in thin films, 2490 ff 
— : texture inhibition, 2477 

— : thickness inhibition, 2476 ff 

— (in) thin films, 24891T 

- kinetics, 1588, 2421 ff 

— : effect of minor solutes on precipitates 
in steels, 1588ff 
laws of recrystallization, 2419 
-, metadynamic, 2164, 2454 
neutron radiation influence, 245 1 
nucleation, 2425 ff 
— : models, 2427 
— , oriented, 2427 ff 
— : role of inhomogeneity of orientation 
after deformation, 2428 ff 
— : strain-induced grain-boundary 
migration, 2435 ff 

— nucleation, stimulated, 216, 

— : subgrain coalescence, 2435 ff 
— : techniques of investigation, 2425 
— , twin-based, 2438 ff 

- (of) ordered alloys, 247 Iff 

— : antiphase domain creation during, 

247 Iff 

— : retardation of grain-boundary 
migration, 2471 ff 
-, primary, 

— : annealing textures, 2205 
— : critical strain, 2420 
— : growth of grains, see “Grain-boundary, 
migration” 

hot working, see “dynamic” 

— : impurity influence, 2423 ff 
— : kinetics, 1588, 2421 ff 
— : Kronberg-Wilson rotation, 2440 
— : microgrowth selection, 2435 
— : nucleation of grains, see “nucleation” 
recrystallization-controlled rolling, 2455 
retardation due to recovery , 2424 
-, secondary, 2482 ff 
— : driving force, 2483 
— , dynamic, 2486 

— : role of disperse phase, 2485, 2487 

— (and) sintering, 2492 
surface-controlled, 2487 ff 

— texture, 2486 ff, 2488 
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- (and) sintering, 2492 ff 

- (in) steels, 1587 ff 
-, tertiary, 2487 ff 

threshold strain for recrystallization, 2420 

- (of) two-phase alloys, 2158 ff, 2203 ff, 

2463 ff 

grain-boundary pinning, 2467 ff 
— : micromechanisms, 2163 ff 
— : misorientation near large particles, 
2125 ff, 2466 

— : nucleation at particles, 2463 ff 
— : effect of particle spacing, 2161 ff, 

2464 ff 

vacancies in grain boundaries, 2450 ff 
Zener force, 1009, 2159 
Recycling of metals, 1283 ff 
Reduced dimensionality, 900 ff 
Relative valency effect, 147 
Relaxation methods in diffusion measurements, 
565 ff 

Renormalization, 91 
Replacement collision sequence, 1651 
Reversion, 1807 
Rheocasting, 826 ff, 829 
Richard’s rule, 419, 476 
Rigid band approximation, 109, 151 
Rodrigues method, 2460 
Rough liquid-solid interface, 702 ff 
Roughness transition at surfaces, 626 
RSP (rapid-solidification-processed) crystalline 
alloys, 1 809 ff 

aluminum alloys, 1795 ff, 1810, 1812 ff 
steels, 1594, 1809 ff, 1814 ff 
superalloys, 1817 ff 
Rubberlike behavior 

- in alloys, 1542 ff 

- in polymers, 2735 ff 

Samson phases, 314 ff 

Scanning electron microscopy, see “Microscopy” 
Scanning transmission electron microscopy, 

1217 ff 

Scanning tunneling microscopy, see “Microscopy” 

Scheil equation, 715, 749, 751 ff 

Schmid’s law, 1852, 2086 

Schmid strain resolution tensor, 1882, 1885 

Schreinemakers’s rule, 511, 513 

Schrodinger equation, 48 

Screw dislocation, see “Dislocations” 

Secondary-ion mass spectrometry, 1215 ff 

Secondary-ion microscopy, 1217 


Segregation 

adsorbate-adsorbate interactions, 1 229 ff , 
1232 ff, 1272ff 

- and materials design, 1280 

- (during) austenizing, 1582 
-, competitive, 1272ff, 1281 ff 

-: complex effect of chromium on, 1272 

- (in) complex metallurgical systems, 1233 ff 
effect on mechanical properties, 1263 ff 
enrichment factor (ratio), 1209, 1222 

— : correlation with solubility, 1222ff 
-, equilibrium, 1202, 1203, 1218ff, 1239 
Fowler theory, 1229, 1231, 1233 
-, free energy of segregation 

— , to grain boundaries, 122 Iff 
— , to surfaces, 1225 ff 
— : temperature dependence of, 1230ff 
grain-boundary segregation, 1202ff, 2271, 
2285 

— (at) asymmetrical grain boundaries, 
1237 ff 

— : composition-depth profiles, 1213, 

1216 

— , computer simulation of, 1238 ff 
— : correlation with segregation at 
surfaces, 1240ff 

— (and) grain-boundary diffusion, 1254 ff 
— : micrographic techniques, 1216 ff 

— : grain-boundary segregation diagram, 
1224, 1226 

— at moving grain boundaries, 1248 

— (in) steels, 1214 ff, 1263 ff, 1595, 
1612ff 

— : orientation effects, 1225 

— (at) symmetrical grain boundaries, 

1235 ff 

- in multicomponent systems, 1234 
interaction of distinct segregants, 1272 

-, interfacial, methods of measurement, 

1209 ff 

interfacial, thermodynamics, 1205 ff 
irradiation-induced, 640 ff, 1708 
-kinetics, 1242ff 

-: Langmuir-McLean theory, 1219 ff 
-: Maxwell-Boltzmann relation, 1219 
-, non-equilibrium, 640, 1204, 1218 ff, 

1244 ff, 1708 

-, quench-induced, 1245 ff 
site competition, 1232 ff, 1243 
-, stress-induced, 1248 
-: substitutional segregation model, 1229 
-, surface, 1 225 ff, 1240ff 
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- (and) surface diffusion, 1254 ff 
ternary systems, 1272 

- theory, 1218 ff 

— , quasichemical, 1 225 ff 
Selenium 

- crystal structures, 26 ff, 38 
Self-diffusion, 572 ff 
Semicrystalline polymers, 903 ff 
Sendust alloy, 2533 ff 
Serrated flow, 1869 

Shape analysis, 1010 ff 
Shape-memory effect, 1538 ff 
pseudoelasticity, 1541 
— : rubber-like behavior, 1 542 IT 
thermomechanical recovery stress, 1541 
superelasticity, 1541 ff 
— : martensite-to-martensite 
transformations, 1543 ff 
training, 1540 
-, two-way, 1540 
Shear planes, crystallographic, 260 
Shockley partial dislocation, Short-range order, 
see “Order in solid solutions” 

Sigma phase, 178 
Silicon 

-, amorphous, 1729, 1761 
-, liquid, 1729, 1761 

phase change under pressure, 1768 ff 
-, porous, 933 ff 

Silicon steels (ferromagnetic), 1252, 1614 ff, 

2526 ff 

domain configuration, 2528 ff 
gamma loop, 2526 
grain size, 2530 ff 
-, high-silicon, 2533 

magnetic properties, 2526 ff 
— : effect of stress, 2528 ff 

— in relation to deviations from ideal 
grain orientation, 2528 ff 

production methods, 2531 ff 

— in relation to magnetic properties, 2532 
recrystallization, 2484 

grain-oriented, 1614 ff, 2528 ff 
(effect of) surface smoothness, 2531 
Silver-aluminum alloys, 2025, 2029 
Silver-gold alloys, 2024 
SIMS, see “Secondary ion mass spectrometry” 
Single-crystal growth, 809 ff 
Sintered aluminum powder, 2107 
Sintering, 2627 ff 

- (of) amorphous powders, 2638 ff 
densification, 2638 


dislocations, role of, 2632 ff 
driving energy, 2630 ff 
effect of chemical reactions, 2631 ff 
grain-boundary role, 2632 ff 

- (and) grain growth, 2492 ff, 2642 ff 
-, liquid-phase, 2650 ff 

- maps, 2636 

microstructure development, 2642 ff 

- monosized particles, 2641 

- neck growth equation, 2633 
pore drag and coalescence, 2643 
pore geometry, 2630, 2643 

- porosity, time dependence, 2638 
pressure-sintering, see “Hot pressing” 
(and) secondary recrystallization, 2492 ff 
shrinkage, accelerating and retarding 

influences, 2639 ff 

shrinkage kinetics (equation), 2636, 2640 
— : numerical approaches, 2640 
shrinkage, local, 2637 
particle center approach, 2635, 2637 
particle size distribution and pore size 
distribution, 2641 
-, pressureless, 2628 
-, solid-state, 2628 ff 

technological outlook, 2653 ff 

- (and) surface energy, 2630 
undercutting, 2635 
zero-creep technique, 2630 

Size factor, 144, 154 ff, 157 If, 330, 348 
Slip 

- and glide distinguished, 1883 

- band, 

— , persistent, see “Fatigue” 
bcc crystal, 1852, 1907 
coarse slip (in) fatigued alloys, 2043 
-, cross-, 2090, 2123, 2200 
easy glide in fee alloys, 2029 
fee crystal, 1852, 1907 
hep crystal, 1852, 1907 

- irreversibility in fatigue, 2370 
lattice rotation, 1884 ff 

- lines (bands) 

— , pure metals, 19 1 8 ff, 1933 
— , solid solutions, 2013 ff 
overshoot in fee alloys, 2029, 2056 

- planes, 1852 
-systems, 1 852 ff, 1906 ff 

— : tabulation, 1908 

Small-angle scattering of X-rays and neutrons, 
1161 ff 

alloys, 1 166 ff 
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- from dislocations, 1 179 ff 

- from point-defect clusters, 1 1 80 ff 
multiple scattering, 1182 
precipitation in aluminum-zinc alloys, 

1 1 66 ff, 1486 ff 
Snoek effect, 567 
Sodium, 

Wigner-Seitz theory of bonding, 5 1 
Sodium chloride structure, hardening, 2038 ff 
Solidification, 670 ff 

amorphous alloy formation, 1728 ff 
-, binary alloy, 709 ff 
-: cell formation, 725 ff, 731 ff, 754, 765 
cell spacing, 741 ff 
collision-limited growth model, 706 
-, computer modelling, 680 ff, 704, 706 
constitutional supercooling, 721 
— , criterion, 724 
constrained growth, 732 ff 
continuous growth of solid, 704 ff, 710 
-, controlled, 679, 68 1 ff 
convection, 780 

- cooling rates during rapid quenching, 1752 ff 
dendrite formation, 73 1 ff (see also 

“dendrite” and “dendritic growth”) 

-, diffusion-controlled, 714, 717 
-, directional, 679, 681 ff 
disorder trapping, 7 1 2 ff 

- (in) drop tubes, 1757 
electron-beam surface treatment, 
equilibrium freezing, 7 14 ff 

-, eutectic, 756 ff 

— : branching-limited growth, 765 
— : classification, 757 
— : coarsening after solidification, 878 ff 
— colonies, 765 
— : competitive growth, 765 ff 
— : coupled growth, 758 
— : coupled zone, 765 ff 
— , divorced, 767 
— : growth rates, 758 ff, 765 ff 
— : liquid/solid interface, 758 ff 
— : lamellar instability, 762 
— : lamellar vs rod growth, 758 
— : modification, 815 ff 
— : non-facetted-facetted, 763 ff 
— , rapid solidification, 768 ff 
— : supercooling, 761 
facetted growth, 708 
fluid flow, 780 ff 
grain size, 700 
heat transfer, 670 ff 


- (at) high undercoolings, 1756ff 
hypercooling, 1756 
inclusions, 794ff 
ingot structure, 78 Iff 
interface kinetics, 700ff 
interface temperature, 71 Off 
laser surface treatment, 
liquid-solid interface 
— , diffuse, growth, 707 
— , ledged, 1410ff 
— , local equilibrium, 683 ff 
— , non-planar, 720 
— , planar, 714ff, 720ff 
— , shape, 714ff 
— , sharp, growth, 704 ff 
— , structure, 702 ff 
— in ternary alloys, 754 
macrosegregation, see “Macrosegregation” 
-: microgravity, effect of, 821 ff 
-: microsegregation, see “Microsegregation” 
-: miscibility gap, 771 ff 
-, monotectic, 483, 771 ff 

— , directional solidification, 773 ff 
-: morphological (in)stability of planar 
liquid-solid interface, 720 ff, 726 
— : cellular structures, 725 ff 
— : effect of fluid flow, 729 ff 
— : experiments, 725 ff 
— : microsegregation, see 
“Microsegregation” 
non-equilibrium freezing 
— : no solid diffusion, 715 
— : partial mixing in liquid, 718 ff 
-: nucleation of solid, 687 ff, 1756 ff 
-: partition coefficient, 683, 728 

— , dependence on interface velocity, 710 
-, partitionless, 737 
-, peritectic, 775 ff 
— , aligned, 778 
-: porosity, 793 ff 
-, powder, 679 ff 
-: predendritic nuclei, 673 
-, rapid, 771, 775, 779ff,820ff, 1724,1748 ff, 
2544 ff 

-, rapid, practical methods, 820 ff, 1725 , 
1748 ff 

— , atomization, 820, 1748 
— , chill methods (including melt- 
spinning, melt extraction, etc.), 677 ff, 
821, 1748 ff 
— : consolidation, 1752 
— : cooling rates in, 1752 ff 
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— : crystalline alloys, see “RSP crystalline 
alloys” 

plasma spraying, 1749 
pseudo-RSP, 1758 
— , self-substrate methods, 1759 ff 
— , spark-erosion, 1752 
— : splat-quenching, 1748 
— : surfaces, 1759 ff 

- rates, direct measurement, 1761 ff 
response functions, 709 ff 
rheocasting, 826 ff, 829 

solid diffusion during freezing, 7 16 ff 
solidification path, 754 
solute-happing, 685, 712, 770 
subdivided melt method, 693 ff, 1756 ff 
-, ternary alloys, 752 ff 
thermodynamics, 682 ff 
-, weld zone, see “Welding” 

Solid solubility, 136 ff, 145, 150ff, 485 ff 
Solid solutions, 136 ff 
-, aluminum-base, 1 82 ff 
-, atomic size in, 154 ff, 159 ff 
classification, 138 ff 
creep, 1969ff, 2040 ff, 2064 ff, 
deformation twinning, 203 1 ff 
-, dislocation motion in, 1896 

electronegativity influence, 108, 1 14 ff, 147, 
161 

electron phases, 108 ff, 111, 166 ff, 170, 

225 

fatigue, 2043, 2346 ff 

- hardening, see “Hardening” 

-, Henrian, 485 

-, inhomogeneous, thermodynamics, 1481 
-, interstitial, 139, 1561 ff 
-, iron-base, 1561 ff 

— : solubility of carbon in iron in 

equilibrium with different phases, 1562 ff 
— : solubility of nitrogen in, 1564 
lattice spacing in, 180 ff 
mechanical properties, 20 10 ff 
noble-metal based, 1 80 ff 
ordered, 121, 193 ff, 198 ff, 252 ff 

- recovery (microstructural), 1979 

size effect influence, see “size factor” 
solubility prediction, 346 
stacking faults in, 191 
thermodynamic properties, analysis, 496 ff 
— , excess properties, 496 
optimization, 496 ff 
transition-metal based, 154 
Vegard’s law, 164, 330 ff 


Solid-state amorphizalion reactions, 1 764 ff 
Solid-state chemistry of intermetallic compounds, 
206 ff 

Solidus, 473 

Solute drag, 1396 ff, 1478, 1866 ff, 1970 ff, 2018, 
2440 ff 

-, dilute solid solutions, 2019 
-, concentrated solid solutions, 2019 ff 
Solute pumping, 895 
Solute-trapping, 685, 712, 770, 1744 
Solution-softening, 2035 ff 
Solutions, 

- regular, 439 ff 

-, thermodynamics of, 435 ff 
-, activity in, 438 

Sonochemical method of making amorphous iron, 
1763 ff 

Sonoluminesccnce, 1763 
Space group symbols, 5 ff 
Spallation maps, 1306 
Spectrometry 

-, Auger-electron, 989, 1212 
-, electron energy loss, 1087, 1091 ff, 1217 ff 
ion-scattering, 1214 ff 
photon probe techniques, 994 ff 
positron-annihilation, 175, 1633, 1636 ff, 
1681 

secondary-ion mass, 1215 ff 
Spark-erosion, 1752 
Sphere packing, 7 ff 

Spheroidization of cylindrical inclusions, 880 ff 
Spin waves, 2549 
Spinodal alloys 

magnetic properties, 25 1 6 ff 

Spinodal 

coarsening (late) stage, 1486, 1489 
-, coherent, 1484 
-, conditional, 1493 

-decomposition, 1 1 67 ff , 1 175 ff, 1 369 ff, 

1480 ff, 1581, 2055 
— : fastest-growing wavelength, 1483 
-: role of thermal fluctuations, 1485 
Splat-cooling, see “Solidification, rapid” 

Stability diagrams, see “structure, maps” 

Stacking fault(s), 189 ff, 1 846 ff 
-, complex, 1850ff, 2083 

- energy, 1 89 

— in fee solid solutions, 2030 ff 

— in two-phase alloys, 2046 
-, extrinsic, 190, 1848 

-, intrinsic, 190, 1846 

- in LI 2 phasses, 2086 ff 
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-, measurement, 190 ff 

(in) solid solutions, 191, 1074 

- tetrahedra, 1066, 1839, 1848 

-, superlattice extrinsic, 2099, 2150 
-, superlattice intrinsic, 2088, 2149 
-, twin growth, 190 
Standard molar Gibbs energy, 485 
Standard state, Henrian and Raoultian, 442 
Steels, 1556 ff {see also “Iron”) 

alloying elements, important, 1557 
-, ausforming, 1609 ff 

austenite grain size (prior), 1605 ff 
-, austenitic, 1568, 1610 ff 
bake-hardening, 1597 
brittleness, 

— : caused by impurity segregation, 

1270 ff, 1275 ff, 1281, 1582, 1612 
-, carbides in, 1563 ff, 1569 
continuous casting, 799 ff 
copper in steels, 1601 
deformation, 1583 ff 
-, dual-phase, 1601 ff 

- (for) electrical applications, see “Silicon 

steels” 

-, ferritic, 1568 

hardenabilily, 1578 ff 
-, heat treatment, 1577 ff 
-, high speed, 1610 

- high-strength low-alloy, 1600 ff 
hydrogen embrittlement, 1279, 2217ff, 

2282 ff 

intercritical annealing, 1601 
-, interstitial-free, 1594 ff, 1615 
iron-carbon phase diagram, 1565 
-, killed, 1615 
-, low-temperature, 1611 ff 
magnetic properties, 

-, manganese in, 1568 
-, maraging, 1 607 ff 

martensitic transformation, see “Martensitic 
transformation” 
mechanical properties, 1589 ff 
microstructure, 1573, 1575, 1577, 1600 

- (for) nuclear applications, 1613 ff 

— : for fusion reactors, 1614 
pearlite, 1564, 1570 ff 
prior austenite grain boundaries, 1582 
-rapidly quenched, 1594, 1 809 ff, 18 14ff 
rvalue, 1596 
recovery, 1587 

rccrystallization, 1587ff, 2470 
-: recrystallization-controlled rolling, 2455 


-, rimming, 1615 
-: solidification, 1615 ff 
-: solid-solution hardening, 1593 ff 
-: solute partititioning, 1456 
-: stabilization of austenite, 1575 
-, stainless, 1611 
— , fatigue, 2353 
-: strain-aging, 1596 ff 

- strength ranges in different types of steel, 1591 
-, structural, properties of, 1594 ff 

-, super-clean, 1613 

-: tempered martensite embrittlement, 1582 ff 
-: temper embrittlement, 1270ff, 1281, 1612, 
2285 ff 

-: thermo-mechanical treatment, 1609 ff 
-: tool steels, 1610, 1794, 1815 
-: transformation diagrams, 1577 ff 
-: transformation reactions, 1570 ff 
-, transformer, see “Iron-silicon” 

-, ultra-high-strength, 1607 ff 
-, ultra-low-carbon, see “interstitial-free” 
Stereology, see “Quantitative metallography” 
Stokes-Einstein relationship, 1798 
Stoner criterion, 124 

Strain hardening, 1862, 1913 ff, 2049, 205 {see 
also “Stress-strain curves”) 

- of alloys with small particles, 2115 ff 

- of metal-matrix composites, 2592 ff 
Strain localization, 1949 

Strain rate, effective, 2003 ff 
Strain softening, 1939 ff 
Stress-corrosion cracking, intergranular, 1276 ff 
Stress relaxation 

-: dispersed-phase and precipitation- 
strengthened alloys, 2126 ff, 2179 ff 

- in metal-matrix composites, 2594 

- in polymers, 2728 

-, used to determine activation volumes for 
plastic deformation, 2180 
Stress-strain curves, 1915 ff, 2010ff 
-: bcc crystal, 

-: Considfere’s criterion, 2694 ff 

-: critical (resolved) shear stress, 1885 ff, 1926 

-, cyclic, 2300 ff, 2308 ff 

— compared with monotonic deformation, 
2336 ff 

-: easy glide, 1915 ff 

-, fee crystals (solid solutions), 2011 ff, 

2023 ff 

— : dynamic recovery, 2030 
— : effect of temperature, 2012, 2021 ff 
— : linear hardening, 2029 ff 
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hep metals, 1916 
latent hardening, 1932 
-, metal-matrix composites, 2603 
-, multiphase alloys, 2113 ff 
polycrystals, 1940ff 
— , relation to stress-strain curves of 
single crystals, 1943 ff 
— : Sachs average, 1942 
— : Taylor factor, 1942 ff 
— : Taylor model, 1943 
-, rubbers, 2739 ff 

stage I, 1915 ff, 1926, 2113 
stage II, 1916ff, 1926 ff, 2029 ff, 2113 
stage III, 1916 ff, 1929 ff, 2113 
— , solid solutions, 2012, 2030 
stage IV, 1917 ff, 1930 ff 
stage V, 1917 ff 
-, superalloys, 2146 

theoretical models, 1924 ff 
-, true, 2694 

-, two-phase alloys, 2112 ff , 2127 
Stretcher strains, 1597 
Structure (crystal) 

-, alternative graphical representations, 218 
axial ratio, see “Axial ratio” 

-, binary alloy phases, 102 ff 
-, intermetallic compounds, 206 ff, 2141 ff 

- maps, 1 02 ff , 345 ff 
nomenclature, 13 ff 

-, prediction, 2 

simple metals, 2 ff, 95 ff 
size-factor influence, see “Size factor” 

-, silicon, 99 ff 

- stability 

— , elemental metals, 95 ff, 488 
valence effect, see “Valence compounds” 
Strukturbericht symbols, 226ff 
Subgrain(s) 

- boundaries, see “Creep”, “Dispersed-phase 

alloys” and “Recovery from deformation” 

- coalescence, 2435 ff 
Sulphides at surfaces, 1311 ff, 1318 ff 
Superafloys, 2142 ff 

coalescence of the precipitates, see “rafting” 
-, deformation mechanisms, 2147 ff 
-, dislocations in, 2048 
duplex structures, 2165 
-, grain-size effects, 2 1 68 ff 

— , dependence on y’ fraction, 2172 
micromechanisms of plasticity, 2 1 90 ff 
-, microstructure, 2076 


multiphase precipitation hardening, 2165ff 
persistent slip bands, 
plasticity of the y matrix, 2196ff 
— : dislocations in matrix corridors, 

2200 ff 

rafting, 2157ff, 2182, 220 If 
rapid-solidification processed, 
resistance to coarsening, 878 
-: secondary recrystallization, 

-: single-crystal plasticity at intermediate 
temperatures, 2198ff 
-, stress-strain curves, 2146 
-, temperature dependence of flow stress, 
2147 

Supercooling, see "Undercooling" 
Superdislocation, 2056 ff 
Superelasticity, 1541 
Superlattice (superstructure), 140, 248 ff 
-, long-period, 195ff 
-, semiconducting, 902ff 

- stacking faults, 22088, 2099, 2149ff 

- types, 194ff 
Supermalloy, 2536 
Superparamagnetism, 2513 
Superplasticity, 1997 ff 

-mechanism, 1998 ff 

- of nanocrystalline ceramics, 928 ff 
Supersaturation, 1377 

Surface 

- analysis techniques, 1211 ff 

- concentration, 453 ff 
-diffusion, 626 ff, 977 ff 

— : effect of adsorbed elements, 1254 

- enrichment ratios, measured and predicted, 

1229 

- free energy, 1210 

— as affected by segregation, 1249 ff 
-microchemistry, 1 202 ff 

- premelting, 978 
-protection, 1292 ff 

- segregation, 454, 1202 ff, 1225 ff 

— : correlation with grain-boundary 
segregation, 1240ff 

- structure, 626 ff, 977 ff 

— by X- ray diffuse scattering, 1139 

- tension, 456 
thermodynamics, 453 ff 

Surfaces, rapid solidification processing, 1759 ff 
Synchro-shear, 2151 

Synchrotron radiation (X-ray) sources, 1121, 
1123, 1169 ff 



S36 


Subject index 


Tantalum-rhenium alloys, 2035 
Taylor factor, 1942, 2345 ff 
Taylor lattice, 23 14 ff, 2317 
TD (thoria-disperse) nickel, 2109, 2139 ff, 

2160 

Tellurium 

- crystal structures, 26 ff, 38 
Temper-brittleness, 1270ff, 1281, 1612 
Tempering of martensite, 1579 ff 

Tensile deformation, see “Plastic deformation” 
Ternary composition triangle, 503 ff 
Textures (see also “Mesotextures”) 

-, annealing, see “Annealing textures” 
casting, 784 

-, deformation, 1943 ff, 2455, 2459 

orientation functions (polymers), 2680 ff 
orientation distribution functions, 2456 

- (of) polymers, 2676 ff 

- (and) rvalue, 1596 

secondary recrystallization, 2486 ff 
-, wire-, in metals, 2675 ff 
Thermal cycling, 884 

Thermal expansion of metal-matrix composites, 
2609 ff 

Thermal gradients, 612 ff, 883 ff 
Thermobalance, recording, 1327 
Thermobaric quenching, 1768 
Thermochemistry, metallurgical, 417 ff 
Thermodynamics, 

ideal behavior, 442 
-, laws of, 414 ff, 419 
-, metallurgical, 414 ff 

- of irreversible processes, 539 
Thermomechanical treatment of steels, 1609 ff 
Thermomigration, 611 ff, 615 
Thixocasting, 829 

Thompson tetrahedron, 1846 
Thomson-Freundlich equation, 460 
Thorium 

- crystal structure, 39 
Threshold stress, 2185 ff 
Tie-line, 473 

Tight-binding approximation, 64, 77 
Tilt boundary, 1078, 2413 

- mobility, 2415 

Time-temperature-transformation diagrams 
steel, 1455 ff 
Tin 

- crystal structures, 25, 35 
unusually large atomic radius, 25 

Titanium 

allotropy, 19 , 24 


Titanium aluminides 

-, dislocation cores in, 2099 ff 
phase equilibria, 2095 ff 
TiAl, mechanical properties, 2093 ff 
TiAL/TijAl two-phase alloys, 2094 ff 
— : two-phase ‘single crystals’, 2096 ff 
7' 0 curves, 686 ff 
Tool steels, see “Steels” 

Topochemical investigative techniques, 989 
Toughness, 2213 , 2238 
Trace elements, 1202ff 
Transformation-toughening, 2286 ff 
Transformations in the solid state (see also 
“Precipitation”) 

-, athermal, 1508 

charge-density waves, 1548ff 
-, continuous, 1451, 1480ff 
-, (of) highly defective phases, 868 
diffusive, 1364ff 
-, discontinuous, 1451, 1456ff 
— : lamellar spacing in, 1460ff 
-, displacive, 1364, 1532 (see also 
“Martensitic transformation” 

— , diffusional-displacive, 1545 
driving forces, 1365ff 
-, eutectoidal, 1451 ff 

experimental techniques, 1 372 ff 
-, first-order, 1371 

- growth, see “Growth” 

-, higher-order, 1371 

incommensurate, 1549 ff 

- involving long-range diffusion, 1400 ff, 1418 ff 
irradiation-induced, 643, 1709 

-, martensitic, see “Martensitic transformation” 
-, massive, see Massive transformation” 

-, nondiffiisive, see “Martensitic 
transformation" 

-, microstructural, 866 ff 
nucleation, see “Nucleation” 

(associated) plasticity, 1880 ff 
problems, outstanding, 1495 ff 
precursor phenomena, 1 140 ff 
recrystallization reactions, 1379 
-, reconstructive, 1364, 1532 
-, spinodal, see “Spinodal” 

- (in) steels, 1570 ff 

-, thermoelastic, 1527 ff 

- toughening, 2286 ff 
Transformation-induced plasticity, 1536 ff 
Transformation diagrams, 1577 ff 

-, continuous-cooling, 1578 
-, isothermal, 1578 ff 
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Transformer steel, see “iron-silicon” 

Transition bands, 2432 ff ( see also “Deformation 
bands”) 

Transition metals 

atomic sphere approximation, 79 
-, atomic radii and volumes, 15 ff, 18, 94 
band structure, 77 ff 
bulk properties, theory, 90 ff 
cohesive energy, 93 ff 
crystal structures, 1 8 ff, 99 ff 
energy levels, 55 

heats of formation, calculation, 112 
hybrid bands, 82 ff 

-, intermediate phases based on, 1 78 ff 
-, lattice spacings in solid solutions, 184 ff 
magnetic properties, theory, 122 ff 
solid solutions based on, 154 
-, valence states of, 149 
Wigner-Seitz radius, 94 
Transmission electron microscopy, 1034 ff 
-, analytical, 1086ff 

— : beam-spreading, 1090 
— : electron energy loss spectrometry, 
1087, 1091 ff, 1217 ff 
— : error correction, 1090 ff 

thin-film approximation, 1089 ff 
— : spatially resolved valence electron 
energy loss spectrometry, 1112 
bend contours, 1049 
bright-field image, 1036 
charge-coupled device cameras, 1111 
convergent-beam diffraction, 1040ff, 1111 
dark-field image, 1036 
diffraction contrast, theory, 1042 ff 
dislocations 

— imaging, 1056 ff 

— -: Burgers vector, determination, 106 1 ff 
— : dipoles, 1059 ff 

— : dislocation density, determination, 1062 
dislocation distribution, 1921 ff 

— (of) dislocations in fee alloys, 2014 ff 
— : dislocation-particle interaction, 2047 ff 
— : g.b product, 1057ff, 1063 

— : loops, 1063 ff 
— : strain contrast, 1054 ff 
— : superdislocations, 1110 
double diffraction, 1038 ff 
.dynamical diffraction theory, 1044 ff, 1052 ff 
— : absorption, normal and anomalous, 
1046 ff 

— : column approximation, 1050 
— : image intensities, 1047 ff 


— : PendellOsung, 1047 
— : thickness contours, 1047 ff 
electron energy loss spectrometry, see 
“analytical” 
excitation error, 1044 
extinction length, 1044 
-: field-emission guns, 1035 
-: foil thickness measurement, 1042, 1090 
-: Frauenhofer diffraction, 1096 
grain-boundary images, 1075 ff 
-, high-resolution, see “High-resolution 
electron microscopy” 
imperfect crystals, diffraction, 1050ff 
instrumentation advances, 1110 
-: interface, heterophase, imaging, 1078 ff 
-: interface, translational (faults, antiphase 
boundaries), 1072ff 
Kikuchi lines, 969, 1040 
kinematical diffraction amplitude, 1099ff 
kinematical diffraction theory, 1051, 1094ff 
-: Moird patterns, 1042 

ordered crystal patterns, 1039ff 

- (applied to) phase transformations, 1373 
-: precipitates, imaging, 1067ff 

— : black-and-white contrast, 1068 
— : coffee-bean contrast, 1068 
— : matrix contrast, 1068 
— : structure-factor contrast, 1069 
-: resolution, 1034ff 
-: scanning (STEM) mode, 1037, 1217 
-: strain contrast, 1042, 1054 
-: strong-beam image, 1054 
-: void imaging, 1066 
-: weak-beam image, 1044 
Tresca criterion, 2698 
Triple point, 424, 450 
Trouton’s rule, 420 
TTT diagram, see “Time-temperature- 
transformation diagrams” 

Twin 

-, annealing, 897 

— (in) bcc metals, 2479 
— , formation, 2477 ff 

- boundary, 1872 
-, mechanical, 

- (in) ordered alloys, 2065 ff 
-, transformation-, 1517ff 

Twinning 

-, deformation, 1907ff, 203 Iff 

- (in) ferritic steels, 1587 

— : crystallography, 1911 
— : nucleation, 1910 
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-, multiple, 2439 

- (in) ordered alloys, 2065 ff, 2096 ff, 2151 ff 
polysynthetically twinned crystals, 2096 ff 
recovery-twins, 2438 

Two-phase alloys, see 'Dispersed-phase alloys’ 
and “Hardening, precipitation-” 

Ultimate tensile strength, Undercooling, 694, 
697, 1377 

-, constitutional, 721 

-, formation of metastable phases by, 699 ff, 
1729 ff 

- (in) precipitate growth, 1399 
solidification at high undercoolings, 1756 ff 

Uranium 

- polymorphism, 39 

Vacancy(ies), 1623 ff 

-agglomerates, 1642ff, 1703 
atomic relaxation around, 1624 

- concentration, thermodynamics of, 437 ff 

- concentration, determination of, 1627 ff 
-, constitutional, 1 86 ff , 600 

- (from) dislocation intersection, 1905 
-, di vacancies, 1643 

— , binding enthalpy, calculation, 1627 
-: differential dilatometry, 1627 ff 

electrical resistivity per vacancy, listing, 
1629 ff 

-: enthalpy of formation, 

— , calculation, 1623 ff 
— , experimental determination, 1626 ff 
— , listing, 1629 ff 
-: entropy change, 

— , calculation, 1 623 ff 
— , experimental determination, 1 626 ff 
— , listing, 1629 ff 
(in) grain boundary, 2450 ff, 2632 

- interaction with solute atoms, 1644ff 

- -interstitial interaction, 1651 ff 

— : close pairs, 1653 

- lattice, 894 

- migration 

— , activation enthalpy, 1635 ff, 1639 ff 

- (in) ordered alloys, 1646 ff 
positron-annihilation spectroscopy, 1633 
properties, listing, 1629 ff 

-, quenched-in, 1 1 69 ff 

- relaxation volume, 1625, 1628 

— , listing, 1629 ff 


- solute binding energy, 1644 ff 
-, structural, see “constitutional” 

-, thermal equilibrium, 1623 

trivacancies, 1643 

- wind, 6 1 Off 

Valence compounds, 139, 322 
tetrahedral structures, 324 
Valence states, 62 ff 
Valence (valency) of metals, 148ff 
Vapor pressure and activity, 461 ff 
Vegard’s law (or rule), 164, 330ff 
-, deviations from, 164ff, 330ff 
Vicinal planes, 855 
Virtual adjunct method, 676 
Viscoelasticity of polymers, 2726 ff 
Viscosities of molten metals and alloys, 1743 
Void formation, see “Irradiation” 

Volume size factor, see “Atomic size factor” 

Von Mises yielding criterion, 1946, 2590, 2698 

W agner-Lifshitz-Slyozov theory of Ostwald 
ripening, 873 ff, 1437 ff 
Warren-Cowley parameters, 1 145 ff 
Water, phase diagram, 425 ff 
Wave function, 51 
Welding, 803 ff 

fusion zone, 803 ff 
heat-affected zone, 804 ff 
macro- and microstructure, 807 ff 
solidification rate, 807 ff 
Widmanstatten precipitates, 1389, 1405, 1396, 
1416, 1418, 1431, 1470 
-, coarsening, 1448 ff 

Widmanstatten ferrite, 1571 
— , formation kinetics, 1474 ff 
-, dissolution, 1433 
Wigner-Seitz 
-, cell, 76 

-, theory of bonding, 48 ff, 76 ff, 88 
-, radius. 76, 86 ff, 88 
Work hardening, see “Strain hardening” 

Work softening, 1939 ff 
Wulff construction, 1382 
Wulff plane, 1381 ff 

Wyckoff sequence (for crystal structures), 224 

5^-ray absorption and scattering 

absorption coefficients, 1 1 20 ff 
absorption edge, 1121 
angle of total reflection, 1131 
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Bragg peak broadening, 1 1 32 ff 
Bragg peak intensity (and changes), 1118 , 
1132 ff 

Bragg peak shifts, 1 1 30 ff 
-, Compton scattering, 112 
-detectors, 1130, 1139 
diffuse scattering, 1 1 1 8 ff, 

— between Bragg peaks, 1 139 ff (see also 
“monotonic Laue scattering”) 

— components, 1 148 ff 

— near Bragg peaks, 1 1 34 ff 
-- (due to) point defects, 1664 ff 

extended X-ray absorption fine (EXAFS), 
1183, 1777 
fluorescence, 1121 
Hijnl corrections, 1121 
Huang scattering, 1 1 35 ff , 1147, 1665, 

1673 ff 

-inelastic scattering, 1 126 ff 
— , coherent, 1126 
— , incoherent, 1127 ff 

-: isomorphous and isotopic substitution, 1771 ff 
line broadening due to plastic deformation, 
1924 

pair distribution function, 1769 ff 
-: phonon role in inelastic scattering, 1126 

- (from) point-defect clusters, 1 136 ff 

-: peak shifts due to plastic deformation, 1924 
radial distribution function, 1770 
scattering, 1 1 1 6 ff 

-: single-particle scattering function, 
size-effect scattering, 1 1 32 ff 
small-angle scattering, 1 1 6 1 ff (see also 
“Small-angle scattering...”) 
spurious radiation, 
surface sensitivity, 1131 ff 
thermal diffuse scattering, see “diffuse 
scattering” 

X-ray photoelectron spectroscopy, 1213 ff 
X-ray sources, 1121, 1128ff 
-: X-ray topography, 988 

Zwischenreflex scattering, 1139 

Y ield anomaly, see “Anomalous flow behavior 
in LI, phases” 

Yield, discontinuous (yield phenomenon) 

- in fee solid solutions, 2028 ff 

- in lithium fluoride, 1938 ff 

-in non-ferrous metals, 1939, 1941 

- in polymers, 2695 ff 


- in semiconductors, 1938 

- in steels, 1585 ff, 1596 ff, 1869, 1938, 

2053 ff 

- (due to) strain softening, 1939 ff 
Yield stress 

- (in terms of) continuum mechanics (yield 

criteria), 1 946 ff 
— : Mohr diagram, 2129 ff 
— : (for) polymers, 2698 ff, 2709 
— : von Mises condition, 1946, 2590, 
2698 

— : Tresca criterion, 2698 
critical resolved shear stress for glide, 

1885 ff 

-, dependence on cell (subgrain) size, 1930ff, 
1981, 1984, 2416 

dependence on dislocation density, 1925 
-, dependence on grain size, 2168 ff (see also 
“Hall-Petch relationship”) 

-, dependence on mesh length, 1923 
dependence on order, 2059 ff 
-, Fleischer-Friedel theory, 1903 
forest dislocation cutting, 1903 ff 
-, Labusch, 2020 

-, mechanisms determining, 1894 ff 
— , extrinsic, 1 896 ff 
— , intrinsic, 1894 ff 
-, metal-matrix composites, 2584 ff 
particle resistance, 1897 ff 

- of polymers, 2693 

-: solute resistance, 1896 

superposition of different resistances to 
plastic deformation, 1905 ff 
threshold stress, 2185 ff 
Young’s modulus, 1880 

Zener relationship, 1009, 2159, 2467 ff. 2642 

Zener relaxation, 567 ff 

Zero creep technique, 1210, 2630 

Zinc, recovery from deformation, 2401 ff 

Zinc group metals 

- crystal structures, 21 ff 
Zintl phases, 225 ff 
Zirconium 

-, purification by electromigration, 619 
fast diffusion, 595 
allotropy, 20, 24 

Zone-hardened Al-Cu alloys, 2051 
Zone-melting (zone-refining), 719ff 
Zone-refined iron, 1588 










Colour picture on front cover: 

Simulation of an alloy dendrite growing into a supercooled 
liquid using the phase-field method. The colours show 
variation of composition (atomic fraction Cu) in the liquid 
and solid for parameters approximating a Ni-Cu alloy 
with 0.41 atomic fraction Cu. See ch. 8, par. 7.5 
(Courtesy of William ). Boettinger and James A. Warren). 
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